--

It does NOT increase context length.

I believe it's just a matter of saving network transfer between the Ollama server and the client. After all, the final API call to the actual LLM uses the whole text, and the decoding, I believe, happens before Ollama sends the data to the underlying llama.cpp server.

--
