Does anyone else have experience with koboldcpp? How do I make it give me longer outputs?

PenisWenisGenius@lemmynsfw.com · 5 months ago

Does anyone else have experience with koboldcpp? How do I make it give me longer outputs?

tal@lemmy.today · 5 months ago

Is max tokens different from context size?

No. Same thing. If you hover over the question mark by “Max Tokens” in the Kobold AI Web UI:

“Max number of tokens of context to submit to the AI for sampling. Make sure this is higher than Amount to Generate. Higher values increase VRAM/RAM usage.”