• 30 Posts
  • 69 Comments
Joined 1 year ago
cake
Cake day: June 12th, 2023

help-circle



















  • I use text-generation-webui mostly. If you’re only using GGUF files (llama.cpp), koboldcpp is a really good option

    A lot of it is the automatic prompt formatting, there’s probably like 5-10 specific formats that are used, and using the right one for your model is very important to achieve optimal output. TheBloke usually lists the prompt format in his model card which is handy

    Rope and yarn refer to extending the default context of a model through hacky (but functional) methods and probably deserve their own write up















  • Yeah definitely need to still understand the open source limits, they’re getting pretty dam good at generating code but their comprehension isn’t quite there, I think the ideal is eventually having 2 models, one that determines the problem and what the solution would be, and another that generates the code, so that things like “fix this bug” or more vague questions like “how do I start writing this app” would be more successful