building ai character models
reading time: 0.47 mins
published: 2024-07-09
updated: 2024-09-03
training character models that are actually fun and interesting
[PLACEHOLDER]
open questions + notes
How fast do they have to be able to talk/think/respond?
- first reaction?
- first response?
- intermediate thoughts?
How smart do they have to be?
- be helpful?
ICL >>> finetuning
solving long-context + prefix caching
tokenizers
GPTs
- LLaMA 3 series
- Mistral
- RWKV
testing out some katex