Hacker News .hnnew | past | comments | ask | show | jobs | submit | future-shock-ai's submissionslogin
1.From 300KB to 69KB per Token: How LLM Architectures Solve the KV Cache Problem (future-shock.ai)
157 points by future-shock-ai 64 days ago | past | 10 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: