Hacker News .hnnew | past | comments | ask | show | jobs | submitlogin

What's really needed is a way to easily prune context. If I could go and manually manage the entire chat with a model, I could squeeze way more juice out of a typical ~200k token coding session.

Instead I have a good instance going, but the model fumbles for 20k tokens and then that session heavily rotted. Let me cut it out!



Even just a rollback to previous checkpoint would be killer frsture


Zed's agent mode lets you do this, don't know about others


that's a standard feature in cursor, windsurf, etc.


Local LLMs let you edit the context however you want, including the responses generated by the LLM so it will later think it said what you want it to say which can help put it on the right track.

LLMs-as-a-service don't offer this because it makes it trivial to bypass their censoring.


I've heard it repeated so many times that once things start to go sideways, trying to get back on track is a mistake. Have you had real-world success hacking context using rewritten responses?


I have experimented with "hey claude i am about to reset your context, please give me a prompt that will allow you to continue your work" and then reviewing that and tweaking it before feeding it back in.


/compress is the command to do this in most cli agents


That will reduce the context to a summary, not prune a bunch of irrelevant stuff




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: