More

tekacs · 2026-03-13T00:08:43 1773360523

It kinda... does? The problem is that folks have been flailing on the right UX for this.

This is what build vs. plan mode _does_ in OpenCode. OpenAI has taken a different approach in Codex, where Plan mode can perform any actions (it just has an extra plan tool), but in OC in plan mode, IIRC write operations are turned off.

The screenshot shows that the experience had just flipped from Plan to Build mode, which is why the system reminder nudged it into acting!

Now... I forget, but OC may well be flipping automatically when you accept a plan, or letting the model flip it or any other kind of absurdity, but... folks are definitely trying to do the approval split in-harness, they're just failing badly at the UX so far.

And I fully believe that Plan vs. Build is a roundly mediocre UX for this.

beart · 2026-03-13T02:20:31 1773368431

The switch from plan mode to build is not always clearly defined. On a number of occasions, I've been in plan mode and enter a secondary follow up prompt to modify the plan. However, instead of updating the plan, the follow up text is taken as approval to build and it automatically switches to building.

Ask mode, on the other hand, has always explicitly indicated that I need to switch out of ask mode to perform any actions.

This is my experience with Cursor CLI.

evolighting · 2026-03-13T01:01:14 1773363674

Does Codex actually have a Plan Mode, or is there a mode switch I'm missing? I find myself having to manually tell it to 'make a plan' every time.

and if it has directory permissions, sometimes it just skips the confirmation step and starts executing as soon as it thinks the plan is ready.

ianbutler · 2026-03-13T01:22:02 1773364922

cmd-shift-p (at least in vscode)

FergusArgyll · 2026-03-13T01:30:34 1773365434

shift-tab in cli

evolighting · 2026-03-13T02:17:01 1773368221

It actually work, got "Plan mode (shift+tab to cycle)" at corner.

reading the manual , there is Slash commands /plan /plan switch to Plan mode

It seems that, unlike OpenCode, Codex doesn't show a notice for mode by default.

harrall · 2026-03-13T02:56:17 1773370577

This applies well if you’re writing code.

But often I am using Claude to investigate a problem like this “why won’t this mDNS sender work” and it needs a bunch of trial and error steps to find the problem and each subsequent step is a brand new unanticipated command.

ramoz · 2026-03-13T00:18:42 1773361122

The OpenCode plan experience has been pretty bad (the community has accepted this, at least on Discord). The community's adopted a handful of plugins to make the experience better, and also guardrail when the agent switches versus doesn't

tekacs · 2026-03-08T12:44:58 1772973898

I think this means cost above. As in the extra cost you pay.

tekacs · 2026-03-07T16:16:05 1772900165

As a heavy magit user prior to jj, I can attest that I've just felt much less need for it in the wake of jj. Things like JJ's split being interactive and a lot of the commands having really neat short forms has meant that for me, as attached as I was, I still found myself benefiting so much that I switched.

tekacs · 2026-03-02T05:19:00 1772428740

It's worth scrolling down to the current implementation status part:

https://github.com/Dicklesworthstone/frankensqlite#current-i...

Although I will admit that even after reading it, I'm not exactly sure what the current implementation status is.

measurablefunc · 2026-03-02T06:11:57 1772431917

It's fake. It doesn't exist. It never happened. The whole thing is an LLM hallucination. You can notice that it's all half implemented if you read the code: https://github.com/Taufiqkemall2/frankensqlite/blob/main/cra...

tonyedgecombe · 2026-03-02T14:06:17 1772460377

We are going to get overwhelmed with this stuff aren't we.

measurablefunc · 2026-03-02T17:52:51 1772473971

The people who understand basic logic will be fine but I'm starting to think that's a very small group of people.

tekacs · 2026-02-25T09:57:13 1772013433

Honestly... ask an AI agent to update them for you.

They do an excellent job of reading documentation and searching to pick and choose and filter config that you might care about.

After decades of maintaining them myself, this was a huge breath of fresh air for me.

tekacs · 2026-02-24T21:45:12 1771969512

I kind of don't know what to think of startups that keep launching with things like this as their main facility?

Perhaps you can have a moment's attention like that, but... was it not apparent to everyone that this was going to be launched by the lab imminently?

Similarly, there are a bunch of stories about people's startups being killed by relatively trivial feature launches by OpenAI or Anthropic. And I find myself just super confused why people are chasing the super simple intermediary roles and then presenting as surprised.

tekacs · 2026-02-24T21:35:32 1771968932

My guess would be that they're building Apple internal hardware as a precursor? So that Apple can be the test customer?

tekacs · 2026-02-16T09:53:48 1771235628

I think that's reasonable, but then they should have the ability for the agent to, on the next call, override it. Even if it requires the agent to have read the file once or something.

In the absence of that you end up with what several of the harnesses ended up doing, where an agent will use a million tool calls to very slowly read a file in like 200 line chunks. I think they _might_ have fixed it now (or agent-fixes, my agent harness might be fixing it), but Codex used to do this and it made it unbelievably slow.

reactordev · 2026-02-16T15:01:26 1771254086

You’re describing peek.

An agent needs to be able to peek before determining “Can I one shot this or does it need paging?”

tekacs · 2026-02-17T22:54:35 1771368875

Yep, I previously implemented it under that name in my own harness. That being said, there is value in actually performing a normal read, because you do often complete it on that first glance.

reactordev · 2026-02-19T16:17:25 1771517845

Confession, I too implemented a “smart” read. A read unless it’s over a size, then it’s paged, or if it’s a specific format, a summary. However, I also supply `cat`

tekacs · 2026-02-13T06:20:16 1770963616

It seems hard to tell what to think of a company that is simultaneously trying to poison the content that it sends to agents [1] and also doing things like this.

I understand their arguments for it - and completely disagree - so I can't help but think that anyone who is on the pro-AI side of things would do well to steer clear of them if possible.

[1]: https://arstechnica.com/ai/2025/03/cloudflare-turns-ai-again...

tekacs · 2026-02-13T03:04:46 1770951886

I mean, is it possible that they could run the full-size model on it, but doing so on the smaller amount of hardware that they have is a worse trade-off for now, and it's better to run more of the smaller model so that it can actually provide capacity to people?