earth2mars · 2026-03-05T19:15:00 1772738100

I am actually super impressed with Codex-5.3 extra high reasoning. It's a drop-in replacement (in fact better than Claude Opus 4.6; lately Claude has been super verbose, going in circles trying to get things resolved). I've mostly stopped using Claude and am having a blast with Codex 5.3. Looking forward to 5.4 in Codex.

whynotminot · 2026-03-05T20:42:59 1772743379

I still love Opus but it's just too expensive / eats usage limits.

I've found that 5.3-Codex is mostly Opus quality but cheaper for daily use.

Curious to see if 5.4 will be worth somewhat higher costs, or if I'll stick to 5.3-Codex for the same reasons.

satvikpendem · 2026-03-05T19:31:55 1772739115

Same, it also helps that it's way cheaper than Opus in VSCode Copilot, where OpenAI models are counted as 1x requests while Opus is 3x, for similar performance (no doubt Microsoft is subsidizing OpenAI models due to their partnership).

CryZe · 2026-03-05T21:42:12 1772746932

I've been using both Opus 4.6 and Codex 5.3 in VSCode's Copilot and while Opus is indeed 3x and Codex is 1x, that doesn't seem to matter as Opus is willing to go work in the background for like an hour for 3 credits, whereas Codex asks you whether to continue every few lines of code it changes, quickly eating way more credits than Opus. In fact Opus in Copilot is probably underpriced, as it can definitely work for an hour with just those 12 cents of cost. Which I'm not sure you get anywhere else at such a low price.

Update: I don't know why I can't reply to your reply, so I'll just update this. I have tried many times to give it a big todo list and told it to do it all. But I've never gotten it to actually work on it all and instead after the first task is complete it always asks if it should move onto the next task. In fact, I always tell it not to ask me and yet it still does. So unless I need to do very specific prompt engineering, that does not seem to work for me.

satvikpendem · 2026-03-05T21:45:30 1772747130

That shouldn't really make a difference because you can just prompt Codex to behave the same way, having it load a big list of todo items perhaps from a markdown file and asking it to iterate until it's finished without asking for confirmation, and that'll still cost 1x over Opus' 3x.

braebo · 2026-03-05T21:44:25 1772747065

I struggle to believe this. Codex can’t hold a candle to Claude on any task I’ve given it.

earth2mars · 2026-02-20T21:59:58 1771624798

The only reason I was sticking to Android for years is this. And I think there is no moat for Android. I would rather switch to iOS if both platforms are equally restrictive.

singpolyma3 · 2026-02-21T01:11:17 1771636277

You'll miss having a keyboard that works

cromka · 2026-02-21T02:24:06 1771640646

It'll be sorted in about 9 days.

aryonoco · 2026-02-21T01:19:10 1771636750

I did this last year. Reluctantly. And using iOS still hurts. But it’s better than that Google crap.

I developed my own Android ROMs from 2009-2011, complete with my own tuned kernel. I ran the local Android developers MeetUp group and evangelised Android development. When Honeycomb launched I helped OEMs test their beta firmware. For free.

But as Google has become certified Evil, the direction of Android has been very clear. In practice I honestly can’t say it’s now any more open than iOS. Except it has a lot more avenues for Google to mine your data to sell ads. And the quality of third party apps on it is decidedly worse.

I thought long and hard about getting a Linux phone. But I need a good camera on my phone to take random snaps of kids/pets/etc. And the Linux phones just aren’t there.

I hate the shitty duopoly we have ended up with. But I now realise that the openness of x86 and pc as platform really was an accident of history.

earth2mars · 2026-02-20T15:13:55 1771600435

When you say quorum, what do you mean? Is it like an agent swarm, or do you use all of them in your workflow and independently they perform better than Opus? Curious how you use it (tooling and purpose: coding?).

earth2mars · 2026-02-19T12:11:03 1771503063

Yes, it did well. It also did well on some other word problems. Reasoning seems good. But maybe not a great code model.

earth2mars · 2026-01-16T23:33:49 1768606429

prats226 · 2026-01-17T00:07:49 1768608469

https://nanonets.com/cookbooks/structured-llm-outputs/uncons...

earth2mars · 2026-01-14T19:44:27 1768419867

https://blog.tldrversion.com/ - vibe coded for the most part

earth2mars · 2025-12-22T18:16:58 1766427418

Not a communist, but the Communist Manifesto articulated this problem very well: people end up doing work that does not matter to them because of capitalism. Imagine a world where people do what they are passionate about and do not have to worry about basic means and even some wants (entertainment, comfortable living, etc.). A world of abundance for everyone, where people just do what they are super passionate about. Whether AI will help towards that or not is a big question.

knowitnone3 · 2025-12-22T22:44:40 1766443480

name one pure communist country that has thrived. China has 516 billionaires.

earth2mars · 2025-12-04T13:49:33 1764856173

How is CoreWeave (wand) a competitor? I think they are a partner in infrastructure.

earth2mars · 2025-11-14T19:59:27 1763150367

https://gemini.google.com/share/00967146a995 works perfectly fine with gemini 2.5 pro

lanewinfield · 2025-11-14T20:02:39 1763150559

nice. I restrict to 2000 tokens for mine, how many was that?

esafak · 2025-11-14T21:37:58 1763156278

how do you do that?

agildehaus · 2025-11-15T03:42:29 1763178149

I'm assuming the "Gemini 2.5" referenced on this site is Flash, not Pro. Pro is insane, and 3.0 is just around the corner.

earth2mars · 2025-11-15T00:41:44 1763167304

I used exactly the same prompt this site uses. Nothing else.

earth2mars · 2025-10-23T21:47:47 1761256067

I have a feeling the Trainium thing might get scrapped at some point, as it's probably not worth their effort to try to retrofit. I assume AMD is probably at their doorstep, banging to show better value.