HN2new | past | comments | ask | show | jobs | submitlogin

I am convinced. I've been giving it tasks the past couple hours that Opus 4.1 was failing on and it not only did them but cleaned up the mess Opus made. It's the real deal.


On that same vein, I had just tried Opus 4.1 yesterday, and it succesfully completed tasks that Sonnet 4 and Opus 4 failed at.


When it came out on Tuesday I wanted to throw my laptop out of the window. I don't know what happened but results were total garbage earlier this week. It got better the past couple days but so far with gpt-5 being able to solve problems without as much correction I'm going to use it more.


Interesting, I've had the complete opposite experience. Opus 4.1 feels like a generational improvement compared to GPT-5.


It is funny how it can be like this sometimes. I think a lot depends on coding styles, languages, prompting, etc.


And it's almost 10x cheaper via flex, and in #1 position on lmarena. It's not even close.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: