Hacker News .hnnew | past | comments | ask | show | jobs | submit | kansface's commentslogin

Yes, exactly this. If I didn't care about price at all, I'd exclusively use this model. It functions more like an actual engineer. I'm in the midst of a DB migration, and eg 5.5 continually suggests stuff like "use DB X instead of DB Y for task Z because its 30% faster" which is an impossibility of reality, given we are migrating DBs. Fable jumped in, reduced allocs by literally 46x, found multiple bugs 4.8 and 5.5 created (max file system usage, correctness issues, etc), and continually suggested awesome improvements unprompted. As in, it would finish a task and then suggest we tackle this other existing problem I didn't know about in a very specific manner... this is the first model that feels like its coming for my job.

I'm having the same experience. I'm in the process of implementing a new CRDT for realtime collaborative editing. There just aren't a lot of implementations of CRDTs kicking around online for opus or any of the other models to have good design instincts.

Fable is doing - so far - a great job. I just had one big question around how part of it should work. I had a design sketch, but with some big unknowns. I asked fable to figure it out via reasoning and prototyping, and it did - it even, under its own initiative, wrote a fuzzer for its prototype which explored and verified that its reasoning was correct. It absolutely nailed it. And it found, and fixed, a couple bugs that I'd missed.

I'm sure its weaknesses will become apparent in time. But, wow this thing is a beast. Its the first time I'm reading the work of an LLM without spotting obvious weaknesses in its reasoning and code. I'm really impressed.


I was about to ask where you work that you’re implementing new CRDTs and then I noticed your username! Thanks for all that you do!

I work on the live collab at my company, and using AI while coding has into recently sort of “clicked” for me. We use an (I’m pretty sure) unheard of algorithm for collaborative editing, and I’ve had a long term goal of turning it into an implementation of EG Walker, but our document model is very complex and most out of the box CRDTs don’t quite fit. Maybe Fable will be what gets me over the hump.


Long shot here because I'm not knowledgeable enough about CRDTs but maybe something like DSON would help? I saw a talk about it a while ago and it might be useful.

https://blog.helsing.ai/posts/dson-a-delta-state-crdt-for-re...

https://www.youtube.com/watch?v=4QkLD7JhD_I&pp=ygUJZHNvbiBjc...


Ty, checking this out!

I’d be fascinated to hear more if you’re willing to share. What is special about your document model which makes existing tools like automerge a bad fit?

We have cross-field invariants that merging at the data structure level can't ensure (in an obvious way, at least), and "lose the semantic meaning of a conflict". The main idea behind their approach is that certain parts of the model can have custom "mergers" that are able to run business logic to maintain these invariants.

Worth noting, the decision to eschew CRDTs predates my time here, and I've pushed for a CRDT rewrite quite a bit since I believe it could be done. The other main concern they had was memory usage, but it seems like EG Walker would solve that. Our system uses a "Commit DAG", (an Event DAG by another name), and does a three-way merge using a common ancestor of the diverged documents, and so a lot of the bones of EG Walker are there, and I'm exploring ways in which we could gradually move to it.


Hello joseph,

I saw scanning the comments and saw you mentioned CRDT. Just wanted to mention that I implemented a CRDT-flavoured sync engine for the product I'm working on a while ago, I think it was with Opus 4.6 if I'm not mistaken (or earlier) so it's not something new to Fable 5, just fyi.


Yeah, you've certainly been able to get Opus to write a CRDT. It just needs a lot of hand-holding to make it correct. Opus always seems pretty bad at coming up with invariants and using them to make a piece of software correct. Without invariants, you end up with lots of hacky workarounds to avoidable problems.

So far at least - and its been less than a day - Fable seems better at this.

I think I also do my CRDTs differently from others. I've grown to like the pure-oplog approach after making eg-walker. LLMs are much worse at this!


> wrote a fuzzer for its prototype which explored and verified that its reasoning was correct. It absolutely nailed it.

For such a data structure, "nailing it" means a formal proof of correctness. Fuzzing, as useful as it is, is merely throwing dirt at the wall and seeing if anything sticks.


I’ll ask it for a formal proof when I get home and see how it goes.

I’ve read plenty of papers with “formal proofs of correctness” that turned out to have huge flaws. Machine verifiable proofs I trust. But I’ve personally found more bugs with fuzzing than I have via proofs.


In the real world, many of us don't have the time to create formal proofs. But our instinct in testing where edge cases may exist in code that we wrote is a type of refactoring that happens in our brains during the coding process. Hand the coding off to a machine and you have no idea where to start looking for the flaws.

> Hand the coding off to a machine and you have no idea where to start looking for the flaws.

I have found this quickly becomes false. I have learned I cannot review llm generated code as if it is written by a trusted senior developer (where I often just do a quick look, see nothing obvious and hit approve). Once you start reading the code in depth with the goal of understanding you quickly see the places where flaws are likely. Sure I start with no clue where to look, but it doesn't take long to see things.


> this is the first model that feels like its coming for my job

Damn you must be good, I've been feeling this for around 2 years now


It's been obvious for at least 2 years, anyone who doesn't see the writing on the wall simply hasn't learned how to use these well or has severe exponential blindness.

"But it doesn't do well when writing my undertrained language" - yeah, fine. Yet. Reasonable code in that is probably one RAG + verification scaffold deployment around Mythos or maybe mythos+1. Just like it was for you learning it, because you knew how to _program_.


Yeah I agree. We're headed into a rougher job market pretty much across the board for white collar work , hitting junior people worse at this stage. Up to societies around the world to decide how to deal with this - so far we deal with it by ignoring it it seems.

The monks got mad too when the printing press was invented because it took their jobs of hoarding knowledge.

AI is just another tool, learn to use it.


And then in a couple years the AI gets better at "using AI" than the bottom 99.999% of knowledge workers, who are now out of work.

We are all doomed! Doomed I say!

Gosh, I must be doing something wrong. I spent 15 minutes (of which a lot was waiting while it was thinking about "backwards rationalising" it's decision and "gaslighting"[1]) arguing with it over why it keeps using `node -e "console.log(require('fs').readdirSync('…'))"` instead of `ls -l …`.

Like it did everything:

- this is not a Linux system (true, it was macOS) - it is not an available command - the binary is corrupted - node/js is more precise - V8 JavaScript is faster than bash (true technically??? But not in this context lol) - JavaScript is more versatile

I forgot what else we went through but there were a few more things. I indulged it because it was incredulous and funny. The prompts from my side were all questions, never instructions. I assume an instruction would've helped here, but also I don't think Opus ever did this (but on the other hand Opus wrote python scripts to format/indent, instead of just running cargo fmt, so I guess potato potato)


I will stake the claim, as an engineer never having studied sociology, that in group favoritism is the (only) stable political arrangement by and large… and further, the preservation of any culture necessitates discrimination of some sort.

You've got it backwards. That's a defeatist take that results in the exact kind of misery and cruelty documented in detail throughout history. Society prospers when people look past their differences and work together to improve things. It suffers when demagogues successfully divide the public and exploit the chaos to loot the resources required to improve the lives of everyone. Making punching bags out of a group of people is sure way to create instability.

I don’t understand why this is insane? Why do you expect a single cause?

These are all examples where commenters have confidently said that this specific thing is the cause (independent of any data). It is a little odd on a story about how the actual experts are working hard to understand the (likely multifaceted) causes of this trend that there are oodles of people just declaring "oh it is obviously this."

The number of different reasons that people come up with (and with conviction in many cases) is insane IMO, not any single reply or commenter (I hope).

We'll get around to training job specific models or the equivalent. Thats just lower on the value chain for now.


No one believes or acts like this will be a one time event (on any side of the issue). The history of all new forms of taxation is that eventually it will come for you.


The sovereign wealth fund would be a stakeholder in equities and estates. It would have to exercise voting privileges and be a party to lawsuits. Do you want Trump getting control of the board of eg SpaceX or Meta?


> It would have to exercise voting privileges

There's never a requirement to vote your shares. I've never cast a single shareholder vote in my life. The fund could be legally required to not exercise any votes.

> be a party to lawsuits

Since when are shareholders a party to lawsuits? It's called limited liability for a reason.

> Do you want Trump getting control of the board

I'd normally say "legally structure it so that doesn't happen" and "follow best practices".But laws only mean anything if someone enforces them. If the government doesn't function correctly no government function can work correctly.


A 6% wealth tax indeed taxes more than the expected rate of return on the base assets. That is indeed equivalent to a higher rate than 100% in terms of an income tax. This math is in favor of PG’s argument.


Certainly, but it's about as useful as comparing the range of a radio with that of a car.

We might also compare tax revenue. For the US 1% wealth tax (for the 0.1%) would generate 250ish billion while 1% income tax would also generate 250 b. Then say 50% worth of increased tax evasion. 2% wealth tax is equal to 1% income tax.


Income (returns) are not guaranteed. Go for progressive capital gains if that’s what you want. A wealth tax is a crazy bad idea.


The expected a priori utility of any social intervention is strictly negative… even if “more thoughtful” does check out in reality for higher ed, $700 billion and 15million man years yearly is rather expensive.


3 10mm bolts failing simultaneously after two decades (on direct it seems) is unexpected! If it were an installation problem, I can’t imagine it would take that long and that they’d all go at the same time. Ditto for corrosion… people take victory whips all the time.


All three bolts at critical level. A massive strain happens along. They pop: one, two, three!

It's not that unexpected.


It is absolutely unexpected in climbing. Even a single bolt failing is highly unusual.

Additionally, the rope length extends quite a bit of an anchor fails (and it partially recovers it’s elasticity before the other anchors engage). Later anchors would not get forces exceeding a normal ‘healthy’ bolts limits.

So there is some systemic failure involved in this scenario.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: