Daniel is a very impressive guy. Well within the realm of “fund the people not the idea” that YC seems to do. Got a few bucks from them and probably earning from collaborations etc. Odds of them not figuring out a business model seem slim.
Companies have no idea what they are doing: they know they need it, they know they want it, engineers want it, and they don’t have it in their ecosystem, so this is a perfect opportunity to come in with a professional-services play. “We’ve got you on inference, training, running your models, all of that; just focus on your business.” Pair that with Hugging Face’s storage and it’s a win/win.
I found that when I have “infinite” tokens my behaviour changes: 3-5 tabs so I’m not waiting, free side quests, whole-codebase review passes, skills that wrap 10 other skills. It’s like going from expensive metered data to an uncapped plan.
I think these token doublings are there to kick you into an abundance mindset (for want of a better term) so that going back feels painful. Stop counting tokens; focus on your project and the cost of your own time.
I have this as a skill Claude created to run the rest. It mentions each skill in turn, see below. It’s not deterministic but it definitely runs each skill and it’s raised a bunch of issues, which I then selectively deal with. Where I can, once an issue is identified, I make deterministic tests.
Text includes:
Invoke each review/audit skill in sequence. Each skill runs its own comprehensive checks and returns findings. Capture the findings from each and incorporate them into the final report.
IMPORTANT: Invoke each skill using the Skill tool. Each skill is independently runnable and will produce its own detailed output. Summarize findings per skill into the unified report format.
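The orchestration the skill describes is essentially a sequential fan-out-and-aggregate loop. A minimal Python sketch of that shape, where `run_skill` is a hypothetical stand-in for the actual Skill tool invocation and the skill names are illustrative, not real:

```python
# Sketch of the orchestrator's logic: invoke each review skill in
# sequence, capture its findings, and merge them into a unified report.
# `run_skill` is a hypothetical placeholder for the Skill tool call.

def run_skill(name):
    # Placeholder: a real run would invoke the named skill and
    # return whatever issues it raised.
    return [f"{name}: example finding"]

# Illustrative skill names; the real list comes from the user's setup.
SKILLS = ["security-review", "dependency-audit", "style-review"]

def unified_report(skills):
    report = {}
    for skill in skills:
        report[skill] = run_skill(skill)  # capture findings per skill
    return report

report = unified_report(SKILLS)
for skill, findings in report.items():
    print(f"## {skill}")
    for finding in findings:
        print(f"- {finding}")
```

The non-determinism lives entirely inside each skill run; the loop itself is deterministic, which is also what makes it easy to later replace any individual finding with a hard-coded test.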
And China is likely to do to Tesla robots what they’ve done to the cars. I assume the bans will be incoming, because the US can’t have millions of Chinese kung fu robots sitting about pouring tea, waiting for critical mass.
Every so often my YouTube logs out and I’m exposed to the view a “random visitor” would see. Instantly visible because it’s filled with stupid content and sexual provocation.
I manage the shit out of FB and YouTube. You need to block a few things so it stops testing audience-segment ideas on you.
Got a friend who is in the high frequency trading industry and uses both Java and C#. I asked about GC. Turns out you just write code that doesn’t need to GC. Object pools, off-heap memory etc.
It won’t do the absolute fastest tasks in the stack quite as well but supposedly the coding speed and memory management benefits are more important, and there’s no GC so it’s reliable.
> Turns out you just write code that doesn’t need to GC. Object pools, off-heap memory etc.
Some GCd languages make this easier than others. Java and C# allow you to use primitive types. Even just doing some basic arithmetic in Python (at least CPython) is liable to create temporary objects; locals don't get stack-allocated.
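The thread is about Java and C#, but the object-pool idea translates to any GC'd language: allocate your objects once up front, then reuse them, so the hot path never creates garbage for the collector to chase. A minimal Python sketch (class and field names are my own, purely illustrative):

```python
# Object-pool sketch: pre-allocate message objects once, then reuse
# them instead of allocating (and later collecting) one per event.
class Message:
    __slots__ = ("price", "qty")  # fixed layout, no per-instance dict

    def __init__(self):
        self.price = 0.0
        self.qty = 0

class MessagePool:
    def __init__(self, size):
        # All allocation happens here, at startup.
        self._free = [Message() for _ in range(size)]

    def acquire(self):
        return self._free.pop()   # hand out an existing object

    def release(self, msg):
        self._free.append(msg)    # return it for reuse, never garbage

pool = MessagePool(1024)
m = pool.acquire()
m.price, m.qty = 101.5, 10
# ... process the message on the hot path, zero allocations ...
pool.release(m)
```

In Java and C# the same pattern pairs with primitive fields and off-heap buffers (e.g. `ByteBuffer.allocateDirect` in Java) so the latency-critical path never triggers a collection at all.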
Dictionaries are not a great analogy, because the standout feature of LLMs is that their output can change based on the context provided by individual users.
Differences between dictionaries are decided by the authors and publishers of the dictionaries without taking individual user queries into account.
Totally. Surely IDEs like Antigravity are meant to give the LLM more tools to use, e.g. for refactoring or dependency management? I haven’t used it, but it seems a quick win to move from token generation to deterministic tool use.
As if. I’ve had Gemini stuck on AG because it couldn’t figure out how to use only one version of React. I managed to detect that the build failed because two versions of React were being used, but it kept saying “I’ll remove React version N” and then proceeding to add a new dependency on the latest version. Loops and loops of this. On a similar note, AG really wants to parse code with weird grep commands that don’t make any sense given the directory context.
It’s a $90k engineer that sometimes acts like a vandal, who never has thoughts like “this seems to be a bad way to go. Let me ask the boss” or “you know, I was thinking. Shouldn’t we try to extract this code into a reusable component?” The worst developers I’ve worked with have better instincts for what’s valuable. I wish it would stop with “the simplest way to resolve this is X little shortcut” -> boom.
It basically stumbles around generating tokens within the bounds (usually) of your prompt, and rarely stops to think. Goal is token generation, baby. Not careful evaluation. I have to keep forcing it to stop creating magic inline strings and rather use constants or config, even though those instructions are all over my Claude.md and I’m using the top model. It loves to take shortcuts that save GPU but cost me time and money to wrestle back to rational. “These issues weren’t created by me in this chat right now so I’ll ignore them and ship it.” No, fix all the bugs. That’s the job.
Still, I love it. I can hand code the bits I want to, let it fly with the bits I don’t. I can try something new in a separate CLI tab while others are spinning. Cost to experiment drops massively.
Claude Code has those "thoughts" you say it never has. In plan mode, it isn't uncommon for it to ask: do you want to do this the quick and simple way, or would you prefer to "extract this code into a reusable component"? It also will back out and say "Actually, this is getting messy, 'boss', what do you think?"
I could just be lucky that I work in a field with a thorough specification and numerous reference implementations.
I agree that Claude does this stuff. I also think the Chinese menus of options it provides are weak in their imagination. For thoroughly specified problem spaces with reference implementations you're in good shape, but if you want to come up with a novel system, experience is required; otherwise you will end up in design hell. The danger is in juniors thinking the Chinese menu of options provided are "good" options in the first place. Simply because they are coherent does not mean they are good, and the "a little of this, a little of that" game of tradeoffs during design is lost.
I recently asked Claude to make some kind of simple data structure and it responded with something like "You already have an abstraction very similar to this in SourceCodeAbc.cpp line 123. It would be trivial to refactor this class to be more generic. Should I?" I was pretty blown away. It was like a first glimpse of an LLM play-acting as someone more senior and thoughtful than the usual "cocaine-fueled intern."
Great. There’s no reason why all countries don’t start preferring locally or regionally developed software. Of course interoperability is always a thing but there needs to be another option between “one company” and “everyone host your own instance”.
https://www.ycombinator.com/companies/unsloth-ai