As feedback to the author, I made the same mistake initially. It was only around halfway through that I realized the voters in question didn't necessarily care what they were voting for in the usual preferential or political sense, only that they were trying to reach any consensus at all.
Looking back at the page again from the top, I see the first paragraph references Paxos, which is a clue to those who know what that is, but I think using "There’s a committee of five members that tries to choose a color for a bike shed" as the example, which is the canonical case for people arguing personal preferences and going to the wall for them at the expense of every other rational consideration, threw me back off the trail. I'd suggest perhaps the sample problem being something as trivial as that in reality, but less pre-loaded with the exact opposite connotation.
Translating from one language to another in the same category, yes. "Category" here being something roughly like "scripting, compiled imperative, functional". However, my experience is that if you want to translate to another category and the target developer has no experience in it, you can expect very bad results. C++ to Haskell is among the most pessimal such translations. You end up with the classic "writing X in Y" problem.
Reacting to the story itself: I've been down the same line of thought but came to the opposite conclusion. Precisely because the generation of the code is unreliable, one of the metrics we will use in the future to determine the value of code is how much it has been tested against the real world. Real-world-tested code will always be more valuable than code freshly instantiated by an AI, and that extends indefinitely into the future, because no AI will ever be able to completely deal with integrating with all the other AI-generated code in the world on the first try. That is, as AIs get better at generating code, we will inevitably generate more code with them, and later code must then deal with that increased amount of code. So the AIs can never "catch up" with code complexity, because the problem gets worse the better they get.
This story is itself the explanation of why we're not going to go this route at scale. It'll happen in isolated places for the indefinite future. But farmers are going to buy systems, generated by AIs or not, that have been field tested, and will be no more interested in calling new untested code into being for their own personal use on their own personal farm than they are today.
The limiting factor for future code won't be how much AI firepower someone has to bring to bear on a problem but how much "real world" there is to test the code against, because there is only going to be so much "real world" to go around.
The answer to a lot of "wow, how did the 8-bit machine pull that off? It seems like that would eat a lot of RAM" is that the framebuffer was the data storage. You were literally looking at the primary data store itself. When a full-resolution framebuffer was a quarter of your addressable RAM (and an even larger fraction of your usable RAM, since you could never quite use all 64KB no matter how you mapped it), you needed to get the most bang for the buck out of that RAM that you could.
Ha, I remember doing this with my Apple //. I forget what I was working on, but I realized that if I could set a pixel and later read back what color was drawn at that location, I could use the screen as a big array. I didn't know about PEEK/POKE yet. One of those core "computers are magic" memories.
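For anyone who never touched one of these machines, here's a minimal sketch of the trick (the dimensions and helper names are invented for illustration, not any real machine's memory map): a flat byte array stands in for screen memory, so "plotting" a value and "reading the color back" are just writes and reads into the same bytes.

```python
# Sketch: on an 8-bit machine, the framebuffer doubled as the primary
# data store. Drawing a "pixel" stores a byte; reading its "color"
# retrieves that byte.

WIDTH, HEIGHT = 40, 24          # a small, text-mode-sized "screen"
framebuffer = bytearray(WIDTH * HEIGHT)

def plot(x, y, value):
    """Draw a 'pixel', which is simultaneously storing a data byte."""
    framebuffer[y * WIDTH + x] = value

def peek(x, y):
    """Read the 'color' back, i.e. retrieve the stored data byte."""
    return framebuffer[y * WIDTH + x]

# Use row 0 of the display as a 40-element array:
for i in range(WIDTH):
    plot(i, 0, i * 3 % 256)

print(peek(10, 0))  # the pixel on screen *is* the data: 30
```

The same bytes serve both purposes, which is exactly why there was no separate "lot of RAM" for the data: you were staring right at it.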
In probably another year or two I expect the metrics will show that it is a positive turn-off. Unfortunately we're on the cutting edge of this particular movement, and there's still a lot more "value" to be "extracted" from the general public before they all get wise to it too.
The next problem we'll face after that, with the AIs a year or two newer than today's, is that the default LLM voice is just a particular affectation created by the training, not "the voice of LLMs" or anything. It's trivial to kick them into a different style. I used AI to write some architecture design documents just this week, and prompted it to first look at about 1-2k words that I wrote myself, all organically, to pick up the style. The good news is the resulting documents almost, but not quite entirely, lack that LLM style. They're still prone to more bullet lists than I use myself; then again, in this context they were fairly appropriate, so I'm not too triggered by the result.
The bad news is, that's all it takes to make AI writing that isn't in that default tone. It's not that hard. Students cheating on essays have already figured it out, the spammers really can't be that far behind. Probably more stuff than we realize is already AI output, it's just the stragglers and those who don't really care (which I imagine is a lot of spammers, after all) who are still failing to tweak the style. They'll catch up as soon as engagement falls off.
I think HN readers are the leading edge of the technology literate. Might take longer than you think for the general public to start noticing "AI voice."
I think you're thinking of marginal costs. Charging only for marginal costs will put you out of business almost immediately; there are plenty of non-marginal costs that need to be covered, which makes it "not close to $0".
If you think I'm talking nonsense, make sure you know what the term actually means: https://www.investopedia.com/terms/m/marginalcostofproductio... There's a common misuse of the term to mean "small, negligible" (unless it has become so common that it's just another definition, if you're a descriptivist), but I'm using it in the real business/accounting sense. Of all industries, tech is among the worst positioned to charge based on marginal costs; our marginal costs are often effectively $0 while the fixed costs behind them run from millions to billions of dollars.
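A toy arithmetic example of the distinction (all numbers are made up for illustration): when fixed costs dominate, as they typically do in software, pricing at marginal cost alone leaves the business deeply underwater.

```python
# Hypothetical software business: near-zero marginal cost, huge fixed cost.
fixed_costs = 10_000_000        # salaries, infrastructure, R&D per year
marginal_cost_per_unit = 0.001  # cost of serving one more copy: ~$0
units_sold = 1_000_000

# Price at marginal cost: revenue covers only the marginal spend,
# so every dollar of fixed cost is a dollar of loss.
revenue_at_marginal = units_sold * marginal_cost_per_unit
total_cost = fixed_costs + units_sold * marginal_cost_per_unit
profit_at_marginal = revenue_at_marginal - total_cost
print(profit_at_marginal)       # -10,000,000

# To break even, the price must also amortize fixed costs across units:
break_even_price = marginal_cost_per_unit + fixed_costs / units_sold
print(break_even_price)         # about $10 per unit, nowhere near $0
```

The marginal cost really is "close to $0" here, and the viable price still isn't, which is the whole point.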
As more and more people use AI to document old systems, even if only to get a personal foothold in them with no intention of sharing the result, here's a related hint. By default, if you fire an AI at a code base, at least in my experience you get the usual documentation you'd expect from such a system: this is the list of "key modules", this module does this, this module does that, this module does the other thing.
This is the worst sort of documentation; technically true but quite unenlightening. It is, in the parlance of the Fred Brooks quote mentioned in a sibling comment, neither the "flowchart" nor the "tables"; it is simply a brute enumeration of code.
To which the fix is: ask for the right thing. Ask it to analyze the key data structures (the tables) and trace the flow through the program (the flowchart). It'll do it, no problem. It might be inaccurate, as is a hazard with all documentation, but it makes as good an attempt at that style of documentation as "conventional" documentation does.
Honestly one of the biggest problems I have with AI coding and documentation is just that the training set is filled to the brim with mediocrity and the defaults are inferior like this on numerous fronts. Also relevant to this conversation is that AI tends to code the same way it documents and it won't have either clear flow charts or tables unless you carefully prompt for them. It's pretty good at doing it when you ask, but if you don't ask you're gonna get a mess.
(And I find, at least in my contexts, using opus, you can't seem to prompt it to "use good data structures" in advance, it just writes scripting code like it always does and like that part of the prompt wasn't there. You pretty much have to come back in after its first cut and tell it what data structures to create. Then it's really good at the rest. YMMV, as is the way of AI.)
"However, I'm starting to think that maintainability and readability aren't relevant in this context. We should treat the output like compiled code."
I would like to put my marker out here as vigorously disagreeing with this. I will quote my post [1] again; given that this is the third time I've referred to it via a footnote link, that rather suggests it should be lifted out of the footnote:
"It has been lost in AI money-grabbing frenzy but a few years ago we were talking a lot about AIs being “legible”, that they could explain their actions in human-comprehensible terms. “Running code we can examine” is the highest grade of legibility any AI system has produced to date. We should not give that away.
"We will, of course. The Number Must Go Up. We aren’t very good at this sort of thinking.
"But we shouldn’t."
Do not let go of human-readable code. Had you asked me 20 years ago whether we'd get "unreadable code generation" or "readable code generation" out of AIs, I would have guessed they'd generate completely opaque and unreadable code. Good news: I would have been completely wrong. They in fact produce perfectly readable code. It may be perfectly readable "slop" sometimes, but the slop-ness is a separate issue; even the slop is still perfectly readable. Don't let go of it.
I know that's been dropping my level of interest for hacking consoles farther and farther. Why hack a console when it has almost no exclusives, even fewer of which I personally care about, and having a real computer hooked to a TV is no longer weird or difficult? I could fight to put an emulator on some locked down console or I can just install an emulator for almost everything ever made in like 10 minutes on my Steam Deck, so the choice is pretty obvious.
The approval tree grows logarithmically as the size of the company grows. A startup can win initially because they may have zero or one level to get to production. That's part of how they manage to get inside the OODA loop of much bigger companies.
The flip side of that, and why the software world is not a complex network of millions of tiny startups but in fact has quite a few companies where log(organization) >= 2, is that there are a lot of tasks that are just larger than a startup, and the log of the minimum size organization that can do the job becomes 2 or 3 or 4.
There is certainly at least the possibility that AI can grow those startups even faster, but it also means they'll hit the point of needing more layers more quickly. Since AI can help much, much more with coding than with the other layers (not that it can't help at all, but at the moment I don't think anyone else in the world is getting the advantages from AI that programmers are getting), it may also shrink the amount of time startups can stay in the log(organization)=1 range.
(Pardon the sloppy "log(organization)" notation. It should not be taken too literally.)
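To make the sloppy notation slightly less sloppy, here's roughly what I mean (the span-of-control number is invented for illustration): with a fixed number of direct reports per manager, the approval layers between an engineer and production grow roughly logarithmically with headcount.

```python
import math

SPAN_OF_CONTROL = 8  # assumed direct reports per manager (hypothetical)

def approval_layers(headcount):
    """Rough layers of management for a company of a given size."""
    if headcount <= SPAN_OF_CONTROL:
        return 1  # the log(organization) = 1 startup range
    return math.ceil(math.log(headcount, SPAN_OF_CONTROL))

# Headcount grows 10x per step; layers creep up one at a time.
for n in (5, 50, 500, 5_000, 50_000):
    print(n, approval_layers(n))
```

The point isn't the exact numbers, which depend entirely on the assumed span of control, but the shape: multiplying the company by ten adds roughly one layer to the tree.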