The pro-LLM rant is weird. LLMs "hallucinate," producing detailed, elaborate lies, and the frontier models still do this egregiously. An LLM-written article has zero value by default, since every single line could be true or could be a convincingly crafted lie; every line has to be fact-checked.
I'm using Gemini 3.1 Pro to help me research my thesis. Even with search enabled and in pro mode, it invents entire papers that don't exist and lies about the contents of existing papers to relate them to the context or to appease me. If I submitted an LLM-written article based on the results it's given me, 80% of the article would be lies.
Commenting to complain that the article is LLM-written is helpful too, since some people aren't able to tell the difference.
> An LLM-written article has zero value by default, since every single line could be true or could be a convincingly crafted lie; every line has to be fact-checked
The exact same thing is true of human speech. You have no idea whether anything a human says is true until you fact-check it. But you don't fact-check everything every person says, do you?
So what do you do instead? You use heuristics: simple, and quite flawed, subconscious rules that let you stop worrying about things. You find a person you like, you classify them as "trustworthy", and you believe almost all of what they say, without considering that any of it might be false. But of course humans are fallible; many of them receive "poisoned" input, and they even hallucinate (make up information). They then spread that false information around. Yes, even the people you trust.
And when you're faced with something untrue, said by someone you trust, you rationalize it. "Oh, they just made a mistake." And you completely ignore that the person you trust told you a falsehood. Life is hard enough without having to question if everything we hear is false. So we just accept falsehoods from some people, and not others.
LLMs are likely more factual and knowledgeable today than humans are, thanks to constant improvement via reinforcement learning. They're going to keep getting better, too. But they'll never be perfect. Rather than rejecting everything they produce, my suggestion is to do what you do with humans: trust them a little, verify the big things, let the little things go, accept that there will be errors, and move on with life.
If you are asking an LLM to cite its sources, you are wasting your time and degrading the quality of the response. LLMs have no inherent mechanism for "knowledge source tracking", because that isn't at all how they work. We're trying to get there with agentic stacks, but it's still too new.
For sparse-knowledge tasks, where you know the model can't possibly have much training data because even humans don't have much knowledge there, use it as a brainstorming partner, not as a source. Or put relevant papers in its context to help you evaluate those papers in relation to your work, as in the sketch below. Otherwise it's just going to hurt itself in confusion trying to tie fuzzy ideas to sparse sources buried in pages upon pages of mildly related Google search results.
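To be concrete about "put relevant papers in its context": the idea is to paste the full text of papers you've already vetted into the prompt and constrain the model to them, rather than asking it to recall or cite anything from memory. A minimal sketch, assuming the OpenAI Python client; the model name, file paths, and prompt wording are all placeholders, not recommendations:

```python
# Context-stuffing sketch: feed vetted papers directly into the prompt
# instead of relying on the model's (unreliable) memory of the literature.
from pathlib import Path
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Load papers you have already read and verified yourself.
papers = {p.name: p.read_text() for p in Path("papers/").glob("*.txt")}
context = "\n\n".join(f"=== {name} ===\n{text}" for name, text in papers.items())

response = client.chat.completions.create(
    model="gpt-4o",  # illustrative model name
    messages=[
        {"role": "system",
         "content": "Answer only from the papers provided below. "
                    "If the papers do not support a claim, say so."},
        {"role": "user",
         "content": f"{context}\n\nHow do these papers relate to my topic?"},
    ],
)
print(response.choices[0].message.content)
```

The point of the system prompt is to push the model toward "I can't find that in the provided papers" instead of a confident fabrication; it's not bulletproof, but it beats asking for citations from thin air.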
If they can't distinguish LLM text, then why should they care?
Anti-AI people like to bring up hallucination as if everything AI generates is false.
I can write pages of text with my own content and then use AI to improve my writing and clarity. Then I review and edit. It might have some LLM markers in there, which I sometimes remove because they're distracting. But the final, AI-assisted writing is easier to read and better organized, and all the ideas are mine. Hallucinations are not remotely a problem in this case.
If it's used to create a false narrative (like a deep fake), sure, you should care. But if it's used as an alternative to a stock photo, or as an easy way to make an infographic then no, I don't think you should care.
Why should I care? The world is full of false narratives.
How can I have the bandwidth to care about everything all of the time?
I swear that more than half of the complaining I find here comes from privileged people bikeshedding over inane topics, people who have never had to worry about serious survival-level issues (how am I going to eat today?) in their lives.
No, you're being weird (and why are you calling people weird anyway? It's not helpful).
You're complaining about facts that have been true since words were first written on paper. If you read the article with the same criticality you'd read any other article, you won't have the problem you're complaining about.
The reality is, you're only complaining because you hate AI. Cool, but don't dress it up and resort to name-calling to browbeat the other guy.
If I read something and cannot tell that it is AI generated, then there's no problem.
If it has AI tells, then I won't bother to continue reading, because it was either written by an AI or written by someone who can't tell the difference.
That's very funny; my exact reaction is right there in the first comment. I was thinking "that's turquoise", but I do also feel like turquoise is green, the same way you'd call the Copenhagen copper domes green, and the word verdigris comes from "green".
The AI certainly editorialised. I wonder if this is because English isn't their first language and they are compensating with confidence. I've worked with a lot of folks from the Philippines, and the Tagalog/English mix sometimes leads to confidence challenges.
You might be surprised…or you might not. I’ve found it’s a good barometer for whether you actually don’t like AI writing or you just don’t like bad AI writing.
1. This test really has zero to do with what we're talking about. Stylized fiction is a completely separate domain from non-fiction writing of personal anecdotes; there's effectively zero relation between them.
2. I picked the human 5 out of 5. Since it's pointless as a judge of preference, due to 1), I took it as a "spot the AI" test, and it was obvious to me in every instance.
3. Of course we just "don't like bad AI writing". "Good AI writing" would be unnoticeable. This is incredibly rare in the domain we're talking about.
Small, pithy quotes vs dozens of paragraphs are rather different things.
It does not surprise me in the least that a machine can produce excellent small quotes. Markov chains have been producing some fantastic stuff for decades, for example, and they're about as complicated as an abacus. https://thedoomthatcametopuppet.tumblr.com/
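For a sense of just how simple that machinery is, here's a toy word-level Markov chain in Python; the corpus, order, and output length are arbitrary placeholders. Everything these generators do reduces to "pick a random word that followed the current word in the training text":

```python
import random
from collections import defaultdict

def build_chain(text, order=1):
    """Map each `order`-word prefix to the words that follow it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        chain[tuple(words[i:i + order])].append(words[i + order])
    return chain

def generate(chain, order=1, length=20):
    """Random-walk the chain to produce a quote-sized snippet."""
    out = list(random.choice(list(chain.keys())))
    for _ in range(length):
        followers = chain.get(tuple(out[-order:]))
        if not followers:  # dead end: no word ever followed this prefix
            break
        out.append(random.choice(followers))
    return " ".join(out)

corpus = "the doom that came to the puppet came to us all in the end"
print(generate(build_chain(corpus)))
```

Feed it a few novels instead of one sentence and it will happily emit short, quotable fragments; it's only over longer spans that the lack of any global coherence shows.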
It seems I chose AI 5 times out of 5. I'm not a native speaker, so I might have preferred a slightly more straightforward text.
On one hand, I think this suffers a lot from selection bias: short AI snippets specifically chosen by humans for their quality, which don't necessarily reflect the average experience of AI text. On the other hand, AI-generated text does not preclude human editing.
This is like the Coke vs. Pepsi tests, where people prefer Pepsi when given a small amount but prefer Coke in larger amounts. Short snippets aren't a good test of anything useful.
I got 4 out of 5 human. On #3 I chose AI, and it was very close.
I noticed something: humans will use words precisely and loosely at the same time. AI will seem like it's precise, but a lot of the wording it uses can be cut or replaced by something else without losing much meaning.
A few paragraphs isn't writing, it's a snippet. The shorter something is, the better AI will be at mimicking it, because underlying flaws are less likely to be made apparent.
Music is another great example of this. I enjoy techno/trance type stuff, but YouTube is becoming borderline unusable for the genre due to AI slop. You'd think AI would do a good job of producing tracks here, since the genre is certainly somewhat formulaic. And about 2 minutes into a lengthy track I'd probably do a fairly mediocre job of telling whether it was human or AI, but by about 10 minutes in it's often painfully obvious. I run this experiment regularly, as I find myself having to skip the AI slop that YouTube seems obsessed with recommending anyway.
Ironically, AI is probably providing a boon to human DJs here, because actively seeking them out is one of the only ways to escape YouTube's sloparithm.
I preferred the AI 4 out of 5 times. That's a little confronting. And judging by the amount of cope in the comments section, others found the same. I guess it is a small test, but I think it successfully makes its point.
At this point, I assume most LinkedIn users use AI to help generate posts anyway, so the distinction becomes kind of pointless. Nobody likes reading AI-generated posts, and nobody ever really liked reading LinkedIn posts either.
At least you aren't wasting time writing something by hand that people don't like reading. I just assume AI is trained on executive communication, which is why it sounds like a CEO.
Overhired has nothing to do with the talent pool; it just means they hired more people than they actually needed or wanted. If the talent pool is large enough, then everyone can overhire.
Of course the judgement about whether people overhired is subjective on a per-company basis. My point is, you can't hire people who don't exist, and the ways to get people into the industry are limited. All other things equal, we would expect massive overhiring to be matched with very low unemployment in the industry, and the correction should not go below some baseline.
A new user is much more likely to scan the codebase and report vulnerabilities so they can be fixed than to illegally exploit them, since most people aren't criminals.
This article is LLM-written and unsourced, so the entire thing could be a hallucination and there's no reason to trust any claim it makes. Don't even read it.
> Another solution is to make software makers responsible and liable for the output of their products. It's long been a problem that there is little legal responsibility, but we shouldn't just accept it. If Ford makes exploding cars, they are liable. If OpenAI makes software that endangers people, it should be the same.
That kind of thinking is exactly why LLMs are so censored: people think OAI should be liable if someone uses ChatGPT to commit cybercrimes.
How about this: cybercrimes are already illegal, so we just punish whoever uses the new tools to commit crimes instead of holding the toolmaker liable.
This gets complicated if LLMs enable children to commit complex crimes, but that's different from outright restricting the tool for everyone because someone might misuse it.
There's always some wedge issue that means "don't punish the toolmaker" is not politically viable. You can pick anything from guns to legal drugs to illegal drugs to all kinds of emotive things.
And once the wedge is in and the concept of maker responsibility is planted, it expands to people's pet issues, obviously.
The actual line of who gets punished just ends up at some equilibrium in the middle. Largely arbitrarily.
I think the classic one is pedophiles and protecting children.
If someone uses ChatGPT to create child porn or worse, to get help tracking down and meeting children, there is NO way in hell the public will accept "don't punish the toolmaker" as a principle.
"It's just a neutral tool" gets a lot harder to claim once a vendor starts specifically training and marketing the model for its ability to bypass security controls.
Yes, pentesting tools, even automated ones, are often legal. But they commonly do run up against legal restrictions and risks. They're marketed very differently from ChatGPT.
I opt into it on my site. It's just a login option you can ignore if you want to log in another way, but for those who use it, it removes the friction of writing out a password and verifying the email.
It can’t just be ignored, it covers content, and if someone accidentally clicks the wrong thing… poof, they now have that site linked to their Google account.
Thanks for sharing! It's not really easily ignored for some people (I ignore it the same way I ignored banner ads in the 00s). I'm curious if you have any metrics on bounce ratios with/without the option. The sentiment here on HN appears to be largely negative but HN does not represent the population at large. I find that many people don't mind or even like a lot of stuff that HN tends to hate.