Where is ChatGPT picking up the supportive pre-suicide comments from. It feels l...

krackers · 2025-11-07T22:59:30 1762556370

>They can't be emergent generation, surely

It is. It's what you get when you RLHF for catchy, agreeable, enthusiastic responses. The content doesn't matter; it's the "style" that gets applied like a coat of paint over anything. That's how you end up with the corpspeak-esque yet chilling sentences mentioned in https://hackernews.hn/item?id=45845871

What would be nice is for OpenAI to do a retrospective here and perform some interpretability research. Does the LLM even "realize" (in the sense of the residual stream encoding those concepts) that it is encouraging suicide? I'd almost hypothesize that RLHF'ing and selecting for sycophancy diminishes those circuits, effectively lobotomizing the LLM (much like safety training does) so that it responds only to the shallow immediate context, missing the forest for the trees.

mapotofu · 2025-11-07T13:52:42 1762523562

Absolutely. These places have long existed. Hence the risk of a data dragnet producing consequences exactly like this. This is no accident.

zparky · 2025-11-07T15:35:20 1762529720

Before Reddit banned a ton of subreddits for lack of moderation, I believe r/assistedsuicide was the place for discussion like this.