| | Opus 4.7 Part 3: Model Welfare (lesswrong.com) |
| 11 points by omer_k 8 hours ago | past | 7 comments |
|
| | Claude Code sometimes hallucinates user messages (lesswrong.com) |
| 2 points by cubefox 2 days ago | past | 2 comments |
|
| | There are only four skills: design, technical, management and physical (lesswrong.com) |
| 3 points by samuel246 3 days ago | past | discuss |
|
| | Summarizing and Reviewing my earliest ML research paper, 7 years later (lesswrong.com) |
| 2 points by joozio 3 days ago | past | discuss |
|
| | Resources for starting and growing an AI safety org (lesswrong.com) |
| 1 point by omer_k 3 days ago | past | discuss |
|
| | Only Law Can Prevent Extinction (lesswrong.com) |
| 3 points by namanyayg 4 days ago | past | 1 comment |
|
| | LLMs will soon disrupt algorithmic media feeds (lesswrong.com) |
| 3 points by linhns 4 days ago | past | discuss |
|
| | Working hurts less than procrastinating, we fear the twinge of starting (2011) (lesswrong.com) |
| 14 points by davikr 4 days ago | past | 2 comments |
|
| | The AlphaFold moment for materials is not any time soon (lesswrong.com) |
| 8 points by gmays 7 days ago | past | discuss |
|
| | Morale (lesswrong.com) |
| 2 points by jger15 7 days ago | past | discuss |
|
| | You're gonna need a bigger benchmark, METR (lesswrong.com) |
| 3 points by frmsaul 9 days ago | past | discuss |
|
| | Hypotheses for Why Models Fail on Long Tasks (lesswrong.com) |
| 1 point by joozio 10 days ago | past | discuss |
|
| | Splitting Mounjaro pens for fun and profit (lesswrong.com) |
| 2 points by henryaj 10 days ago | past | discuss |
|
| | We're running out of benchmarks to upper bound AI capabilities (lesswrong.com) |
| 15 points by gmays 12 days ago | past | 10 comments |
|
| | AIs can now do easy-to-verify SWE tasks, I've shortened timelines (lesswrong.com) |
| 3 points by gmays 12 days ago | past | discuss |
|
| | The effects of caffeine consumption do not decay with a ~5 hour half-life (lesswrong.com) |
| 101 points by swah 12 days ago | past | 105 comments |
|
| | My Picture of the Present in AI (lesswrong.com) |
| 1 point by speckx 12 days ago | past | discuss |
|
| | Most people can't juggle one ball (lesswrong.com) |
| 507 points by surprisetalk 13 days ago | past | 174 comments |
|
| | "Alignment" and "Safety", Part One: What Is "AI Safety"? (lesswrong.com) |
| 1 point by joozio 15 days ago | past |
|
| | Paper Close Reading: "Why Language Models Hallucinate" (lesswrong.com) |
| 2 points by joozio 16 days ago | past |
|
| | Estimates of the expected utility gain of AI Safety Research (lesswrong.com) |
| 1 point by joozio 17 days ago | past |
|
| | What I like about MATS and Research Management (lesswrong.com) |
| 2 points by joozio 17 days ago | past |
|
| | Predicting When RL Training Breaks Chain-of-Thought Monitorability (lesswrong.com) |
| 2 points by gmays 17 days ago | past |
|
| | AI Safety at the Frontier: Paper Highlights of February and March 2026 (lesswrong.com) |
| 2 points by joozio 18 days ago | past |
|
| | How to emotionally grasp the risks of AI Safety (lesswrong.com) |
| 3 points by joozio 18 days ago | past |
|
| | You can't imitation-learn how to continual-learn (lesswrong.com) |
| 2 points by paulpauper 19 days ago | past |
|
| | A Mirror Test for LLMs (lesswrong.com) |
| 2 points by gmays 20 days ago | past |
|
| | I'm Suing Anthropic for Unauthorized Use of My Personality (lesswrong.com) |
| 5 points by usrme 21 days ago | past | 2 comments |
|
| | Why did everything take so long? (lesswrong.com) |
| 2 points by jstanley 21 days ago | past |
|
| | The state of AI safety in four fake graphs (lesswrong.com) |
| 3 points by allenleee 22 days ago | past |
|
|
| More |