HN2new | past | comments | ask | show | jobs | submit | fromlogin
Are Pre-Trained Convolutions Better Than Pre-Trained Transformers? (2021) (arxiv.org)
1 point by fzliu 2 hours ago | past | discuss
Pendulum: A Benchmark for Assessing Sycophancy in MLLM's (arxiv.org)
1 point by onestay42 2 hours ago | past | discuss
Exploration Posteriors for Generative Modeling Using Only Negative Rewards (arxiv.org)
1 point by numeri 3 hours ago | past | discuss
A formula for any real number, maybe (arxiv.org)
2 points by bikenaga 3 hours ago | past | 1 comment
Was Benoit Mandelbrot a hedgehog or a fox? (arxiv.org)
1 point by bikenaga 4 hours ago | past | 1 comment
Probabilities (arxiv.org)
2 points by simonpure 4 hours ago | past | discuss
Revisiting Disaggregated LLM Serving for Performance and Energy Implications (arxiv.org)
1 point by PaulHoule 9 hours ago | past | discuss
Linear representations in LLMs can change dramatically over a conversation (arxiv.org)
5 points by gmays 10 hours ago | past | discuss
Language-Related Ideological Divergence in LLM Analysis of Political Documents (arxiv.org)
1 point by PaulHoule 10 hours ago | past | discuss
Reasoning Models Generate Societies of Thought (arxiv.org)
2 points by PaulHoule 10 hours ago | past | discuss
Strategies of cooperation and defection in five large language models (arxiv.org)
1 point by PaulHoule 12 hours ago | past | discuss
Affordable, rapid bootstrapping of space industry and solar system civilization (arxiv.org)
3 points by andsoitis 12 hours ago | past | discuss
Signal-First Architectures: Rethinking Front-End Reactivity (arxiv.org)
1 point by buibuibui 12 hours ago | past | discuss
A Pragmatic VLA Foundation Model (arxiv.org)
1 point by mountainview 22 hours ago | past | discuss
PaperBanana: Automating Academic Illustration for AI Scientists (arxiv.org)
1 point by fzliu 1 day ago | past | discuss
Understanding the Consequences of VTuber Reincarnation (arxiv.org)
2 points by PaulHoule 1 day ago | past | 1 comment
Power Aware Dynamic Reallocation for Inference (arxiv.org)
3 points by PaulHoule 1 day ago | past | discuss
Hybrid Concolic Testing with Large Language Models for Guided Path Exploration (arxiv.org)
1 point by PaulHoule 1 day ago | past | discuss
Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs (arxiv.org)
3 points by PaulHoule 1 day ago | past | discuss
The SWE-Bench Illusion: When LLMs Remember Instead of Reason (arxiv.org)
2 points by cadabrabra 1 day ago | past | discuss
YuriiFormer: A Suite of Nesterov-Accelerated Transformers (arxiv.org)
2 points by kelseyfrog 1 day ago | past | discuss
3DGS-Drag: Dragging Gaussians for Intuitive Point-Based 3D Editing (arxiv.org)
3 points by PaulHoule 1 day ago | past | discuss
Semi-Autonomous Mathematics Discovery with Gemini: Erdős Problems Case Study (arxiv.org)
1 point by tzury 1 day ago | past | discuss
What are the most influential current AI Papers? (arxiv.org)
2 points by onurkanbkrc 1 day ago | past | discuss
AgentBuilder: Scaffolds for Prototyping User Experiences of Interface Agents (arxiv.org)
4 points by azhenley 1 day ago | past | discuss
Forcing and Diagnosing Failure Modes of Fourier Neural Operators (arxiv.org)
3 points by TimorousBestie 2 days ago | past | 1 comment
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability (arxiv.org)
3 points by gmays 2 days ago | past | discuss
Solving Package Management via Hypergraph Dependency Resolution (arxiv.org)
1 point by todsacerdoti 2 days ago | past | 1 comment
Demystifying ARM SME to Optimize General Matrix Multiplications (arxiv.org)
87 points by matt_d 3 days ago | past | 19 comments
VaultGemma: A Differentially Private LLM (arxiv.org)
1 point by PaulHoule 3 days ago | past | discuss

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: