Hacker News .hnnew | past | comments | ask | show | jobs | submit | danielhanchen's submissionslogin
1.Gemma 4 Fine-Tuning Guide (unsloth.ai)
2 points by danielhanchen 13 hours ago | past | discuss
2.Show HN: Unsloth Studio - Local Fine-tuning, Chat UI (github.com/unslothai)
8 points by danielhanchen 23 days ago | past | 2 comments
3.Qwen3.5: Towards Native Multimodal Agents (qwen.ai)
434 points by danielhanchen 52 days ago | past | 214 comments
4.Qwen3-Coder-Next (qwen.ai)
735 points by danielhanchen 65 days ago | past | 429 comments
5.Qwen-Image-2512 (qwen.ai)
7 points by danielhanchen 3 months ago | past | 1 comment
6.Kimi K2 Thinking: How to Run Locally (unsloth.ai)
3 points by danielhanchen 5 months ago | past
7.LoRA Without Regret (thinkingmachines.ai)
24 points by danielhanchen 6 months ago | past
8.Long context GPT-OSS fine-tuning (unsloth.ai)
4 points by danielhanchen 7 months ago | past | 1 comment
9.Show HN: GPT OSS: How to run and fine-tune (unsloth.ai)
2 points by danielhanchen 8 months ago | past
10.Qwen3-30B-A3B-Instruct-2507 (huggingface.co)
5 points by danielhanchen 8 months ago | past
11.Qwen3-Coder: Agentic coding in the world (qwenlm.github.io)
765 points by danielhanchen 8 months ago | past | 366 comments
12.2.71bit DeepSeek-V3-0324 (unsloth.ai)
1 point by danielhanchen on March 26, 2025 | past | 1 comment
13.Gemma 3: Google's new multimodal models (ai.google.dev)
4 points by danielhanchen on March 12, 2025 | past | 2 comments
14.How to Run QwQ-32B effectively (unsloth.ai)
4 points by danielhanchen on March 7, 2025 | past | 3 comments
15.Train your own R1 reasoning model (unsloth.ai)
11 points by danielhanchen on Feb 6, 2025 | past | 5 comments
16.How to run 1.58bit DeepSeek R1 with Open WebUI (openwebui.com)
37 points by danielhanchen on Jan 31, 2025 | past | 9 comments
17.Phi-4 Bug Fixes (unsloth.ai)
193 points by danielhanchen on Jan 10, 2025 | past | 68 comments
18.My take on the Post Pretraining world (twitter.com/danielhanchen)
1 point by danielhanchen on Dec 16, 2024 | past | 3 comments
19.Dynamic 4bit Quantization (unsloth.ai)
3 points by danielhanchen on Dec 4, 2024 | past | 5 comments
20.Show HN: Finetune Llama 3.2 Vision in a Colab (colab.research.google.com)
10 points by danielhanchen on Nov 21, 2024 | past
21.Python 3.11 is 1.25x faster than 3.10 (python.org)
3 points by danielhanchen on Nov 4, 2024 | past | 5 comments
22.Fixing Gradient Accumulation (huggingface.co)
2 points by danielhanchen on Oct 16, 2024 | past
23.Unit Economics of LLM APIs (lesswrong.com)
5 points by danielhanchen on Aug 27, 2024 | past | 4 comments
24.LoRA Learns Less and Forgets Less Updated (openreview.net)
1 point by danielhanchen on Aug 27, 2024 | past | 1 comment
25.VLLM automatic prefix / prompt caching (vllm.ai)
2 points by danielhanchen on Aug 25, 2024 | past | 1 comment
26.Higher Temperatures and Min_p Sampling (arxiv.org)
1 point by danielhanchen on Aug 23, 2024 | past | 1 comment
27.Show HN: Open-source fine-tuning in a Colab notebook (colab.research.google.com)
5 points by danielhanchen on Aug 21, 2024 | past
28.Sahm rule signals start of recession (stlouisfed.org)
4 points by danielhanchen on Aug 2, 2024 | past | 3 comments
29.Low Level Technicals of LLMs [video] (youtube.com)
1 point by danielhanchen on Aug 1, 2024 | past | 1 comment
30.Gemma-2 2B beats GPT3.5 on Chatbot Arena (huggingface.co)
5 points by danielhanchen on July 31, 2024 | past | 1 comment

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: