nicolevin's comments

nicolevin · on Jan 29, 2025

Reach out to @nicilevv on X with questions.

nicolevin · on Jan 29, 2025

Bias-Unlearned DeepSeek-R1-Distill-Llama-8B here: https://huggingface.co/hirundo-io/DeepSeek-R1-Distill-Llama-...

nicolevin · on Jan 29, 2025

DeepSeek-R1-Distill-Llama-8B exhibited 2x more bias than base Llama. We applied targeted unlearning and reduced bias by up to 76% across race, gender, and nationality while maintaining model performance (TruthfulQA: 9.8→9.9, LogiQA: 42.6%→42.5%). Done in ~1hr on consumer hardware. The debiased model is on HuggingFace.
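
For anyone checking the arithmetic, here is a rough sketch of how figures like these are usually derived. The benchmark pairs are the ones quoted above. The bias scores passed to `relative_reduction` in the usage note are hypothetical placeholders, not measured values, and the function names are made up for illustration rather than taken from any actual evaluation pipeline.

```python
def relative_reduction(before: float, after: float) -> float:
    """Percent reduction going from `before` to `after` (positive means it dropped)."""
    if before == 0:
        raise ValueError("baseline score must be nonzero")
    return (before - after) / before * 100.0


def benchmark_deltas(scores: dict[str, tuple[float, float]]) -> dict[str, float]:
    """Absolute change (after minus before) per benchmark, rounded to one decimal."""
    return {name: round(after - before, 1) for name, (before, after) in scores.items()}


# (before, after) pairs as quoted in the comment above.
QUOTED_BENCHMARKS = {
    "TruthfulQA": (9.8, 9.9),
    "LogiQA": (42.6, 42.5),
}

deltas = benchmark_deltas(QUOTED_BENCHMARKS)
```

With a hypothetical bias score that falls from 0.50 to 0.12, `relative_reduction(0.50, 0.12)` comes out to about 76, which is the kind of number an "up to 76%" claim would correspond to. `deltas` holds the per-benchmark change in points, so the "maintaining model performance" claim can be read off directly.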