专题文章
Done
expand_less
OpenR1共学专题计划
就爱瞎鼓捣 2025-02-26 23:26:24
Done
expand_less
OpenR1项目介绍(Huggingface官方博客翻译)
就爱瞎鼓捣 2025-02-09 00:37:41
Done
expand_less
环境安装:安装OpenR1运行环境
就爱瞎鼓捣 2025-02-09 15:46:24
Done
expand_less
结果复现:复现DeepSeek-R1-Distill,Qwen(最新结果,请看3/5更新版)
就爱瞎鼓捣 2025-03-05 22:49:27
Done
expand_less
结果复现:关于结果不能复现的相关讨论
就爱瞎鼓捣 2025-03-07 23:27:58
Done
expand_less
结果复现:最新更新(3/5)
就爱瞎鼓捣 2025-03-05 22:47:03
Done
expand_less
推理训练数据制作方法(OpenR1的实践)
就爱瞎鼓捣 2025-02-26 23:15:47
Done
expand_less
微调模型:Qwen2.5-1.5B,7B 模型 SFT 微调
就爱瞎鼓捣 2025-03-07 23:39:29
Open
expand_less
模型微调:GRPO(待学习)
就爱瞎鼓捣 2025-03-07 23:38:18
Open
expand_less
Training with code interpreter(待学习)
就爱瞎鼓捣 2025-03-07 23:38:07
Open
expand_less
Data decontamination 数据去污染(待学习)
就爱瞎鼓捣 2025-03-07 23:38:31