·
AI & ML interests
RLHF
Organizations
None yet
xiaodongguaAIGC/X-R1-TAL-SCQ5K
Viewer
• Updated • 10k • 9
• 3
xiaodongguaAIGC/X-R1-TAL-SCQ2K
Viewer
• Updated • 3.33k • 10
• 1
xiaodongguaAIGC/X-R1-7500
Viewer
• Updated • 12.5k • 10
• 2
xiaodongguaAIGC/X-R1-1500
Viewer
• Updated • 2.5k • 6
Viewer
• Updated • 1.25k • 54
• 4
Viewer
• Updated • 84.2k • 37
Viewer
• Updated • 108k • 37
xiaodongguaAIGC/math_step_sft
Viewer
• Updated • 12.5k • 8
xiaodongguaAIGC/GSM8k_step_sft
Viewer
• Updated • 8.79k • 7
xiaodongguaAIGC/prm800k_step_sft
Viewer
• Updated • 121k • 6
xiaodongguaAIGC/awesome-sft
Viewer
• Updated • 131k • 15
• 1
xiaodongguaAIGC/awesome-dpo
Viewer
• Updated • 302k • 69
• 3
xiaodongguaAIGC/alpaca_en_zh_ruozhiba
Viewer
• Updated • 111k • 50
• 6
xiaodongguaAIGC/alpaca_gpt4_data_zh
Viewer
• Updated • 48.8k • 37
• 1
Viewer
• Updated • 146k • 22
• 1
xiaodongguaAIGC/CValues_DPO
Viewer
• Updated • 146k • 16