Miguel Moura Ramos
M1keR
AI & ML interests
None yet
Recent Activity
updated a dataset about 10 hours ago
M1S1/Reasoning-SFT-Mixture-NoDedup-NoFilter
published a dataset about 14 hours ago
M1S1/Reasoning-SFT-Mixture-NoDedup-NoFilter
authored a paper about 1 month ago
Combining On-Policy Optimization and Distillation for Long-Context Reasoning in Large Language Models