Tony Congqian Wang

TonyCWang

AI & ML interests

None yet

Organizations

None yet

commented 2 papers 8 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 233

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 46

commented 2 papers 9 months ago

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

New activity in timm/vit_little_patch16_reg4_gap_256.sbb_in1k 11 months ago

Loss exploding to nan

#1 opened 11 months ago by

commented 2 papers 12 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265


New activity in timm/mobilenetv4_conv_aa_large.e230_r448_in12k_ft_in1k about 1 year ago

Training recipe

#2 opened about 1 year ago by

commented 2 papers about 1 year ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265
