Tony Congqian Wang

TonyCWang
6 14 1

commented 2 papers 8 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 233 • 9

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 46 • 4
commented 2 papers 9 months ago

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57 • 4

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 • 50
New activity in timm/vit_little_patch16_reg4_gap_256.sbb_in1k 11 months ago

Loss exploding to nan

31
#1 opened 11 months ago by tony0278611
commented 2 papers 12 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 • 50

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 • 50
New activity in timm/mobilenetv4_conv_aa_large.e230_r448_in12k_ft_in1k about 1 year ago

Training recipe

#2 opened about 1 year ago by TonyCWang
commented 2 papers about 1 year ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 • 50

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 • 50