Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted a paper about 18 hours ago
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
upvoted a paper about 18 hours ago
Soft Adaptive Policy Optimization
Organizations
None yet