Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yanxi Chen's picture
4 2

Yanxi Chen

yanxi-chen

AI & ML interests

None yet

Organizations

None yet

commented a paper 2 months ago

Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Paper • 2509.24203 • Published Sep 29 • 7 •
2
commented a paper 7 months ago

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Paper • 2505.17826 • Published May 23 • 9 •
2
commented a paper about 1 year ago

A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Paper • 2411.19477 • Published Nov 29, 2024 • 6 •
2
commented 3 papers almost 2 years ago

EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism

Paper • 2312.04916 • Published Dec 8, 2023 • 7 •
7

EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism

Paper • 2312.04916 • Published Dec 8, 2023 • 7 •
7

EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism

Paper • 2312.04916 • Published Dec 8, 2023 • 7 •
7
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs