18 8 27

An Yang

yangapku

https://scholar.google.com/citations?user=vO9FZekAAAAJ

AI & ML interests

NLP and Deep Learning

Recent Activity

authored a paper 7 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper 7 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

authored a paper 12 days ago

Qwen-Image Technical Report

View all activity

Organizations

authored a paper 7 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 8 days ago • 80

authored 4 papers 12 days ago

authored a paper 4 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 314

authored 3 papers 6 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability

Paper • 2505.24147 • Published May 30

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 74

authored 7 papers 7 months ago

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Paper • 2410.09335 • Published Oct 12, 2024 • 16

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published Oct 31, 2024 • 18

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 52

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 72

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

authored 4 papers about 1 year ago

Qwen Technical Report

Paper • 2309.16609 • Published Sep 28, 2023 • 37

InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining

Paper • 2003.13198 • Published Mar 30, 2020

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts

Paper • 2305.14688 • Published May 24, 2023

M6: A Chinese Multimodal Pretrainer

Paper • 2103.00823 • Published Mar 1, 2021

An Yang

AI & ML interests

Recent Activity

Organizations

yangapku's activity