-
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper • 2412.11605 • Published • 18 -
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published • 108 -
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
Paper • 2412.17739 • Published • 41 -
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval
Paper • 2412.15443 • Published • 10
Collections
Discover the best community collections!
Collections including paper arxiv:2502.18864
-
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Paper • 2506.11763 • Published • 72 -
A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications
Paper • 2506.12594 • Published • 2 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 51 -
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
Paper • 2507.14683 • Published • 134
-
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 126 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 51 -
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Paper • 2504.17192 • Published • 120
-
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Paper • 2504.08066 • Published • 15 -
From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Paper • 2504.19678 • Published • 3 -
AIGS: Generating Science from AI-Powered Automated Falsification
Paper • 2411.11910 • Published -
AgentRxiv: Towards Collaborative Autonomous Research
Paper • 2503.18102 • Published • 25
-
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 51 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 192
-
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 192 -
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Paper • 2502.14739 • Published • 104 -
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?
Paper • 2502.14502 • Published • 91 -
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC
Paper • 2502.14282 • Published • 29
-
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper • 2412.11605 • Published • 18 -
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published • 108 -
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
Paper • 2412.17739 • Published • 41 -
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval
Paper • 2412.15443 • Published • 10
-
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Paper • 2504.08066 • Published • 15 -
From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Paper • 2504.19678 • Published • 3 -
AIGS: Generating Science from AI-Powered Automated Falsification
Paper • 2411.11910 • Published -
AgentRxiv: Towards Collaborative Autonomous Research
Paper • 2503.18102 • Published • 25
-
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Paper • 2506.11763 • Published • 72 -
A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications
Paper • 2506.12594 • Published • 2 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 51 -
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
Paper • 2507.14683 • Published • 134
-
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 51 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 192
-
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 126 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 51 -
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Paper • 2504.17192 • Published • 120
-
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 192 -
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Paper • 2502.14739 • Published • 104 -
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?
Paper • 2502.14502 • Published • 91 -
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC
Paper • 2502.14282 • Published • 29