Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO Paper • 2511.16669 • Published 21 days ago • 31
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery Paper • 2511.11257 • Published 27 days ago • 24
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published Oct 22 • 61
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory Paper • 2507.01945 • Published Jul 2 • 78
Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset Paper • 2506.18851 • Published Jun 23 • 30
Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs Paper • 2505.11277 • Published May 16 • 9
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published Jun 13 • 72
CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization Paper • 2409.05606 • Published Sep 9, 2024 • 1
Pro3D-Editor : A Progressive-Views Perspective for Consistent and Precise 3D Editing Paper • 2506.00512 • Published May 31 • 5
Phantom: Subject-consistent video generation via cross-modal alignment Paper • 2502.11079 • Published Feb 16 • 59
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published Feb 13 • 148
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published Feb 13 • 191