HAODONG DUAN
KennyUTC
AI & ML interests
Video Understanding; Multi-Modal Learning
Recent Activity
Upvoted a paper about 3 hours ago:
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
Upvoted a paper about 1 month ago:
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought
Upvoted a paper about 1 month ago:
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence