Xiao-Ming Wu PRO

DravenALG

https://dravenalg.github.io/

AI & ML interests

Deep Learning, Computer Vision, Embodied AI

Recent Activity

updated a dataset 3 days ago

DravenALG/GraspNet-1Billion

published a dataset 3 days ago

DravenALG/GraspNet-1Billion

upvoted a paper 10 days ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published 14 days ago • 28

upvoted 2 papers about 2 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 65

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9 • 125

upvoted a paper 4 months ago

Next Visual Granularity Generation

Paper • 2508.12811 • Published Aug 18 • 49