view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30 β’ 190
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards Paper β’ 2512.00473 β’ Published 9 days ago β’ 10
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image Paper β’ 2512.05044 β’ Published 4 days ago β’ 12
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper β’ 2512.05965 β’ Published 3 days ago β’ 30
SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling Paper β’ 2512.05343 β’ Published 4 days ago β’ 8
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper β’ 2512.05150 β’ Published 5 days ago β’ 43
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 5 days ago β’ 52
view post Post 1170 FYI: Mistral.Ministral-3 dequantizer FP8->BF16https://github.com/csabakecskemeti/ministral-3_dequantizer_fp8-bf16(The instruct model weights are in FP8) See translation π 2 2 π 1 1 + Reply
PixelDiT: Pixel Diffusion Transformers for Image Generation Paper β’ 2511.20645 β’ Published 13 days ago β’ 25
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression Paper β’ 2512.05081 β’ Published 4 days ago β’ 27
LATTICE: Democratize High-Fidelity 3D Generation at Scale Paper β’ 2512.03052 β’ Published 15 days ago β’ 9
Generative Neural Video Compression via Video Diffusion Prior Paper β’ 2512.05016 β’ Published 4 days ago β’ 8
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper β’ 2512.05103 β’ Published 4 days ago β’ 12
Running on Zero MCP Featured 47 LongCat Image Edit π 47 Generate or edit images using text prompts