Unified Speech-Text Pre-training for Speech Translation and Recognition
Paper
•
2204.05409
•
Published
None defined yet.
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flow