Update README.md
README.md CHANGED

@@ -112,17 +112,18 @@ outputs = model.generate(**inputs, max_new_tokens=100)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
 
+## Acknowledgments
 
+We thank [@Antizana](https://github.com/Antizana) for the KV cache fix merged from [ouro-cache-fix](https://github.com/Antizana/ouro-cache-fix), which resolved a critical compatibility issue with transformers>=4.56.0.
 ## Citation
 
 ```bibtex
-@article{
+@article{zhu2025scaling,
 title={Scaling Latent Reasoning via Looped Language Models},
-author={Zhu, Rui-Jie and Wang, Zixuan and Hua, Kai and Zhang, Tianyu and Li, Ziniu and Que, Haoran and Wei, Boyi and
-journal={arXiv preprint},
+author={Zhu, Rui-Jie and Wang, Zixuan and Hua, Kai and Zhang, Tianyu and Li, Ziniu and Que, Haoran and Wei, Boyi and Wen, Zixin and Yin, Fan and Xing, He and others},
+journal={arXiv preprint arXiv:2510.25741},
 year={2025}
 }
-```
 
 ## License
 
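For reference, the context lines at the top of the hunk are the tail of the README's generation example. A minimal, self-contained sketch of that example follows; the checkpoint ID, the prompt, and the use of `trust_remote_code` are assumptions, so substitute the actual model ID and loading options from the model card.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint ID for illustration; use the ID from the model card.
model_id = "ByteDance/Ouro-1.4B"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

prompt = "Explain what a looped language model is in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# These two lines are the ones visible as diff context in the hunk above.
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```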
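Since the acknowledged cache fix concerns compatibility with transformers>=4.56.0, a small pre-flight version check can help before running the snippet. This is a sketch, not part of the README; it only assumes that `packaging` is available, which it is in any environment where transformers installs.

```python
# Pre-flight check tied to the Acknowledgments note: the merged KV cache fix
# targets transformers>=4.56.0, so verify the installed version before loading.
from packaging import version
import transformers

if version.parse(transformers.__version__) < version.parse("4.56.0"):
    raise RuntimeError(
        f"Found transformers {transformers.__version__}; "
        "the snippets here assume >=4.56.0 with the cache fix applied."
    )
```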