yichengup/flux.1-fill-dev-OneReward

Image-to-Image

English

linoyts (HF Staff) committed on Sep 18

Commit 48c0347 (verified) · 1 Parent(s): 7cfdf72

improve readme :)


hey there! I added some useful tags and details to the README 🤗

Files changed (1)

README.md +20 -3


@@ -1,9 +1,26 @@
 ---
 base_model:
 - bytedance-research/OneReward
 ---
-flux.1-fill-dev-OneReward
-Process the model into a single model suitable for ComfyUI use
-Original model link: [OneReward](https://huggingface.co/bytedance-research/OneReward)

 ---
+license: cc-by-nc-4.0
 base_model:
+- black-forest-labs/FLUX.1-Fill-dev
 - bytedance-research/OneReward
+language:
+- en
+pipeline_tag: image-to-image
 ---
+# OneReward - ComfyUI
+**ComfyUI community** checkpoint for **[OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning](https://arxiv.org/abs/xxxx)**.
+[![arXiv](https://img.shields.io/badge/arXiv-Paper-<COLOR>.svg)](https://arxiv.org/abs/2508.21066) [![GitHub Repo](https://img.shields.io/badge/GitHub-Repo-green?logo=github)](https://github.com/bytedance/OneReward) [![GitHub Pages](https://img.shields.io/badge/GitHub-Project-blue?logo=github)](https://one-reward.github.io/)
+<br>
+This repo contains the checkpoint from [OneReward](https://huggingface.co/bytedance-research/OneReward) processed into a single model suitable for ComfyUI use.
+<p align="center">
+  <img src="assets/show.jpg" alt="assert" width="800">
+</p>
+**OneReward** is a novel RLHF methodology for the visual domain that employs Qwen2.5-VL as a generative reward model to enhance multi-task reinforcement learning, significantly improving the policy model’s generation ability across multiple subtasks. Building on OneReward, **FLUX.1-Fill-dev-OneReward**, based on FLUX Fill [dev], outperforms closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing.
+For more details and examples, see the original model repo: [**OneReward**](https://huggingface.co/bytedance-research/OneReward)
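
To make the metadata part of this change easier to check, here is a minimal sketch that reads the README front matter as it should look after the commit. The `parse_front_matter` helper and the `README_HEADER` constant are hypothetical names introduced for illustration only. The parser handles just the small YAML subset seen in this header (scalar `key: value` pairs and `- item` lists), so it is not a general YAML parser and not how the Hub itself reads model cards.

```python
def parse_front_matter(text):
    """Parse the small YAML subset used in the model card header.

    Supports `key: value` scalars and `key:` followed by `- item` lines.
    Hypothetical helper for illustration; not a full YAML parser.
    """
    lines = text.strip().splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file
    meta = {}
    current_key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # closing delimiter ends the header
        if stripped.startswith("- ") and isinstance(meta.get(current_key), list):
            meta[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            current_key = key.strip()
            value = value.strip()
            # An empty value means a list follows on the next lines.
            meta[current_key] = value if value else []
    return meta


# Front matter as reconstructed from the added and unchanged diff lines above.
README_HEADER = """---
license: cc-by-nc-4.0
base_model:
- black-forest-labs/FLUX.1-Fill-dev
- bytedance-research/OneReward
language:
- en
pipeline_tag: image-to-image
---
"""

metadata = parse_front_matter(README_HEADER)
for key, value in metadata.items():
    print(f"{key}: {value}")
```

The resulting dictionary holds `license`, `base_model`, `language`, and `pipeline_tag`, which are the four keys the new header sets.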