linoyts HF Staff commited on
Commit
48c0347
·
verified ·
1 Parent(s): 7cfdf72

improve readme :)

Browse files

hey there! I added some useful tags and details to the README 🤗

Files changed (1) hide show
  1. README.md +20 -3
README.md CHANGED
@@ -1,9 +1,26 @@
1
  ---
 
2
  base_model:
 
3
  - bytedance-research/OneReward
 
 
 
4
  ---
5
- flux.1-fill-dev-OneReward
6
 
7
- Process the model into a single model suitable for ComfyUI use
8
 
9
- Original model link: [OneReward](https://huggingface.co/bytedance-research/OneReward)
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: cc-by-nc-4.0
3
  base_model:
4
+ - black-forest-labs/FLUX.1-Fill-dev
5
  - bytedance-research/OneReward
6
+ language:
7
+ - en
8
+ pipeline_tag: image-to-image
9
  ---
10
+ # OneReward - ComfyUI
11
 
12
+ **ComfyUI community** checkpoint for **[OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning](https://arxiv.org/abs/xxxx)**.
13
 
14
+ [![arXiv](https://img.shields.io/badge/arXiv-Paper-<COLOR>.svg)](https://arxiv.org/abs/2508.21066) [![GitHub Repo](https://img.shields.io/badge/GitHub-Repo-green?logo=github)](https://github.com/bytedance/OneReward) [![GitHub Pages](https://img.shields.io/badge/GitHub-Project-blue?logo=github)](https://one-reward.github.io/)
15
+ <br>
16
+
17
+ This repo contains the checkpoint from [OneReward](https://huggingface.co/bytedance-research/OneReward) processed into a single model suitable for ComfyUI use.
18
+
19
+ <p align="center">
20
+ <img src="assets/show.jpg" alt="assert" width="800">
21
+ </p>
22
+
23
+ **OneReward** is a novel RLHF methodology for the visual domain by employing Qwen2.5-VL as a generative reward model to enhance multitask reinforcement learning, significantly improving the policy model’s generation ability across multiple subtask. Building on OneReward, **FLUX.1-Fill-dev-OneReward** - based on FLUX Fill [dev], outperforms closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing.
24
+
25
+
26
+ For more details and examples see original model repo: [**OneReward**](https://huggingface.co/bytedance-research/OneReward)