Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Sneha7
/
phi2-helpfulness-grpo-demo
like
1
Runtime error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
phi2-helpfulness-grpo-demo
10.2 kB
1 contributor
History:
46 commits
Sneha7
Update grpo_train.py
c8e38f6
verified
2 days ago
.gitattributes
Safe
1.52 kB
initial commit
7 days ago
README.md
Safe
304 Bytes
Update README.md
7 days ago
app.py
Safe
1.59 kB
Update app.py
2 days ago
grpo_train.py
Safe
4.26 kB
Update grpo_train.py
2 days ago
policy.py
Safe
1.48 kB
Update policy.py
2 days ago
requirements.txt
Safe
72 Bytes
Update requirements.txt
5 days ago
reward_fn.py
Safe
937 Bytes
Create reward_fn.py
7 days ago