Space: Sneha7/phi2-helpfulness-grpo-demo
Status: Runtime error

phi2-helpfulness-grpo-demo (6.44 kB)
  • 1 contributor
  • History: 8 commits
Latest commit: "Create policy.py" by Sneha7 (e4c07fc, verified), 8 days ago
  • .gitattributes (1.52 kB): initial commit, 8 days ago
  • README.md (898 Bytes): Update README.md, 8 days ago
  • app.py (1.37 kB): Create app.py, 8 days ago
  • grpo_train.py (1.16 kB): Create grpo_train.py, 8 days ago
  • policy.py (485 Bytes): Create policy.py, 8 days ago
  • requirements.txt (61 Bytes): Update requirements.txt, 8 days ago
  • reward_fn.py (937 Bytes): Create reward_fn.py, 8 days ago