Post
581
NEW:
@EssentialAI
just released Rnj-1, their first 8B model.
You can easily fine-tune it with GRPO using TRL to add reasoning capabilities to a compact mode
Free Colab link: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_rnj_1_instruct.ipynb
More free TRL notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks
You can easily fine-tune it with GRPO using TRL to add reasoning capabilities to a compact mode
Free Colab link: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_rnj_1_instruct.ipynb
More free TRL notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks