Unverified Commit 883b4289 authored by Sergio Paniego Blanco, committed by GitHub

Add TRL example notebook to RLHF docs (#26346)


Signed-off-by: sergiopaniego <sergiopaniegoblanco@gmail.com>
parent e1098ced
@@ -12,4 +12,5 @@ See the following basic examples to get started if you don't want to use an exis
 See the following notebooks showing how to use vLLM for GRPO:
+- [Efficient Online Training with GRPO and vLLM in TRL](https://huggingface.co/learn/cookbook/grpo_vllm_online_training)
 - [Qwen-3 4B GRPO using Unsloth + vLLM](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_(4B)-GRPO.ipynb)