[format] applied code formatting on changed files in pull request 3296 (#3298)

Co-authored-by: github-actions <github-actions@github.com>

[format] applied code formatting on changed files in pull request 3296 (#3298)
Co-authored-by: github-actions <github-actions@github.com>
5134ad5d · github-actions[bot] · GitHub · 682af613 · 5134ad5d · 5134ad5d
Unverified Commit 5134ad5d authored Mar 29, 2023 by github-actions[bot] Committed by GitHub Mar 29, 2023
Hide whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

applications/Chat/README.md applications/Chat/README.md +1 -1

applications/Chat/examples/README.md applications/Chat/examples/README.md +1 -1

No files found.
--- a/applications/Chat/README.md
+++ b/applications/Chat/README.md
@@ -17,7 +17,7 @@
  - [Stage1 - Supervised instructs tuning](#stage1---supervised-instructs-tuning)
  - [Stage2 - Training reward model](#stage2---training-reward-model)
  - [Stage3 - Training model with reinforcement learning by human feedback](#stage3---training-model-with-reinforcement-learning-by-human-feedback)
-  - [Inference - After Training](#inference---after-training) 
+  - [Inference - After Training](#inference---after-training)
 - [Coati7B examples](#coati7b-examples)
  - [Generation](#generation)
  - [Open QA](#open-qa)

--- a/applications/Chat/examples/README.md
+++ b/applications/Chat/examples/README.md
@@ -100,7 +100,7 @@ Model performance in [Anthropics paper](https://arxiv.org/abs/2204.05862):
 - --max_len:           max sentence length for generation, type=int, default=512
 - --test:              whether is only tesing, if it's ture, the dataset will be small

-## Stage3 - Training model using prompts with RL 
+## Stage3 - Training model using prompts with RL

 Stage3 uses reinforcement learning algorithm, which is the most complex part of the training process, as shown below: