Unverified commit cf8b9c18, authored by sheikheddy, committed by GitHub
Typo fix (#57)

parent 85752000
@@ -89,7 +89,7 @@ significant memory. The current implementation (stage 1 of ZeRO) reduces memory
 4x relative to the state-of-art. You can read more about ZeRO in our [paper](https://arxiv.org/abs/1910.02054).
 With this impressive memory reduction, early adopters of DeepSpeed have already
-produced alanguage model (LM) with over 17B parameters called
+produced a language model (LM) with over 17B parameters called
 [Turing-NLG](https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft),
 establishing a new SOTA in the LM category.