1. 28 Nov, 2024 1 commit
    • Benchmarks - Add LLaMA-2 Models (#668) · 249e21c1
      pdr authored
      Added llama benchmark for training and inference, following the
      existing pytorch model implementations such as gpt2 and lstm.
      
      - added llama fp8 unit test for better code coverage, to reduce memory
      required
      - updated transformers version >= 4.28.0 for LLamaConfig
      - set tokenizers version <= 0.20.3 to avoid 0.20.4 version
      [issues](https://github.com/huggingface/tokenizers/issues/1691) with
      py3.8
      - added llama2 to tensorrt
      - llama2 tests not added to test_tensorrt_inference_performance.py due
      to large memory requirement for worker gpu. tests validated separately
      on gh200
      
      ---------
      Co-authored-by: dpatlolla <dpatlolla@microsoft.com>
  2. 07 Dec, 2023 1 commit
  3. 28 Jan, 2023 1 commit
  4. 25 Jan, 2022 1 commit
  5. 19 Jan, 2022 1 commit
  6. 09 Dec, 2021 1 commit
  7. 27 Oct, 2021 1 commit
  8. 12 Oct, 2021 1 commit
    • Docs - Refine document structure (#225) · 3d0fde12
      Yifan Xiong authored
      __Major Revisions__
      
      * Refine document structure for user tutorial.
      
      __Minor Revisions__
      
      * Add AMD part in installation.
      * Change default config file to latest link.
  9. 30 Jun, 2021 1 commit
  10. 25 Jun, 2021 1 commit