"megatron/model/gpt_model.py" did not exist on "456f17280fcc25eb6bb3d9de7f9cad170b7b98d9"
  • Azure's avatar
    * Reorganize documentation/README · ef89b152
    Azure authored
        * Consolidate the installation section, as it's currently too cluttered
        * Move the Multi-GPU section to the top-level structure
        * Add a **detailed** tutorial on registering extra GPU memory with Marlin
    ef89b152
deepseek-v2-injection.md 13.5 KB