"git@developer.sourcefind.cn:sugon_wxj/megatron-lm.git" did not exist on "b9ae7ba5b7813bcc838e4e250653e2969b1ed812"
  1. 09 Jun, 2023 1 commit
    • Ted Themistokleous's avatar
      Add per-os shim for execution support · a3a162e9
      Ted Themistokleous authored
      Splits support for using tbb (oneTBB now) between OS's when using
      std::execution::par
      
      It appears TBB support seems to wane in earlier versions of g++ in CentOS and
      other OS's still require testing/containers to verify builds.
      
      The MIGraphX shim should just operate as normal with copy_if and sort() calls
      if TBB support isn't functional for the current OS/g++ version
      
      Also moved the TBB dependency from install_preqs to Ubuntu Docker. Shouldn't breka
      other dockers if support isn't added
      a3a162e9
  2. 17 Apr, 2023 17 commits
  3. 13 Apr, 2023 1 commit
  4. 12 Apr, 2023 1 commit
  5. 11 Apr, 2023 1 commit
  6. 10 Apr, 2023 2 commits
  7. 09 Apr, 2023 1 commit
  8. 07 Apr, 2023 1 commit
  9. 06 Apr, 2023 2 commits
    • Charlie Lin's avatar
      Driver dynamic batch update (#1652) · adccec52
      Charlie Lin authored
      Examples..
      
      bin/driver verify /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --batch 3 --dyn-input-dim @data "[{min:1, max:4}, 3, 224, 224]"
      
      bin/driver compile /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --default-dyn-dim "{min:1, max:10}" --output resnet50_batch1-10.mxr
      
      bin/driver perf resnet50_batch1-10.mxr --batch 4
      adccec52
    • Paul Fultz II's avatar
      Add reduction fusion (#1614) · f201285c
      Paul Fultz II authored
      Automatically fuse multiple reductions and pointwise operations.
      f201285c
  10. 05 Apr, 2023 3 commits
  11. 04 Apr, 2023 2 commits
    • shivadbhavsar's avatar
      fix bug in transpose_slice simplification (#1660) · 30af1697
      shivadbhavsar authored
      Bug found due to failing torch benchmark. Added test case to reproduce issue causing the model to error out on compile.
      Original logic results in the following error:
      AMDMIGraphX/src/include/migraphx/op/unsqueeze.hpp:128: normalize_compute_shape: UNSQUEEZE: Axis dimenstion is not divisible by step
      30af1697
    • Charlie Lin's avatar
      Refactor dynamic_dimension to have multiple optimals (#1625) · e7ec374f
      Charlie Lin authored
      Makes the optimals into a std::set<std::size_t>
      Changes shape object functions to handle the opts change
      Changes to convolution, flatten, pooling, and convolution in that they no longer calculate the output optimal dimensions. Instead returns empty opts. Will need to change this in the future if we want to support dynamic shapes fully.
      Many changes to tests and shape calls with respect to the new optimals
      e7ec374f
  12. 03 Apr, 2023 2 commits
  13. 01 Apr, 2023 1 commit
  14. 31 Mar, 2023 1 commit
    • Charlie Lin's avatar
      Split single dynamic dimension compiler pass (#1580) · e9e3eacc
      Charlie Lin authored
      Adds a new GPU compiler pass split_single_dyn_dim that handles when one input parameter has a single non-fixed dynamic_dimension.
      commonly occurs for dynamic batch or BERT sequence length
      Splits the dynamic shape into several submodules will static input parameters to handle all of the cases in the dynamic_dimension range.
      Essentially does what I manually did for the select_module verify tests
      Adds a compile option split_single_dyn_dim that toggles the pass on/off. Defaults to false.
      Updates verify_program.hpp and run_verify.cpp to allow for the tests to change the compile_options
      e9e3eacc
  15. 30 Mar, 2023 1 commit
  16. 29 Mar, 2023 1 commit
  17. 28 Mar, 2023 1 commit
  18. 27 Mar, 2023 1 commit