"sgl-router/git@developer.sourcefind.cn:change/sglang.git" did not exist on "fd05b56750eec9d8bde1119bc78dc746490d5979"
  1. 21 Mar, 2022 1 commit
    • rocking5566's avatar
      Gemm_c_shuffle (4 layouts) X (fp32 bf16 int8) (#131) · 485ea46a
      rocking5566 authored
      
      
      * [What] Separate fixpoint gemm from gemm example
      [Why] let example of gemm_int8 be pure gemm.
      [What]
      1. Add gemm_requant_relu_requant,
      2. Let CDataType be int32 in pure gemm, because no one use int8 CDataType. It is also part of gemm_requant_relu_requant
      
      * Fix path
      
      * Revise cmakelist due to merge develop
      
      * Add gemm fp16 test
      
      * Extract PrepareGemmTensor
      
      * Extract TestGemm
      
      * Add test for different layout
      
      * Add 4 layouts of shuffle version of fp32
      
      * Add 4 layouts of shuffle version of int8
      
      * Add 4 layouts of shuffle version of bf16
      
      * replace all DeviceGemmPtr_ with DeviceGemmNoOpPtr to fit naming convension
      
      * Add test for non-shuffle verstion of gemm
      
      * Fix typo
      
      * Print kernel information
      
      * Add rest of the fp32 kernel to the test
      
      * 1. Add rest of the fp16 device iop.
      2. Mark the invalid device operation
      Co-authored-by: default avatarrocking <chunylai@amd.com>
      485ea46a
  2. 23 Feb, 2022 1 commit
    • Adam Osewski's avatar
      Unify Convolution FWD XDL 1D/2D implementation. (#93) · 756a7617
      Adam Osewski authored
      
      
      * Convolution ND
      
      * Code unification across dimensions for generating tensor descriptors.
      * Example
      * Instances
      
      * Move convnd f32 instance file to comply with repo structure.
      
      * Conv 1D tensor layouts.
      
      * Formatting and use ReferenceConv
      
      * Reference ConvFwd supporting 1D and 2D convolution.
      
      * Debug printing TensorLayout name.
      
      * Conv fwd 1D instance f32
      
      * Refactor conv ND example.
      
      Needed to support various conv dimensio.
      
      Needed to support various conv dimensions
      
      * Rename conv nd example director to prevent conflicts.
      
      * Refactor some common utility to single file.
      
      Plus some tests.
      
      * Refactor GetHostTensorDescriptor + UT.
      
      * Add 1D test case.
      
      * Test reference convolution 1d/2d
      
      * Remove some leftovers.
      
      * Fix convolution example error for 1D
      
      * Refactor test check errors utility function.
      
      * Test Conv2D Fwd XDL
      
      * More UT for 1D case.
      
      * Parameterize input & weight initializers.
      
      * Rename example to prevent conflicts.
      
      * Split convnd instance into separate files for 1d/2d
      
      * Address review comments.
      
      * Fix data type for flops/gbytes calculations.
      
      * Assign example number 11.
      Co-authored-by: default avatarAdam Osewski <aosewski@amd.com>
      Co-authored-by: default avatarChao Liu <chao.liu2@amd.com>
      756a7617