"awq/vscode:/vscode.git/clone" did not exist on "2de6092a9c3db482e2a3b36d2c56a89193391c94"
  • Adam Osewski's avatar
    Unified conv3D API + support for all data types. (#133) · f91579aa
    Adam Osewski authored
    
    
    * Convolution ND
    
    * Code unification across dimensions for generating tensor descriptors.
    * Example
    * Instances
    
    * Move convnd f32 instance file to comply with repo structure.
    
    * Conv 1D tensor layouts.
    
    * Formatting and use ReferenceConv
    
    * Reference ConvFwd supporting 1D and 2D convolution.
    
    * Debug printing TensorLayout name.
    
    * Conv fwd 1D instance f32
    
    * Refactor conv ND example.
    
    Needed to support various conv dimensio.
    
    Needed to support various conv dimensions
    
    * Rename conv nd example director to prevent conflicts.
    
    * Refactor some common utility to single file.
    
    Plus some tests.
    
    * Refactor GetHostTensorDescriptor + UT.
    
    * Add 1D test case.
    
    * Test reference convolution 1d/2d
    
    * Remove some leftovers.
    
    * Fix convolution example error for 1D
    
    * Refactor test check errors utility function.
    
    * Test Conv2D Fwd XDL
    
    * More UT for 1D case.
    
    * Parameterize input & weight initializers.
    
    * Rename example to prevent conflicts.
    
    * Split convnd instance into separate files for 1d/2d
    
    * Address review comments.
    
    * Fix data type for flops/gbytes calculations.
    
    * Assign example number 11.
    
    * 3D cases for convolution utility functions.
    
    * 3D reference convolution.
    
    * Add support for 3D convolution.
    
    * Check for inputs bigger than  2GB.
    
    * Formatting
    
    * Support for bf16/f16/f32/i8 - conv instances + UT.
    
    * Use check_err from test_util.hpp.
    
    * Split convnd test into separate files for each dim.
    
    * Fix data generation and use proper instances.
    
    * Formatting
    
    * Skip tensor initialization if not necessary.
    
    * Fix CMakefiles.
    
    * Remove redundant conv2d_fwd test.
    
    * Lower problem size for conv3D UT.
    
    * 3D case for convnd example.
    
    * Remove leftovers after merge.
    
    * Add Conv Specialization string to GetTypeString
    
    * Skip instance causing numerical errors.
    
    * Small fixes.
    
    * Remove redundant includes.
    
    * Fix namespace name error.
    
    * Script for automatic testing and logging convolution fwd UTs
    
    * Comment out numactl cmd.
    
    * Refine weights initalization and relax rtol for fp16
    
    * Fix weights initialization for int8.
    
    * Add type_convert when store output in ref conv 1D.
    
    * Get back old conv2d_fwd_xdl operation.
    
    * Silence conv debug print.
    
    * format
    
    * clean
    
    * clean
    
    * Fix merge.
    
    * Fix namespace for check_err
    Co-authored-by: default avatarAdam Osewski <aosewski@amd.com>
    Co-authored-by: default avatarChao Liu <chao.liu2@amd.com>
    f91579aa
convnd_fwd_xdl.cpp 15.3 KB