• Qianfeng's avatar
    Batchnorm-forward and Batchnorm-infer Implemented using generic kernels (#320) · 53ea4713
    Qianfeng authored
    * Implement multiple-reduction in one kernel (kernels, device ops, examples)
    
    * Add generic elementwise kernel and device interface
    
    * Add generator for normal-distributed data initialization
    
    * Add host refer implementation of batchnorm-forward and batchnorm-infer
    
    * Add examples for implementing batchnorm-forward and batchnorm-infer using generic kernels
    
    * Remove un-needed including in batchnorm example
    
    * Renaming generic_elementwise to elementiwise in kernel and device classes/functions
    
    * Change in gemm_layernorm examples to use DeviceElementwise instead of Device5AryElementwise
    
    * Change in exampe 19_binary_elementwise to use DeviceElementwise instead of DeviceBinaryElementwise
    
    * Change in device_cgemm_4gemm_xdl_cshuffle.hpp to use kernel_elementwise instead of kernel_binary_elementwise
    
    * Add DeviceElementwiseBase and use it in device_normalize_instance.cpp
    
    * Removing and renaming files
    
    * Update to synchronize gemm_layernorm client example to the generic element-wise device op API
    
    * Update to synchronize with the latest headers directory and HostTensorDescriptor interface renaming
    
    * Merge two static member functions in device_elementwise.hpp
    
    * Remove unary_elementwise_1d kernel and device
    53ea4713
CMakeLists.txt 150 Bytes