"test/vscode:/vscode.git/clone" did not exist on "3696fe1c76f50b2d16742632128c41f7869bc119"
  1. 09 Aug, 2024 4 commits
  2. 08 Aug, 2024 5 commits
  3. 07 Aug, 2024 5 commits
  4. 06 Aug, 2024 8 commits
  5. 05 Aug, 2024 2 commits
  6. 01 Aug, 2024 1 commit
  7. 31 Jul, 2024 7 commits
  8. 30 Jul, 2024 2 commits
  9. 26 Jul, 2024 2 commits
  10. 25 Jul, 2024 2 commits
  11. 24 Jul, 2024 2 commits
    • Andriy Roshchenko's avatar
      Adding more instances of grouped convolution 3d forward for FP8 with... · 4a8a1bef
      Andriy Roshchenko authored
      Adding more instances of grouped convolution 3d forward for FP8 with ConvScale+Bias element-wise operation. (#1412)
      
      * Add CMakePresets configurations.
      
      * Add binary elementwise ConvScaleAdd and an example.
      
      * Numerical verification of results.
      
      Observed significant irregularities in F8 to F32 type conversions:
      ```log
      ConvScaleAdd: float=145.000000   f8_t=160.000000    e=144.000000
      ConvScaleAdd: float=97.000000   f8_t=96.000000    e=104.000000
      ConvScaleAdd: float=65.000000   f8_t=64.000000    e=72.000000
      ```
      
      * Implemented ConvScaleAdd + Example.
      
      * Add ConvScale+Bias Instances
      
      * Add Client Example for ConvScale+Bias
      
      * Fix number of bytes in an example..
      
      * Cleanup.
      4a8a1bef
    • Bartłomiej Kocot's avatar
      Add support for half_t and bfloat to reduction operations (#1395) · ffabd70a
      Bartłomiej Kocot authored
      * Add support for half_t and bfloat to reduction operations
      
      * Fix bhalf convert
      
      * Next fix bf16
      ffabd70a