1. 07 Mar, 2023 2 commits
  2. 06 Mar, 2023 12 commits
  3. 02 Mar, 2023 1 commit
    • Illia Silin's avatar
      Change the CI workflow. (#611) · e6cda9f8
      Illia Silin authored
      * add new parallel stage on navi node
      
      * dont run performance tests on navi, get rid of 9110 compiler
      
      * only run navi build when not doing QA
      
      * fix syntax
      
      * use navi21 label
      
      * dont stash profiler on navi nodes, scp deb package to ginger
      
      * disable tests on navi nodes
      
      * test posting a binary to ginger
      
      * add sshpass and use it to copy deb package
      
      * fix the scp example
      
      * fix syntax
      
      * debug the scp issues
      
      * add jenkins user to docker
      
      * dont try whoami
      
      * change jenkins uid and add user with uid=1002
      
      * try scp from the last stage on micimaster
      
      * rename and stash the package, scp from micimaster
      e6cda9f8
  4. 01 Mar, 2023 2 commits
  5. 28 Feb, 2023 3 commits
  6. 27 Feb, 2023 3 commits
  7. 24 Feb, 2023 2 commits
  8. 22 Feb, 2023 2 commits
    • Rostyslav Geyyer's avatar
      Add Grouped Conv Backward Weight on Navi21 for ResNet50. (#505) · 246ceee4
      Rostyslav Geyyer authored
      
      
      * Add DeviceOp and examples
      
      * Format DeviceOp template arguments
      
      * Remove bf16 example
      
      * Format
      
      * Format
      
      * Update MakeABCGridDescriptor_A_K0_M_K1_B_K0_N_K1_C_M_N
      
      * Refactor argument preparation
      
      * Update conv_bwd_weight_dl to grouped_conv_bwd_weight_dl
      
      * Rename device op file
      
      * Update include directive in the example file
      
      * Update descriptor preparation for grouped op
      
      * Update the argument
      
      * Update batch handling
      
      * Add gridwise gemm supporting batched input
      
      * Update blockwise indexing, working version
      
      * Update copyright year
      
      * Update check if argument is supported
      
      * Refactor and make consistent with xdl examples
      
      * Update check if argument is supported
      
      * Add changelog entry
      
      * Added comments on Dl op split_k>1 support
      
      ---------
      Co-authored-by: default avatarRosty Geyyer <rosty.geyyer@amd.com>
      Co-authored-by: default avatarzjing14 <zhangjing14@gmail.com>
      246ceee4
    • ltqin's avatar
      Grouped conv1d client example (#589) · 830d37a7
      ltqin authored
      
      
      * add conv1d fwd client example
      
      * change 07_grouped_conv2d_fwd to 07_grouped_convnd_fwd
      
      * add conv1d bwd weight
      
      ---------
      Co-authored-by: default avatarzjing14 <zhangjing14@gmail.com>
      830d37a7
  9. 16 Feb, 2023 10 commits
  10. 15 Feb, 2023 3 commits
    • pmaybank's avatar
      Sphinx doc (#581) · cb3fac4d
      pmaybank authored
      
      
      * New docs directory with minimal config
      
      * Based on docs directory of rocBLAS
      
      * Config for running Doxygen then Sphinx to generate HTML
      
      * Add minimal content - intro to doc
      
      * Add some boilerplate sections to doc
      
      * content still needs to be done,
      * e.g., need to generate API documentation using Doxygen
      * need to write contributor guide
      
      * Start Softmax section of Support Primitives doc
      
      * Written as a test bed for typesetting math content
      
      * Need to decide how much detail to go into
      
      * add doc directories to git ignore file.
      
      * Minor edits - new line at EOF, change year in copyright notices
      
      * Port Markdown files to ReStructuredText
      
      * Copy Markdown files from pre-existing doc directory to docs directory
      
      * Convert to reStructured Text (rst) - section headings, links, tables
        have a different syntax in rst
      
      * New rst files added to index - can generate HTML with same style as
        HTML generated from rst files in previous commits
      
      * Intention is to make all the content in doc redundant and use rst
        throughout rather than mix of md and rst
      
      * Extend Softmax section of Primitives Guide
      
      * rename l to z
      
      * add material on applying softmax row-wise to matrix
      
      * define macro for diag operator (represents diagonal matrix)
      
      ---------
      Co-authored-by: default avatarzjing14 <zhangjing14@gmail.com>
      cb3fac4d
    • Illia Silin's avatar
      Clean up kernel launch output (#569) · 19490ac4
      Illia Silin authored
      
      
      * clean up output from kernel_launch
      
      * set RUN_WARMUP to 0 by default
      
      * split the warm-up into a separate issue
      
      ---------
      Co-authored-by: default avatarzjing14 <zhangjing14@gmail.com>
      19490ac4
    • zjing14's avatar
      Add contraction_fp64 example (#570) · 24c9ee1d
      zjing14 authored
      
      
      * add contraction_bilinear
      
      * add contraction_scale_xdl_fp64
      
      * reduce tile size to avoid register spill
      
      ---------
      Co-authored-by: default avatarroot <root@ctr-ubbsmc16.amd.com>
      24c9ee1d