"vscode:/vscode.git/clone" did not exist on "cc74499d511b719942f4e7a861b7a2f22d7bf1c6"
  1. 05 Aug, 2021 1 commit
    • Paul Fultz II's avatar
      Add gpu driver and improvements to pointwise codegen (#851) · 29fa2666
      Paul Fultz II authored
      
      
      * Add method to compile pointwise
      
      * Formatting
      
      * Add lambda
      
      * Add semicolon
      
      * Rename variable
      
      * Add driver to run jit kernels
      
      * Formatting
      
      * Add context
      
      * Formatting
      
      * Make seperate driver folder
      
      * Add more general gpu driver
      
      * Formatting
      
      * Print out wll time
      
      * Formatting
      
      * Run multiple times and skip first run
      
      * Formatting
      
      * Seperate time_op
      
      * Run an op for comparison
      
      * Formatting
      
      * Add debug asserts
      
      * Formatting
      
      * Change parameer name
      
      * Formatting
      
      * Fix argument order
      
      * Formatting
      
      * Add preloading
      
      * Formatting
      
      * Allow a different data type
      
      * Formatting
      
      * Pipeline transformations
      
      * Formatting
      
      * Add vectorization
      
      * Formatting
      
      * Reduce dims
      
      * Formatting
      
      * Compile with launch params as constant
      
      * Formatting
      
      * Make sure buffer can be vecotrized
      
      * Formatting
      
      * Enable vectorization and preloading
      
      * Formatting
      
      * Add print header
      
      * Formatting
      
      * Avoid allocating to large of LDS
      
      * Formatting
      
      * Add some vec functions to a seperate header
      
      * Formatting
      
      * Add stride loops
      
      * Formatting
      
      * Improve the transform pipeline
      
      * Formatting
      
      * Add const
      
      * Fix shape check
      
      * Formatting
      
      * Just check stride axis is zero
      
      * Remove extra finc_vector_axis overload
      
      * Simplify some mroe functions
      
      * Formatting
      
      * Remove some more extra functions
      
      * Formatting
      
      * Simplify more decltypes
      
      * Add another const
      
      * Fix test
      
      * Get buffer pointer different for older compilers
      Co-authored-by: default avatarShucai Xiao <shucai@gmail.com>
      Co-authored-by: default avatarChris Austen <causten@users.noreply.github.com>
      29fa2666
  2. 19 Oct, 2020 1 commit
    • Shucai Xiao's avatar
      Op constructor c/python api (#636) · e939ddcd
      Shucai Xiao authored
      
      
      * code backup
      
      * clang format
      
      * add the python api for op construction
      
      * clang format
      
      * add converting a json string to a value and then pass to make_op
      
      * fix a build error
      
      * support kwargs decoder
      
      * clang format
      
      * code backup
      
      * clang format
      
      * temp code backup
      
      * clang format
      
      * temp code backup
      
      * convert input string to the json format
      
      * clang format
      
      * remove unnecessary code
      
      * fix review comments
      
      * clang format
      
      * add a comment
      
      * add unit tests
      
      * clang format
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * rename two files
      
      * refine an error report
      
      * clang format
      
      * remove unnecessary code
      
      * fix a review comments
      
      * add convert_to_json unit tests
      
      * clang format
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * add more unit tests
      
      * clang format
      
      * fix clang tidy error
      
      * clang format
      
      * fixed an test failure
      
      * fix cppcheck error
      
      * fix a bug
      
      * fix cppcheck error
      
      * fix jenkins build error
      
      * fix a jenkin build error by wrap up string
      
      * clang format
      
      * another try
      
      * clang format
      
      * fix jenkins build error
      
      * clang format
      
      * remove unnecessary code
      
      * fix review comments
      
      * fix cppcheck error
      
      * fix review comments
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      e939ddcd