"src/propagate_constant.cpp" did not exist on "c05e72e2be46937cd43837afec53eff411a387f5"
- 25 Aug, 2020 1 commit
Paul Fultz II authored
* Use increment instead of division to compute register offset
* Formatting
* Limit layernorm to 1024 elements
* Formatting
* Add verification to driver
* Formatting
* Remove early return
* Use block_size 256
* Vectorize the kernel
* Formatting
* Convert to vector type
* Add layernorm tests
* Formatting
* Formatting
* Refactor layernorm to run both algos
* Formatting
* Fix compile error
* Fix tidy warnings
* Formatting
* Add layernorm function
* Formatting
- 16 Sep, 2019 1 commit
Paul Fultz II authored
* Add flags to quantize in driver
* Formatting
* Fix compile error
- 19 Aug, 2019 2 commits
- 05 Jun, 2019 4 commits