"...resnet50_tensorflow.git" did not exist on "cf60559f2f0fa165b77eacd5fb19c0d2cb8bfee5"
Pre-compute coordinates to speed up store_tile() for TileWindowWithStaticDistribution<> (#12)
* Extract store_tile() logics as method
* Extract load_tile() logics as method
* Rename type alias
* Extract common logics as traits
* Remove unnecessary access specifier
* Add ComputeMode for TileWindowWithStaticDistribution
* Put field check into Traits
* More definition of Traits types
* Use more clear static_assert() message
* Enable pre-compute coordinates in store_tile()
* Re-formate static assert
* Undo changes to the wrong method
* Enable pre-compute coords for store_tile()
* Remove static_vector usage
* Add method to move non-member coordinates
* Force using pre-computed coordinates in Store()
* Fix wrong access for SFC_Ys
* Change comment
* Allow users to hint # access per coord
* Add comment for noting remove data members later
* Unify FIXME comments
* Replace FIXME comments by TODO
* Let user specify HintNumCoords
* clean
* clean
* clean
* clean
* refactor load/store for window
* clean
* clean
* bug fix for window; clean
---------
Co-authored-by:
Chao Liu <chao.liu2@amd.com>
Showing
This diff is collapsed.
Please register or sign in to comment