"src/targets/gpu/vscode:/vscode.git/clone" did not exist on "0c1df49c7252cca61b705cf411fa92c2b4a7828e"
- 03 May, 2023 2 commits
- 01 May, 2023 3 commits
- 28 Apr, 2023 4 commits
- 25 Apr, 2023 2 commits
-
-
kahmed10 authored
Update the rocBLAS version check to support 3.0 and above with simplified logic
-
Chris Austen authored
-
- 24 Apr, 2023 3 commits
-
-
Charlie Lin authored
Updates the hip::copy_to_gpu and hip::copy_from_gpu operators to work with dynamic shapes. Allows offload_copy to be used with dynamic batch. Changes the assert in select_module because the argument might now be smaller, given how offload_copy works with dynamic batch (the maximum buffer size will be used).
-
Paul Fultz II authored
This fixes #1700
-
Paul Fultz II authored
-
- 21 Apr, 2023 1 commit
-
-
Umang Yadav authored
-
- 20 Apr, 2023 1 commit
-
-
Umang Yadav authored
Solves #1311
-
- 19 Apr, 2023 1 commit
-
-
shivadbhavsar authored
Expose the get_shape and get_operator methods of the instruction_ref object in the Python API.
-
- 18 Apr, 2023 2 commits
-
-
Paul Fultz II authored
This will show what is being replaced with a constant, which is useful for debugging where a literal comes from.
-
Ted Themistokleous authored
Ensure that we don't have empty inputs when computing shape for pointwise function
-
- 17 Apr, 2023 2 commits
-
-
Charlie Lin authored
Fixes the above behavior. This needs to be changed to allow setting static shapes with map_dyn_input_dims, since map_input_dims cannot be used at the same time.
-
shivadbhavsar authored
Expose the shape::type_t values for use by the Python API. This is required by torch_migraphx to support torchbench models.
-
- 14 Apr, 2023 1 commit
-
-
Paul authored
-
- 13 Apr, 2023 5 commits
-
-
Paul authored
-
Paul authored
-
Paul authored
-
Paul authored
-
Zhuoran Yin authored
-
- 12 Apr, 2023 1 commit
-
-
Paul Fultz II authored
-
- 11 Apr, 2023 1 commit
-
-
Paul Fultz II authored
-
- 10 Apr, 2023 2 commits
-
-
Umang Yadav authored
-
Charlie Lin authored
Adds a matcher to split_single_dyn_dim that finds every broadcast or multibroadcast instruction with two static-shape inputs and replaces it with the one-input version. Sorts the get_output_parameters() list to ensure the correct ordering (some models were producing an error).
-
- 09 Apr, 2023 1 commit
-
-
Paul Fultz II authored
* Enable hiprtc by default
-
- 08 Apr, 2023 5 commits
- 07 Apr, 2023 1 commit
-
-
Paul Fultz II authored
Converts can be inserted when the scales and input differ in the ONNX file (we are already doing this implicit conversion in the ref implementation). This will also improve the compile time of quantizelinear.hpp, since we can remove the nested visit method.
-
- 06 Apr, 2023 2 commits
-
-
Paul authored
-
Charlie Lin authored
Examples:
bin/driver verify /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --batch 3 --dyn-input-dim @data "[{min:1, max:4}, 3, 224, 224]"
bin/driver compile /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --default-dyn-dim "{min:1, max:10}" --output resnet50_batch1-10.mxr
bin/driver perf resnet50_batch1-10.mxr --batch 4
-