1. 01 Feb, 2026 4 commits
    • one's avatar
      Enhance profiling capabilities in evo2 scripts · c647fd9a
      one authored
      - Update run.sh to include trace logging options with gzip support.
      - Modify test_evo2_generation_batched.py to add command-line arguments for trace log directory and gzip option.
      - Refactor custom trace handler to utilize gzip compression for trace outputs.
      c647fd9a
    • one's avatar
      Update gemv benchnmark scripts · 3bb2e7a5
      one authored
      - Remove gemv_export.cpp
      - Update Makefile and README for compiler variable changes
      - Adjust run-all.sh for consistent build commands
      3bb2e7a5
    • one's avatar
      Add kernel launch overhead benchmark and associated build scripts · 0fe0b01f
      one authored
      - Introduce kernel_launch_overhead.cu to measure kernel launch latency, system throughput, CPU dispatch overhead, and GPU dispatch time.
      - Create Makefile for building the benchmark with support for nvcc and hipcc.
      - Add run-all.sh script to execute the benchmark with specified device settings.
      0fe0b01f
    • one's avatar
      Add trace fix script and refactor evo2 launch scripts · 65bf476e
      one authored
      - Add fix-pt-trace.sh for repairing non-UTF-8 traces.
      - Remove deprecated run-rocblas.sh.
      - Update trace handler (worker names) and tune GPU bindings in run-all.sh.
      65bf476e
  2. 31 Jan, 2026 2 commits
  3. 30 Jan, 2026 3 commits
  4. 29 Jan, 2026 3 commits
  5. 28 Jan, 2026 1 commit
  6. 27 Jan, 2026 1 commit
  7. 26 Jan, 2026 4 commits