    [Bugfix] Fix compilation issues for amd cdna element size check (#364) · d627fd58
    Lei Wang authored
    * [Refactor] Update AutoTuner run method and timeout handling
    
    - Modified the `run` method to reduce the default timeout from 100 to 30 seconds for improved responsiveness.
    - Changed the `get_input_tensors_supply` call to disable output generation, enhancing performance during tensor supply retrieval.
    - Refactored the latency measurement to streamline the benchmarking process, ensuring proper timeout handling with `ThreadPoolExecutor`.
    - Added logging for timeout occurrences to aid in debugging and performance analysis.
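
The timeout handling described in the bullets above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the names `run_with_timeout` and `benchmark` are hypothetical, and only the 30-second default, the use of `ThreadPoolExecutor`, and the logging on timeout are taken from the commit message.

```python
import logging
import time
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as FutureTimeoutError

logger = logging.getLogger(__name__)


def run_with_timeout(benchmark, timeout=30.0):
    """Run `benchmark()` in a worker thread (hypothetical helper).

    Returns the benchmark's result, or None if no result arrives
    within `timeout` seconds.
    """
    executor = ThreadPoolExecutor(max_workers=1)
    future = executor.submit(benchmark)
    try:
        # Block for at most `timeout` seconds waiting for the result.
        return future.result(timeout=timeout)
    except FutureTimeoutError:
        # Record the timeout so slow configurations can be traced later.
        logger.warning("Benchmark timed out after %.1f s", timeout)
        return None
    finally:
        # Do not wait for a worker that may still be running.
        executor.shutdown(wait=False)
```

A caller could then treat `None` as "skip this configuration", for example `latency = run_with_timeout(lambda: measure(cfg))`, where `measure` and `cfg` are likewise placeholders. Note that a timed-out worker thread is not killed; it keeps running in the background until the benchmark returns.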
    
    * bug fix
    
    * lint fix
gemm.cc 11 KB