    Refactor f8_t, add bf8_t (#792) · 62d4af74
    Rostyslav Geyyer authored
    * Refactor f8_t to add bf8_t
    
    * Add check_err impl for f8_t
    
    * Update fp8 test
    
    * Format
    
    * Revert the fix
    
    * Update vector_type implementation
    
    * Add bf8 test
    
    * Add bf8, use BitInt types
    
    * Add bf8 conversion methods
    
    * Update type_convert for fp8/bf8
    
    * Add check_err fp8/bf8 support
    
    * Add subnorm fp8 tests
    
    * Add subnorm bf8 tests
    
    * Fix conversion
    
    * Add bf8 cmake bindings
    
    * Add macros to enable build with disabled fp8/bf8
    
    * Remove is_native method
    
    * Update flag combination for mixed precision instances
    
    * Add more flag checks
    
    * Add another flag to a client example
    
    * Add type traits, decouple f8/bf8 casting
    
    * Clean up
    
    * Decouple fp8 and bf8 flags
    
    * Remove more redundant flags
    
    * Remove leftover comments
profile_gemm_splitk.cpp 7 KB