* disable cache hint for CUDA < 11.4
* fix lint
* fix lint
* fix cuda-11.3 build
* build turbomind
* change namespace fastertransformer to turbomind
* change logger name
* add ft code
* gitignore
* fix lint
* revert fmha