* add half support
* add cpu implementation
* fix bugs, load with inline asm
* better vector load
* add comments