--fill0 [std::vector<std::string>]

Fill parameter with 0s

--fill1 [std::vector<std::string>]

Fill parameter with 1s

--gpu

Compile on the gpu

--cpu

Compile on the cpu

--ref

Compile on the reference implementation

--enable-offload-copy

Enable implicit offload copying

--disable-fast-math

Disable fast math optimization

--fp16

Quantize for fp16

--int8

Quantize for int8