## Motivation ## Modifications ## Checklist - [ ] Code is formatted using Pre-Commit hooks. - [ ] Relevant unit tests are added in the [`tests`](../tests) directory following the guidance in [`tests/README.md`](../tests/README.md). - [ ] [README](../README.md) and example scripts in [`examples`](../examples) are updated if necessary. - [ ] Throughput/latency benchmarks and quality evaluations are included where applicable. - [ ] **For reviewers:** If you're only helping merge the main branch and haven't contributed code to this PR, please remove yourself as a co-author when merging. - [ ] Please feel free to join our [Slack](https://join.slack.com/t/nunchaku/shared_invite/zt-3170agzoz-NgZzWaTrEj~n2KEV3Hpl5Q), [Discord](https://discord.gg/Wk6PnwX9Sm) or [WeChat](https://github.com/mit-han-lab/nunchaku/blob/main/assets/wechat.jpg) to discuss your PR.