- 08 Jun, 2023 1 commit
Nicolas Patry authored
# What does this PR do?

Reworked the loading logic. The idea is to use cleaner loading code:

- Remove the need for `no_init_weights`
- Remove all the weird `bnb_linear`, `load_weights` and `post_load_weights` code.

New code layout:

- New class `Weights` in charge of loading the weights from multiple files into the appropriate tensors (potentially sharded)
- TP layers are now "shells": they contain the code that knows what kind of sharding is needed, plus an eventual `all_reduce`. They do not inherit from linear; they contain some kind of Linear instead
- The contained linear can be either FastLinear, BnbLinear or, next, GPTQ Linear.
- All modeling code is explicitly written for sharding; the process group is just a no-op for non-sharded code (removes a lot of test cases)

---------

Co-authored-by: Ubuntu <ubuntu@ip-172-31-41-161.taildb5d.ts.net>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-41-161.ec2.internal>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
Co-authored-by: OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
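
The layout described in this commit message can be illustrated with a small, self-contained sketch. It is not the code from the PR: plain Python lists stand in for tensors, in-memory dicts stand in for checkpoint files, and the ranks are simulated in a loop instead of running in separate processes. The names `Weights` and `FastLinear` come from the message, but their bodies here are assumptions, and `get_sharded`, `TensorParallelRowLinear`, `forward_partial`, `all_reduce_sum` and `run` are hypothetical names introduced only for illustration.

```python
from typing import Dict, List

Matrix = List[List[float]]


class Weights:
    """Finds a tensor by name across several files and returns this rank's shard."""

    def __init__(self, files: List[Dict[str, Matrix]], rank: int, world_size: int):
        # Route every tensor name to the (fake, in-memory) file that holds it.
        self.routing = {name: f for f in files for name in f}
        self.rank = rank
        self.world_size = world_size

    def get_sharded(self, name: str, dim: int) -> Matrix:
        # Hypothetical helper: slice the tensor evenly along `dim` for this rank.
        tensor = self.routing[name][name]
        size = len(tensor) if dim == 0 else len(tensor[0])
        assert size % self.world_size == 0, "dimension must divide evenly"
        block = size // self.world_size
        start, stop = self.rank * block, (self.rank + 1) * block
        if dim == 0:
            return tensor[start:stop]
        return [row[start:stop] for row in tensor]


class FastLinear:
    """Plain linear layer without bias: y = W x."""

    def __init__(self, weight: Matrix):
        self.weight = weight

    def forward(self, x: List[float]) -> List[float]:
        return [sum(w * v for w, v in zip(row, x)) for row in self.weight]


class TensorParallelRowLinear:
    """A "shell": knows the sharding scheme and holds a linear instead of being one."""

    def __init__(self, linear: FastLinear, rank: int, world_size: int):
        self.linear = linear
        self.rank = rank
        self.world_size = world_size

    @classmethod
    def load(cls, name: str, weights: Weights) -> "TensorParallelRowLinear":
        # Row-parallel: split the input dimension, i.e. the columns of W.
        shard = weights.get_sharded(name, dim=1)
        return cls(FastLinear(shard), weights.rank, weights.world_size)

    def forward_partial(self, x: List[float]) -> List[float]:
        # Each rank multiplies its weight shard by the matching slice of x.
        block = len(x) // self.world_size
        x_shard = x[self.rank * block:(self.rank + 1) * block]
        return self.linear.forward(x_shard)


def all_reduce_sum(partials: List[List[float]]) -> List[float]:
    # Simulated all_reduce: element-wise sum of every rank's partial output.
    # With one rank this returns the single partial unchanged.
    return [sum(values) for values in zip(*partials)]


def run(world_size: int) -> List[float]:
    # Two fake "files"; only the first holds the tensor we need.
    files = [
        {"proj.weight": [[1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0, 8.0]]},
        {"other.weight": [[0.0]]},
    ]
    x = [1.0, 1.0, 2.0, 2.0]
    partials = []
    for rank in range(world_size):
        weights = Weights(files, rank, world_size)
        layer = TensorParallelRowLinear.load("proj.weight", weights)
        partials.append(layer.forward_partial(x))
    return all_reduce_sum(partials)
```

Calling `run(1)`, `run(2)` and `run(4)` gives the same result, which is the point of the design: the same modeling code serves the sharded and the non-sharded case, and with a single rank the reduction leaves the output unchanged.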
- 16 May, 2023 1 commit
OlivierDehaene authored
Fixes #333

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
- 20 Apr, 2023 1 commit
OlivierDehaene authored
- 19 Apr, 2023 1 commit
OlivierDehaene authored
- 16 Apr, 2023 1 commit
OlivierDehaene authored
- 09 Apr, 2023 1 commit
OlivierDehaene authored
- 27 Mar, 2023 1 commit
Nick Hill authored
Generates .pyi files for protobuf stubs which provide strong typing information. Very helpful for IDE auto-completion, etc.
- 24 Mar, 2023 1 commit
OlivierDehaene authored
- 15 Mar, 2023 1 commit
OlivierDehaene authored
- 13 Mar, 2023 1 commit
OlivierDehaene authored
- 07 Mar, 2023 1 commit
OlivierDehaene authored
- 03 Mar, 2023 2 commits
OlivierDehaene authored
-
OlivierDehaene authored
- 13 Feb, 2023 1 commit
OlivierDehaene authored
- 24 Jan, 2023 1 commit
OlivierDehaene authored
- 08 Dec, 2022 1 commit
OlivierDehaene authored
- 01 Dec, 2022 1 commit
OlivierDehaene authored
- 08 Nov, 2022 1 commit
OlivierDehaene authored
- 07 Nov, 2022 1 commit
OlivierDehaene authored
- 03 Nov, 2022 1 commit
OlivierDehaene authored
- 28 Oct, 2022 1 commit
OlivierDehaene authored
- 22 Oct, 2022 1 commit
Nicolas Patry authored
Co-authored-by: OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
- 20 Oct, 2022 1 commit
Olivier Dehaene authored
- 08 Oct, 2022 1 commit
Olivier Dehaene authored