Unverified Commit f0181436 authored by Nicolas Patry's avatar Nicolas Patry Committed by GitHub
Browse files

fix(server): Fixing RW code (it's remote code so the Arch checking doesn't...

fix(server): Fixing RW code (it's remote code so the Arch checking doesn't work to see which weights to keep). (#579)

Fixes #555
parent b4024edd
...@@ -49,7 +49,13 @@ class FlashRWSharded(FlashCausalLM): ...@@ -49,7 +49,13 @@ class FlashRWSharded(FlashCausalLM):
torch.distributed.barrier(group=self.process_group) torch.distributed.barrier(group=self.process_group)
filenames = weight_files(model_id, revision=revision, extension=".safetensors") filenames = weight_files(model_id, revision=revision, extension=".safetensors")
weights = Weights(filenames, device, dtype, process_group=self.process_group) weights = Weights(
filenames,
device,
dtype,
process_group=self.process_group,
aliases={"transformer.word_embeddings.weight": ["lm_head.weight"]},
)
config.quantize = quantize config.quantize = quantize
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment