added llama_inference_pytorch
model/tokenize.py
0 → 100644
prompts.txt
0 → 100644
requirements.txt
0 → 100644
tensor_parallel == 1.2.2
transformers == 4.28.1
result.txt
0 → 100644
run-dialogue.sh
0 → 100644
run-tp.sh
0 → 100644
run.sh
0 → 100644
test_latency.py
0 → 100644
utils.py
0 → 100644