OpenDAS / vllm_cscc
Commit a86754a1 (Unverified)
Authored Jul 12, 2025 by Reid; committed by GitHub on Jul 12, 2025
[docs] convert supported configs to table (#20858)
Signed-off-by: reidliu41 <reid201711@gmail.com>
Parent: c2a2f19a
Showing 1 changed file with 14 additions and 30 deletions (+14, -30)

docs/getting_started/installation/intel_gaudi.md @ a86754a1
@@ -133,36 +133,20 @@ docker run \
 The following configurations have been validated to function with
 Gaudi2 devices. Configurations that are not listed may or may not work.

-- [meta-llama/Llama-2-7b](https://huggingface.co/meta-llama/Llama-2-7b)
-  on single HPU, or with tensor parallelism on 2x and 8x HPU, BF16
-  datatype with random or greedy sampling
-- [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf)
-  on single HPU, or with tensor parallelism on 2x and 8x HPU, BF16
-  datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B)
-  on single HPU, or with tensor parallelism on 2x and 8x HPU, BF16
-  datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
-  on single HPU, or with tensor parallelism on 2x and 8x HPU, BF16
-  datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B)
-  on single HPU, or with tensor parallelism on 2x and 8x HPU, BF16
-  datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct)
-  on single HPU, or with tensor parallelism on 2x and 8x HPU, BF16
-  datatype with random or greedy sampling
-- [meta-llama/Llama-2-70b](https://huggingface.co/meta-llama/Llama-2-70b)
-  with tensor parallelism on 8x HPU, BF16 datatype with random or greedy sampling
-- [meta-llama/Llama-2-70b-chat-hf](https://huggingface.co/meta-llama/Llama-2-70b-chat-hf)
-  with tensor parallelism on 8x HPU, BF16 datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3-70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B)
-  with tensor parallelism on 8x HPU, BF16 datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct)
-  with tensor parallelism on 8x HPU, BF16 datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3.1-70B](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B)
-  with tensor parallelism on 8x HPU, BF16 datatype with random or greedy sampling
-- [meta-llama/Meta-Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct)
-  with tensor parallelism on 8x HPU, BF16 datatype with random or greedy sampling
+| Model | TP Size| dtype | Sampling |
+|-------|--------|--------|----------|
+| [meta-llama/Llama-2-7b](https://huggingface.co/meta-llama/Llama-2-7b) | 1, 2, 8 | BF16 | Random / Greedy |
+| [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) | 1, 2, 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) | 1, 2, 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) | 1, 2, 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) | 1, 2, 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) | 1, 2, 8 | BF16 | Random / Greedy |
+| [meta-llama/Llama-2-70b](https://huggingface.co/meta-llama/Llama-2-70b) | 8 | BF16 | Random / Greedy |
+| [meta-llama/Llama-2-70b-chat-hf](https://huggingface.co/meta-llama/Llama-2-70b-chat-hf) | 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3-70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B) | 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) | 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3.1-70B](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B) | 8 | BF16 | Random / Greedy |
+| [meta-llama/Meta-Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) | 8 | BF16 | Random / Greedy |

 ## Performance tuning
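
Review note: the validated Gaudi2 configurations in this diff can also be read as a lookup from model name to tensor-parallel (TP) sizes. The sketch below is illustrative only; `VALIDATED_TP_SIZES` and `is_validated` are hypothetical names that do not exist in vLLM, and the data is transcribed from the table in this commit under the assumption that "BF16" corresponds to the `bfloat16` dtype string.

```python
# Hypothetical helper, not part of vLLM: encodes the validated Gaudi2
# configurations listed in this commit's table.
SMALL_MODEL_TP = (1, 2, 8)  # 7B/8B models: single HPU, or TP on 2x and 8x HPU
LARGE_MODEL_TP = (8,)       # 70B models: TP on 8x HPU only

VALIDATED_TP_SIZES = {
    "meta-llama/Llama-2-7b": SMALL_MODEL_TP,
    "meta-llama/Llama-2-7b-chat-hf": SMALL_MODEL_TP,
    "meta-llama/Meta-Llama-3-8B": SMALL_MODEL_TP,
    "meta-llama/Meta-Llama-3-8B-Instruct": SMALL_MODEL_TP,
    "meta-llama/Meta-Llama-3.1-8B": SMALL_MODEL_TP,
    "meta-llama/Meta-Llama-3.1-8B-Instruct": SMALL_MODEL_TP,
    "meta-llama/Llama-2-70b": LARGE_MODEL_TP,
    "meta-llama/Llama-2-70b-chat-hf": LARGE_MODEL_TP,
    "meta-llama/Meta-Llama-3-70B": LARGE_MODEL_TP,
    "meta-llama/Meta-Llama-3-70B-Instruct": LARGE_MODEL_TP,
    "meta-llama/Meta-Llama-3.1-70B": LARGE_MODEL_TP,
    "meta-llama/Meta-Llama-3.1-70B-Instruct": LARGE_MODEL_TP,
}


def is_validated(model: str, tp_size: int, dtype: str = "bfloat16") -> bool:
    """Return True if (model, tp_size, dtype) appears in the validated table.

    Only BF16 is listed as validated; any other dtype returns False.
    Unlisted models return False, although the document notes that
    unlisted configurations "may or may not work".
    """
    if dtype.lower() not in ("bf16", "bfloat16"):
        return False
    return tp_size in VALIDATED_TP_SIZES.get(model, ())


if __name__ == "__main__":
    print(is_validated("meta-llama/Meta-Llama-3.1-8B-Instruct", 2))
    print(is_validated("meta-llama/Meta-Llama-3.1-70B-Instruct", 2))
```

A `False` result means only that the combination is not in the validated list, not that it is known to fail.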