Unverified commit 7be7f382, authored by Harry Mellor, committed by GitHub

[Docs] Improve API docs (+small tweaks) (#22459)


Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
parent ccdae737
@@ -58,10 +58,9 @@ nav:
  - CI: contributing/ci
  - Design Documents: design
  - API Reference:
- - Summary: api/README.md
+ - Summary: api/summary.md
  - Contents:
- - glob: api/vllm/*
-   preserve_directory_names: true
+ - api/vllm/*
  - CLI Reference:
  - Summary: cli/README.md
  - Community:
......
- ---
- title: FP8 INC
- ---
- [](){ #inc }
+ # FP8 INC
  vLLM supports FP8 (8-bit floating point) weight and activation quantization using Intel® Neural Compressor (INC) on Intel® Gaudi® 2 and Intel® Gaudi® 3 AI accelerators.
  Currently, quantization is validated only in Llama models.
......
@@ -105,7 +105,7 @@ class Example:
          return fix_case(self.path.stem.replace("_", " ").title())

      def generate(self) -> str:
-         content = f"---\ntitle: {self.title}\n---\n\n"
+         content = f"# {self.title}\n\n"
          content += f"Source <gh-file:{self.path.relative_to(ROOT_DIR)}>.\n\n"

          # Use long code fence to avoid issues with
......
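The change to `generate` above swaps a YAML front-matter title for a Markdown H1 heading. The following is a minimal sketch of the difference in output, not the script itself: `ExampleSketch`, `generate_old`, `generate_new`, the `/repo` root, and the `basic_chat.py` path are hypothetical names introduced for illustration, and the `fix_case` call from the original is left out.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath

# Hypothetical stand-in for the script's ROOT_DIR constant.
ROOT_DIR = PurePosixPath("/repo")


@dataclass
class ExampleSketch:
    """Hypothetical, trimmed-down stand-in for the Example class in the diff."""

    path: PurePosixPath

    @property
    def title(self) -> str:
        # The original also passes this through fix_case(); omitted here.
        return self.path.stem.replace("_", " ").title()

    def _source_line(self) -> str:
        # Same "Source <gh-file:...>" line that both versions append.
        return f"Source <gh-file:{self.path.relative_to(ROOT_DIR)}>.\n\n"

    def generate_old(self) -> str:
        # Before the commit: the title is carried in YAML front matter.
        return f"---\ntitle: {self.title}\n---\n\n" + self._source_line()

    def generate_new(self) -> str:
        # After the commit: the title is a Markdown H1 heading.
        return f"# {self.title}\n\n" + self._source_line()


example = ExampleSketch(ROOT_DIR / "examples" / "basic_chat.py")
print(example.generate_old())
print(example.generate_new())
```

With the heading form, the page title is part of the rendered Markdown body rather than page metadata, which matches the same front-matter-to-heading change made to the FP8 INC page in this commit.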
@@ -40,6 +40,7 @@ theme:
  - navigation.sections
  - navigation.prune
  - navigation.top
+ - navigation.indexes
  - search.highlight
  - search.share
  - toc.follow
@@ -51,11 +52,6 @@ hooks:
  - docs/mkdocs/hooks/generate_argparse.py
  - docs/mkdocs/hooks/url_schemes.py
- # Required to stop api-autonav from raising an error
- # https://github.com/tlambert03/mkdocs-api-autonav/issues/16
- nav:
- - api
  plugins:
  - meta
  - search
......