Stop using title frontmatter and fix doc that can only be reached by search (#20623)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Unverified commit b942c094, authored Jul 08, 2025 by Harry Mellor, committed by GitHub Jul 08, 2025
20 changed files
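
The change repeated across the files in this commit swaps a title-only frontmatter block for a level-1 heading. The following is a minimal sketch of that transformation, not the tooling used for this commit. The function name and the regular expression are hypothetical, and the blank line emitted after the heading is an assumption.

```python
import re

# Hypothetical pattern: a leading frontmatter block whose only key is `title`.
FRONTMATTER_TITLE = re.compile(r"\A---\n\s*title:\s*(?P<title>.+?)\s*\n---\n")


def title_frontmatter_to_heading(text: str) -> str:
    """Replace a title-only frontmatter block with a level-1 heading."""
    match = FRONTMATTER_TITLE.match(text)
    if match is None:
        # No title-only frontmatter at the top of the file; leave it untouched.
        return text
    # Drop optional surrounding quotes from the YAML value.
    title = match.group("title").strip("\"'")
    body = text[match.end():].lstrip("\n")
    return f"# {title}\n\n{body}"
```

Files whose frontmatter carries other keys besides `title` are deliberately left unchanged by this sketch, since those keys would be lost.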
--- a/docs/.nav.yml
+++ b/docs/.nav.yml
@@ -55,6 +55,7 @@ nav:
      - contributing/model/registration.md
      - contributing/model/tests.md
      - contributing/model/multimodal.md
+    - CI: contributing/ci
    - Design Documents:
      - V0: design
      - V1: design/v1

--- a/docs/community/contact_us.md
+++ b/docs/community/contact_us.md
----
-title: Contact Us
----
+# Contact Us
 --8<-- "README.md:contact-us"
--- a/docs/community/meetups.md
+++ b/docs/community/meetups.md
----
-title: Meetups
----
+# Meetups
 We host regular meetups in San Francisco Bay Area every 2 months. We will share the project updates from the vLLM team and have guest speakers from the industry to share their experience and insights. Please find the materials of our previous meetups below:

--- a/docs/configuration/engine_args.md
+++ b/docs/configuration/engine_args.md
----
-title: Engine Arguments
----
+# Engine Arguments
 Engine arguments control the behavior of the vLLM engine.

--- a/docs/configuration/serve_args.md
+++ b/docs/configuration/serve_args.md
----
-title: Server Arguments
----
+# Server Arguments
 The `vllm serve` command is used to launch the OpenAI-compatible server.

--- a/docs/contributing/benchmarks.md
+++ b/docs/contributing/benchmarks.md
----
-title: Benchmark Suites
----
+# Benchmark Suites
 vLLM contains two sets of benchmarks:

--- a/docs/contributing/ci-failures.md
+++ b/docs/contributing/ci-failures.md
--- a/docs/ci/update_pytorch_version.md
+++ b/docs/ci/update_pytorch_version.md
----
-title: Update PyTorch version on vLLM OSS CI/CD
----
+# Update PyTorch version on vLLM OSS CI/CD
 vLLM's current policy is to always use the latest PyTorch stable
 release in CI/CD. It is standard practice to submit a PR to update the

--- a/docs/contributing/model/README.md
+++ b/docs/contributing/model/README.md
----
-title: Summary
----
+# Summary
 !!! important
    Many decoder language models can now be automatically loaded using the [Transformers backend][transformers-backend] without having to implement them in vLLM. See if `vllm serve <model>` works first!

--- a/docs/contributing/model/basic.md
+++ b/docs/contributing/model/basic.md
----
-title: Basic Model
----
+# Basic Model
 This guide walks you through the steps to implement a basic vLLM model.

--- a/docs/contributing/model/multimodal.md
+++ b/docs/contributing/model/multimodal.md
----
-title: Multi-Modal Support
----
+# Multi-Modal Support
 This document walks you through the steps to extend a basic model so that it accepts [multi-modal inputs](../../features/multimodal_inputs.md).

--- a/docs/contributing/model/registration.md
+++ b/docs/contributing/model/registration.md
----
-title: Registering a Model
----
+# Registering a Model
 vLLM relies on a model registry to determine how to run each model.
 A list of pre-registered architectures can be found [here](../../models/supported_models.md).

--- a/docs/contributing/model/tests.md
+++ b/docs/contributing/model/tests.md
----
-title: Unit Testing
----
+# Unit Testing
 This page explains how to write unit tests to verify the implementation of your model.

--- a/docs/deployment/docker.md
+++ b/docs/deployment/docker.md
----
-title: Using Docker
----
+# Using Docker
 [](){ #deployment-docker-pre-built-image }

--- a/docs/deployment/frameworks/anyscale.md
+++ b/docs/deployment/frameworks/anyscale.md
----
-title: Anyscale
----
+# Anyscale
 [](){ #deployment-anyscale }
 [Anyscale](https://www.anyscale.com) is a managed, multi-cloud platform developed by the creators of Ray.

--- a/docs/deployment/frameworks/anything-llm.md
+++ b/docs/deployment/frameworks/anything-llm.md
----
-title: Anything LLM
----
+# Anything LLM
 [Anything LLM](https://github.com/Mintplex-Labs/anything-llm) is a full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as references during chatting.

--- a/docs/deployment/frameworks/autogen.md
+++ b/docs/deployment/frameworks/autogen.md
----
-title: AutoGen
----
+# AutoGen
 [AutoGen](https://github.com/microsoft/autogen) is a framework for creating multi-agent AI applications that can act autonomously or work alongside humans.

--- a/docs/deployment/frameworks/bentoml.md
+++ b/docs/deployment/frameworks/bentoml.md
----
-title: BentoML
----
+# BentoML
 [BentoML](https://github.com/bentoml/BentoML) allows you to deploy a large language model (LLM) server with vLLM as the backend, which exposes OpenAI-compatible endpoints. You can serve the model locally or containerize it as an OCI-compliant image and deploy it on Kubernetes.

--- a/docs/deployment/frameworks/cerebrium.md
+++ b/docs/deployment/frameworks/cerebrium.md
----
-title: Cerebrium
----
+# Cerebrium
 <p align="center">
    <img src="https://i.ibb.co/hHcScTT/Screenshot-2024-06-13-at-10-14-54.png" alt="vLLM_plus_cerebrium"/>

--- a/docs/deployment/frameworks/chatbox.md
+++ b/docs/deployment/frameworks/chatbox.md
----
-title: Chatbox
----
+# Chatbox
 [Chatbox](https://github.com/chatboxai/chatbox) is a desktop client for LLMs, available on Windows, Mac, Linux.