1. 28 Nov, 2025 1 commit
  2. 25 Nov, 2025 1 commit
  3. 21 Nov, 2025 1 commit
  4. 17 Nov, 2025 1 commit
    • Jae-Won Chung's avatar
      [Metrics] Fix KV cache usage percent metric multiproc (#28792) · d4acf518
      Jae-Won Chung authored
      
      
      The `vllm:kv_cache_usage_perc` Gauge metric is missing `multiprocess_mode="mostrecent"` and ends up returning
      
      ```
      vllm:kv_cache_usage_perc{engine="0",model_name="Qwen/Qwen3-VL-8B-Instruct",pid="277"} 0.0
      vllm:kv_cache_usage_perc{engine="0",model_name="Qwen/Qwen3-VL-8B-Instruct",pid="275"} 0.0
      vllm:kv_cache_usage_perc{engine="0",model_name="Qwen/Qwen3-VL-8B-Instruct",pid="273"} 0.6530455880475035
      ...
      ```
      
      The deprecated `vllm:gpu_cache_usage_perc` Gauge metric has `multiprocess_mode="mostrecent"`.
      Signed-off-by: default avatarJae-Won Chung <jwnchung@umich.edu>
      d4acf518
  5. 14 Nov, 2025 1 commit
  6. 10 Nov, 2025 1 commit
  7. 05 Nov, 2025 1 commit
  8. 04 Nov, 2025 1 commit
  9. 30 Oct, 2025 1 commit
  10. 29 Oct, 2025 2 commits
  11. 24 Oct, 2025 1 commit
  12. 23 Oct, 2025 1 commit
  13. 18 Oct, 2025 1 commit
  14. 14 Oct, 2025 1 commit
  15. 12 Oct, 2025 1 commit
  16. 10 Oct, 2025 2 commits
  17. 05 Oct, 2025 1 commit
  18. 03 Oct, 2025 1 commit
  19. 25 Sep, 2025 1 commit
  20. 24 Sep, 2025 1 commit
  21. 19 Sep, 2025 1 commit
  22. 16 Sep, 2025 1 commit
  23. 14 Sep, 2025 1 commit
  24. 08 Sep, 2025 1 commit
  25. 04 Sep, 2025 1 commit
  26. 02 Sep, 2025 1 commit
  27. 27 Jul, 2025 1 commit
  28. 21 Jul, 2025 1 commit
  29. 19 Jun, 2025 1 commit
  30. 14 Jun, 2025 1 commit
  31. 03 Jun, 2025 1 commit
  32. 30 May, 2025 1 commit
  33. 27 May, 2025 1 commit
  34. 16 May, 2025 1 commit
  35. 12 May, 2025 1 commit
  36. 10 May, 2025 1 commit
  37. 30 Apr, 2025 1 commit
  38. 27 Apr, 2025 1 commit