Unverified Commit 219bb5b8 authored by Matthew Bonanni's avatar Matthew Bonanni Committed by GitHub
Browse files

[Misc] Update `committers.md` (#40058)


Signed-off-by: default avatarMatthew Bonanni <mbonanni@redhat.com>
parent ad2b1277
...@@ -31,6 +31,7 @@ Sorted alphabetically by GitHub handle: ...@@ -31,6 +31,7 @@ Sorted alphabetically by GitHub handle:
- [@LucasWilkinson](https://github.com/LucasWilkinson): Kernels and performance - [@LucasWilkinson](https://github.com/LucasWilkinson): Kernels and performance
- [@luccafong](https://github.com/luccafong): Llama models, speculative decoding, distributed - [@luccafong](https://github.com/luccafong): Llama models, speculative decoding, distributed
- [@markmc](https://github.com/markmc): Observability - [@markmc](https://github.com/markmc): Observability
- [@MatthewBonanni](https://github.com/MatthewBonanni): Kernels and performance
- [@mgoin](https://github.com/mgoin): Quantization and performance - [@mgoin](https://github.com/mgoin): Quantization and performance
- [@NickLucche](https://github.com/NickLucche): KV connector - [@NickLucche](https://github.com/NickLucche): KV connector
- [@njhill](https://github.com/njhill): Distributed, API server, engine core - [@njhill](https://github.com/njhill): Distributed, API server, engine core
...@@ -87,7 +88,7 @@ If you have PRs touching the area, please feel free to ping the area owner for r ...@@ -87,7 +88,7 @@ If you have PRs touching the area, please feel free to ping the area owner for r
- AsyncLLM: the zmq based protocol hosting engine core and making it accessible for entrypoints - AsyncLLM: the zmq based protocol hosting engine core and making it accessible for entrypoints
- @robertgshaw2-redhat, @njhill, @russellb - @robertgshaw2-redhat, @njhill, @russellb
- ModelRunner, Executor, Worker: the abstractions for engine wrapping model implementation - ModelRunner, Executor, Worker: the abstractions for engine wrapping model implementation
- @WoosukKwon, @tlrmchlsmth, @heheda12345, @LucasWilkinson, @ProExpertProg - @WoosukKwon, @tlrmchlsmth, @heheda12345, @LucasWilkinson, @ProExpertProg, @MatthewBonanni
- KV Connector: Connector interface and implementation for KV cache offload and transfer - KV Connector: Connector interface and implementation for KV cache offload and transfer
- @robertgshaw2-redhat, @njhill, @KuntaiDu, @NickLucche, @ApostaC - @robertgshaw2-redhat, @njhill, @KuntaiDu, @NickLucche, @ApostaC
- Distributed, Parallelism, Process Management: Process launchers managing each worker, and assign them to the right DP/TP/PP/EP ranks - Distributed, Parallelism, Process Management: Process launchers managing each worker, and assign them to the right DP/TP/PP/EP ranks
...@@ -106,7 +107,7 @@ If you have PRs touching the area, please feel free to ping the area owner for r ...@@ -106,7 +107,7 @@ If you have PRs touching the area, please feel free to ping the area owner for r
- Custom Layers: Utility layers in vLLM such as rotary embedding and rms norms - Custom Layers: Utility layers in vLLM such as rotary embedding and rms norms
- @ProExpertProg - @ProExpertProg
- Attention: Attention interface for paged attention - Attention: Attention interface for paged attention
- @WoosukKwon, @LucasWilkinson, @heheda12345 - @WoosukKwon, @LucasWilkinson, @heheda12345, @MatthewBonanni
- FusedMoE: FusedMoE kernel, Modular kernel framework, EPLB - FusedMoE: FusedMoE kernel, Modular kernel framework, EPLB
- @tlrmchlsmth - @tlrmchlsmth
- Quantization: Various quantization config, weight loading, and kernel. - Quantization: Various quantization config, weight loading, and kernel.
...@@ -134,7 +135,7 @@ If you have PRs touching the area, please feel free to ping the area owner for r ...@@ -134,7 +135,7 @@ If you have PRs touching the area, please feel free to ping the area owner for r
### Features ### Features
- Spec Decode: Covers model definition, attention, sampler, and scheduler related to n-grams, EAGLE, and MTP. - Spec Decode: Covers model definition, attention, sampler, and scheduler related to n-grams, EAGLE, and MTP.
- @WoosukKwon, @benchislett, @luccafong - @WoosukKwon, @benchislett, @luccafong, @MatthewBonanni
- Structured Output: The structured output implementation - Structured Output: The structured output implementation
- @russellb, @aarnphm - @russellb, @aarnphm
- RL: The RL related features such as collective rpc, sleep mode, etc. - RL: The RL related features such as collective rpc, sleep mode, etc.
...@@ -154,8 +155,8 @@ If you have PRs touching the area, please feel free to ping the area owner for r ...@@ -154,8 +155,8 @@ If you have PRs touching the area, please feel free to ping the area owner for r
### External Kernels Integration ### External Kernels Integration
- FlashAttention: @LucasWilkinson - FlashAttention: @LucasWilkinson, @MatthewBonanni
- FlashInfer: @LucasWilkinson, @mgoin, @WoosukKwon - FlashInfer: @LucasWilkinson, @mgoin, @WoosukKwon, @MatthewBonanni
- Blackwell Kernels: @mgoin, @yewentao256 - Blackwell Kernels: @mgoin, @yewentao256
- DeepEP/DeepGEMM: @mgoin, @yewentao256 - DeepEP/DeepGEMM: @mgoin, @yewentao256
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment