"description":"Claude Code marketplace for the Humanize LightOp/DCU operator development and optimization loop.",
"description":"Claude Code marketplace for Humanize DCU kernel development loops, including LightOp HIP/C++ and Triton vLLM/SGLang or direct-file optimization.",
"plugins":[
"plugins":[
{
{
"name":"humanize",
"name":"humanize",
"source":"./humanize",
"source":"./humanize",
"description":"Humanize plus LightOp/DCU operator workflows, dcu-profiler-report profiling, and review-gated iteration.",
"description":"Humanize plus LightOp/DCU and Triton/DCU kernel workflows for frameworks or direct files, profiler digests, and review-gated iteration.",
Chinese version: [`triton-skills.zh-CN.md`](triton-skills.zh-CN.md).
KernelPilot includes an independent Triton/DCU skill pack for optimizing Triton
kernels inside vLLM, inside SGLang, or in a user-specified Triton Python file.
It is separate from the LightOp skill pack and uses its own task state
directory:
```text
.humanize/triton-agent/
```
## Skills
| Skill | Purpose |
| --- | --- |
| `triton-kernel-agent-loop` | Main loop for vLLM/SGLang Triton attention, MLA, MoE, quantization, fused norm, cache, sampler, routing, small JIT kernels, and direct Triton files on DCU/ROCm. |
| `triton-kernel-knowledge` | Evidence search for local vLLM/SGLang source, direct-file call sites and harnesses, KernelPilot PR corpus, Triton/ROCm/DTK/DCU docs, and portable cross-platform ideas. |
| `triton-dcu-profiler-report` | DCU profiler digest for framework or standalone Triton JIT kernels, including backend/call-site proof, hipprof/rocprofv3/rocprof-compute evidence, Triton cache/IR dumps, and AMDGPU ISA/code-object clues. |
## Open Kernel References
The Triton knowledge route includes source-reference pages for high-value open
Triton kernel libraries:
```text
ref-rocm-aiter
ref-rocm-aotriton
ref-stackav-conch
ref-flaggems
ref-liger-kernel
ref-huggingface-kernels
ref-triton-distributed
```
Use them as reference implementations or discovery routes. Before promoting
any idea, verify locally its correctness, benchmark results, profiler kernel
names, Triton cache/IR dumps, and DCU ISA.
## Install
The standard installers include both the LightOp and the Triton skill packs:
"description":"Humanize packaged with LightOp/DCU operator-loop skills for adding, optimizing, benchmarking, and profiling fused ROCm operators with review-gated iteration.",
"description":"Humanize packaged with KernelPilot DCU skills for LightOp HIP/C++ operators and Triton kernels in vLLM, SGLang, or direct Python files, with benchmark/profile evidence and review-gated iteration.",
{"version":"v001","parent":"v000","hypothesis":"tune BLOCK_N for decode","correct":true,"decision":"keep","artifact":".humanize/triton-agent/profile-artifacts/v001/digest.md"}
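A record like the one above can be read with any JSON Lines parser. The sketch below is illustrative and not part of the skill pack: the helper names `parse_ledger` and `kept_versions` are hypothetical, and the field meanings (`correct`, `decision`) are inferred from the example record rather than taken from a documented schema.

```python
import json

# Example record copied verbatim from the text above. Treating it as one line
# of a JSON Lines ledger is an assumption made for this sketch.
LEDGER_LINE = (
    '{"version":"v001","parent":"v000","hypothesis":"tune BLOCK_N for decode",'
    '"correct":true,"decision":"keep",'
    '"artifact":".humanize/triton-agent/profile-artifacts/v001/digest.md"}'
)


def parse_ledger(lines):
    """Parse an iterable of JSON Lines strings, skipping blank lines."""
    records = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        records.append(json.loads(line))
    return records


def kept_versions(records):
    """Return versions that passed correctness and were marked 'keep'.

    Assumes 'correct' is a boolean and 'decision' is a string, as in the
    example record; other decision values are not known from the source.
    """
    return [
        r["version"]
        for r in records
        if r.get("correct") is True and r.get("decision") == "keep"
    ]


records = parse_ledger([LEDGER_LINE])
kept = kept_versions(records)
print(kept)
```

Filtering on both `correct` and `decision` keeps the check conservative: a version is only treated as promoted when the record says it was both validated and retained.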