README.md 1.56 KB
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
# LightOp Kernel Knowledge

This directory backs the `lightop-kernel-knowledge` skill. For LightOp/DCU
work, use evidence in this order:

1. Local LightOp source, wrappers, bindings, config tables, tests, and
   benchmarks.
2. ROCm/DCU official docs and upstream source: SourceFind DCU/DTK docs,
   ROCm/HIP, MIOpen, rocBLAS, hipBLASLt, Composable Kernel, Triton AMD,
   PyTorch ROCm, SGLang/vLLM AMD paths, plus bundled MR evidence from
   SourceFind LightOp and DCU Toolkit flash-attention-cutlass.
3. The bundled CUDA-oriented PR corpus, only as cross-platform inspiration
   after translating and validating the idea on DCU.

The existing corpus is still available:

- PR pages under `sources/prs/`
- PR evidence bundles under `evidence/pull-bundles/`
- Candidate ledgers under `candidates/`
- Metadata under `data/`
- Query and materialization scripts under `scripts/`

Run local PR queries from this directory:

```bash
python3 scripts/query.py "<keywords>" --compact --limit 30
python3 scripts/query.py "lightop dcu <operator>" --repo sourcefind-lightop --compact --limit 20
python3 scripts/query.py "flash attention dcu" --repo flash-attention-cutlass --compact --limit 20
python3 scripts/search-pr-diffs.py <term1> <term2> [--any] [--limit 100]
python3 scripts/get_page.py <pr-page-id>
```

When CUDA evidence shapes a LightOp implementation, record that it is
inspiration, cite the source path/URL and commit/version, and explain the DCU
translation: HIP API, CU/wavefront/LDS/MFMA/resource/config differences.

Validate the bundled corpus with:

```bash
python3 scripts/validate.py
```