• Graham King's avatar
    feat: Support larger Gemma 3 models (#1359) · cfd12d7f
    Graham King authored
    Publish `generation_config.json` from worker to ingress, as part of Model Deployment Card. That allows ingress to read key fields out of it. Gemma 3 4B+ has some important information that's only in there.
    cfd12d7f
lib.rs 8.42 KB