"src/diffusers/pipelines/pag/pipeline_pag_sd.py" did not exist on "6b04d61cf6c105de9f2530b5bfca2d65fc9e29d7"
  • Parth Sareen's avatar
    sample: improve ollama engine sampler performance (#9374) · 0682dae0
    Parth Sareen authored
    This change bring in various interface cleanups along with greatly improving the performance of the sampler.
    
    Tested with llama3.2 on local machine.
    Improves performance from ~ 70 tokens/s -> 135 tokens/s with topK(40) enabled.
    Without topK performance is ~ 110 tokens/s
    0682dae0
transforms.go 4.36 KB