    Add RequestScopedPipeline for safe concurrent inference, tokenizer lock and... · eda9ff83
    Fredy authored
    
    Add RequestScopedPipeline for safe concurrent inference, tokenizer lock and non-mutating retrieve_timesteps (#12328)
    
    * Basic implementation of request scheduling
    
    * Basic editing in SD and Flux Pipelines
    
    * Small Fix
    
    * Fix
    
    * Update for more pipelines
    
    * Add examples/server-async
    
    * Add examples/server-async
    
    * Updated RequestScopedPipeline to handle a single tokenizer lock to avoid race conditions
    
    * Fix
    
    * Fix _TokenizerLockWrapper
    
    * Fix _TokenizerLockWrapper
    
    * Delete _TokenizerLockWrapper
    
    * Fix tokenizer
    
    * Update examples/server-async
    
    * Fix server-async
    
    * Optimizations in examples/server-async
    
    * Keep the implementation simple in examples/server-async
    
    * Update examples/server-async/README.md
    
    * Update examples/server-async/README.md for changes to tokenizer locks and backward-compatible retrieve_timesteps
    
    * Undo the changes to the diffusers core and move all logic to examples/server-async
    
    * Update examples/server-async/utils/*
    
    * Fix BaseAsyncScheduler
    
    * Roll back changes in the diffusers core
    
    * Update examples/server-async/README.md
    
    * Complete rollback of diffusers core files
    
    * Simple implementation of an asynchronous server compatible with SD3-3.5 and Flux Pipelines
    
    * Update examples/server-async/README.md
    
    * Fixed import errors in 'examples/server-async/serverasync.py'
    
    * Flux Pipeline Discard
    
    * Update examples/server-async/README.md
    
    * Apply style fixes
    
    ---------
    Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
    Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
requestscopedpipeline.py 11.7 KB
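
The commit message names two ideas: a request-scoped view of a shared pipeline, and a single tokenizer lock to avoid race conditions. The following is a minimal sketch of how those ideas could fit together. It is not the contents of requestscopedpipeline.py. `ToyScheduler`, `ToyPipeline`, and `RequestScopedSketch` are hypothetical stand-ins that use only the standard library, and the shallow-copy-plus-private-scheduler approach is an assumption made for illustration.

```python
import copy
import threading
from concurrent.futures import ThreadPoolExecutor


class ToyScheduler:
    """Hypothetical stand-in for a scheduler with per-run mutable state."""

    def __init__(self):
        self.timesteps = []

    def set_timesteps(self, num_steps):
        # Mutates the scheduler; unsafe if two requests share one instance.
        self.timesteps = list(range(num_steps, 0, -1))


class ToyPipeline:
    """Hypothetical stand-in for a pipeline; not a diffusers class."""

    def __init__(self):
        self.scheduler = ToyScheduler()
        self.weights = object()  # heavy, read-only state shared by all requests

    def tokenize(self, prompt):
        return prompt.split()

    def __call__(self, prompt, num_steps):
        self.scheduler.set_timesteps(num_steps)
        return self.tokenize(prompt), list(self.scheduler.timesteps)


class RequestScopedSketch:
    """Runs each request on a shallow pipeline copy with a private scheduler."""

    def __init__(self, base):
        self._base = base
        self._tokenizer_lock = threading.Lock()  # one lock shared by all requests

    def generate(self, prompt, num_steps):
        local = copy.copy(self._base)  # shallow copy: weights stay shared
        local.scheduler = copy.deepcopy(self._base.scheduler)  # private state
        base_tokenize = self._base.tokenize

        def locked_tokenize(text):
            # Serialize tokenizer access across concurrent requests.
            with self._tokenizer_lock:
                return base_tokenize(text)

        local.tokenize = locked_tokenize  # instance attribute shadows the method
        return local(prompt, num_steps)


# Usage: several requests run concurrently against one base pipeline.
shared = RequestScopedSketch(ToyPipeline())
with ThreadPoolExecutor(max_workers=4) as pool:
    outputs = list(pool.map(lambda n: shared.generate("a photo of a cat", n), [2, 3, 4]))
```

In this sketch each request mutates only its own scheduler copy, so the base pipeline's scheduler is never touched, while the heavy read-only state is shared rather than duplicated per request.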