"git@developer.sourcefind.cn:zhaoyu6/sglang.git" did not exist on "f35f120d700fe425296f790d6e5669af621244b9"
  • Rinat Shigapov's avatar
    Async inference interface (#1647) · 9729ca54
    Rinat Shigapov authored
    * async inference support
    
    * implemented concurrent decorator
    
    * fixes for Python versions < 3.7
    
    * async methods depend on python version
    
    * revert changes in forward method
    
    * async_test -> async_simple_test, debug logging is done via logger.debug
    
    * add async test
    
    * add asynctest to test requirements
    
    * async tests are run in Python 3.7
    
    * check CUDA, add docs
    
    * fix device
    
    * run test only if CUDA is available
    
    * fix linting
    
    * custom operators can run on nondefault stream
    
    * set current stream in kernel launch configuration
    
    * example fixes
    
    * add async/sync interface comparison benchmark
    
    * fix linting
    9729ca54
GETTING_STARTED.md 14.2 KB