• Add support for stop words in TRTLLM (#2678) · ba5fc7d9
    Funtowicz Morgan authored
    * feat(trtllm): rewrite health to not account for current state
    
    * chore(looper): cleanup a bit more
    
    * feat(post_processing): max_new_tokens is const evaluated now
    
    * chore(ffi): formatting
    
    * feat(trtllm): add stop words handling
    
    # Conflicts:
    #	backends/trtllm/lib/backend.cpp
    
    * chore(trtllm): create specific parallelconfig factory and logging init methods
    
    * chore(trtllm): define a macro for SizeType cast
    
    * chore(trtllm): use GetParallelConfig
    
    * chore(trtllm): minor refactoring
    
    * chore(trtllm): validate there are enough GPUs on the system for the desired model
    
    * chore(trtllm): ensure max throughput scheduling policy is selected
    
    * chore(trtllm): minor fix
    
    * chore(router): minor refactorings
    
    * feat(docker): build with-slurm ompi
    
    * feat(docker): add python3.10 dev to runtime deps
    
    * chore(docker): add mpi to ld_library_path
    
    * chore(docker): install transformers
    
    * feat(trtllm): detect stop_words from generation_config.json
ffi.cpp 2.93 KB