• Nicolas Patry's avatar
    Prefix test - Different kind of load test to trigger prefix test bugs. (#2490) · a4e3e8c6
    Nicolas Patry authored
    
    
    * Adding prefix test.
    
    * [WIP] tmp dump of integration load tests.
    
    * Remove other tensor creation.
    
    * Fixed the radix tree.
    
    Used a slice everywhere in radix.rs to keep the cheap Arc cloning
    instead of recomputing the input_ids.
    
    * Fix parsing
    
    * Is it really flashinfer version ?
    
    * Remove some comments.
    
    * Revert the max prefix hit.
    
    * Adding numpy to diff.
    
    * Upgraded flashinfer.
    
    * Upgrading some stuff.
    
    * Are we done yet ?
    
    * Minor fixup
    
    * Remove 1 log and put back the other.
    
    * Add comment for why slot 0 is OK.
    
    * Mounting on the job.
    
    * Get me a debug branch
    
    * Debugging CIs is fun.
    
    * Attempt #28
    
    * wip
    
    * Tmate.
    
    * Praying.
    
    * Updating VLM causal model with updated context.
    
    * Important line got squashed.
    
    * Tmate again.
    
    * Fingers crossed.
    
    * We want only 1 run of integration tests.....
    
    ---------
    Co-authored-by: default avatarGuillaume LEGENDRE <glegendre01@gmail.com>
    a4e3e8c6
flake.lock 26.4 KB