OpenDAS / vllm_cscc: commits for vllm/model_executor/models/phi4flash.py
  1. 15 Aug, 2025 1 commit
    • [BugFix] Fix regression caused by mamba state dtype PR (#22998) · f5d412ba
      Thomas Parnell authored Aug 16, 2025

      Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
  2. 10 Aug, 2025 1 commit
    • Refactor sliding window configuration to Transformers best practice (#21927) · c4984839
      Harry Mellor authored Aug 10, 2025

      Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
  3. 02 Aug, 2025 1 commit
    • [Model] Mamba2 preallocate SSM output tensor to avoid d2d copy overhead (#21075) · b690e348
      Chih-Chieh Yang authored Aug 02, 2025

      Signed-off-by: Chih-Chieh Yang <7364402+cyang49@users.noreply.github.com>
      Signed-off-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
  4. 16 Jul, 2025 2 commits
    • [Model] Remove model sampler (#21059) · ac2bf41e
      Cyrus Leung authored Jul 17, 2025

      Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
    • [CI] update typos config for CI pre-commit and fix some spells (#20919) · 1eb2b9c1
      Peter Pan authored Jul 16, 2025

      Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
  5. 12 Jul, 2025 1 commit
    • [Model] New model support for microsoft/Phi-4-mini-flash-reasoning (#20702) · 2c11a738
      Congcong Chen authored Jul 12, 2025

      Signed-off-by: Congcong Chen <congcongchen@microsoft.com>