"vscode:/vscode.git/clone" did not exist on "f23d6eb8f2a618c924a2e9f928edbc2e3b0e274f"
  • Dmitry Rogozhkin's avatar
    feat: enable pytorch xpu support for non-attention models (#2561) · 58848cb4
    Dmitry Rogozhkin authored
    
    
    XPU backend is available natively (without IPEX) in pytorch starting
    from pytorch 2.4. This commit extends TGI to cover the case when user
    has XPU support thru pytorch 2.4, but does not have IPEX installed.
    Models which don't require attention can work. For attention required
    models more work is needed to provide attention implementation.
    
    Tested with the following models:
    * teknium/OpenHermes-2.5-Mistral-7B
    * bigscience/bloom-560m
    * google/gemma-7b
    * google/flan-t5-xxl
    Signed-off-by: default avatarDmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>
    58848cb4
seq2seq_lm.py 34.3 KB