Blame · csrc/moe/moe_ops.h · e26d37a185fd33c3f91d0035611c26cfb03883da · OpenDAS / vllm_cscc · GitLab

Switch branch/tag

vllm_cscc

csrc

moe

moe_ops.h
Find file
Normal viewHistoryPermalink

moe_ops.h

218 Bytes

Newer

Older

Add fused top-K softmax kernel for MoE (#2769)

Woosuk Kwon
committed
Feb 05, 2024

#pragma once

[Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047)

bnellnm
committed
Jun 09, 2024

#include <torch/all.h>

Add fused top-K softmax kernel for MoE (#2769)

Woosuk Kwon
committed
Feb 05, 2024

[CI/Build] Enforce style for C++ and CUDA code with `clang-format` (#4722)

Michael Goin
committed
May 22, 2024

void topk_softmax(torch::Tensor& topk_weights, torch::Tensor& topk_indices,
                  torch::Tensor& token_expert_indices,
                  torch::Tensor& gating_output);