""" Copyright (c) 2025 Ma Zhaojia This source code is licensed under the MIT license found in the LICENSE file in the root directory of this source tree. BatchOpt Extensions - C++ and CUDA implementations for performance-critical operations. This module provides optimized implementations of common operations using torch.utils.cpp_extension for JIT compilation. """