README.md 2.6 KB
Newer Older
zhanggzh's avatar
zhanggzh committed
1
2
3
4
5
# <div align="center"><strong>Kimi-Audio</strong></div>
## 简介
 Kimi-Audio,这是一个开源音频基础模型,在音频理解、生成和对话方面表现出色。此存储库包含 Kimi-Audio 的官方实现、模型和评估工具包。
## 安装
组件支持组合
wangzhengtao's avatar
wangzhengtao committed
6

zhanggzh's avatar
zhanggzh committed
7
8
9
10
   | PyTorch版本 | fastpt版本  |Kimi-Audio版本      | DTK版本                  | Python版本       | 推荐编译方式 |
   | ----------- | ----------- | ----------- | ------------------------ | -----------------| ------------ |
   | 2.5.1       | 2.1.0       |0.1.0        | >= 25.04                 | 3.8、3.10、3.11  | fastpt不转码 |
   | 2.4.1       | 2.0.1       |0.1.0        | >= 25.04                 | 3.8、3.10、3.11  | fastpt不转码 |
wangzhengtao's avatar
wangzhengtao committed
11

zhanggzh's avatar
zhanggzh committed
12
+ pytorch版本大于2.4.1 && dtk版本大于25.04 推荐使用fastpt不转码编译。
wangzhengtao's avatar
wangzhengtao committed
13

zhanggzh's avatar
zhanggzh committed
14
### 1、使用源码编译方式安装
wangzhengtao's avatar
wangzhengtao committed
15

zhanggzh's avatar
zhanggzh committed
16
17
#### 编译环境准备
提供基于fastpt不转码编译:
wangzhengtao's avatar
wangzhengtao committed
18

zhanggzh's avatar
zhanggzh committed
19
1. 基于光源pytorch基础镜像环境:镜像下载地址:[光合开发者社区](https://sourcefind.cn/#/image/dcu/pytorch),根据pytorch、python、dtk及系统下载对应的镜像版本。
wangzhengtao's avatar
wangzhengtao committed
20

zhanggzh's avatar
zhanggzh committed
21
22
23
24
25
2. 基于现有python环境:安装pytorch,fastpt whl包下载目录:[光合开发者社区](https://sourcefind.cn/#/image/dcu/pytorch),根据python、dtk版本,下载对应pytorch的whl包。安装命令如下:
```shell
pip install torch* (下载torch的whl包)
pip install fastpt* --no-deps (下载fastpt的whl包, 安装顺序,先安装torch,后安装fastpt)
pip install setuptools==59.5.0 wheel
xinyifei's avatar
xinyifei committed
26
```
wangzhengtao's avatar
wangzhengtao committed
27

zhanggzh's avatar
zhanggzh committed
28
29
30
31
#### 源码编译安装
- 代码下载
```shell
git clone https://developer.sourcefind.cn/codes/OpenDAS/openpcdet.git # 根据编译需要切换分支
Deep-unlearning's avatar
Deep-unlearning committed
32
```
zhanggzh's avatar
zhanggzh committed
33
- 进入openpcdet目录:
wangzhengtao's avatar
wangzhengtao committed
34
```
zhanggzh's avatar
zhanggzh committed
35
36
1. 设置不转码编译环境变量
source /usr/local/bin/fastpt -C
wangzhengtao's avatar
wangzhengtao committed
37

zhanggzh's avatar
zhanggzh committed
38
39
40
41
42
43
44
3. 源码编译安装
python3 install .
```
#### 注意事项
+ 若使用pip install下载安装过慢,可添加pypi清华源:-i https://pypi.tuna.tsinghua.edu.cn/simple/
+ ROCM_PATH为dtk的路径,默认为/opt/dtk
+ 在pytorch2.5.1环境下编译需要支持c++17语法,打开setup.py文件,把文件中的 -std=c++14 修改为 -std=c++17
wangzhengtao's avatar
wangzhengtao committed
45

zhanggzh's avatar
zhanggzh committed
46
47
48
49
50
51
52
53
54
## 验证
```
python3
Python 3.10.12 (main, Feb  4 2025, 14:57:36) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import kimia_infer
>>> kimia_infer.__version__
'0.1.0'
>>>
wangzhengtao's avatar
wangzhengtao committed
55
```
zhanggzh's avatar
zhanggzh committed
56
版本号与官方版本同步,查询该软件的版本号,例如0.1.0;
wangzhengtao's avatar
wangzhengtao committed
57

zhanggzh's avatar
zhanggzh committed
58
59
## Known Issue
-
wangzhengtao's avatar
wangzhengtao committed
60

zhanggzh's avatar
zhanggzh committed
61
62
63
64
## 参考资料
- [README_ORIGIN](README_ORIGIN.md)
- [README_zh-CN](README_zh-CN.md)
- [https://github.com/MoonshotAI/Kimi-Audio](https://github.com/MoonshotAI/Kimi-Audio)