Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Uni-Fold_pytorch
Commits
cde228fa
Commit
cde228fa
authored
Jan 04, 2024
by
zhangqha@sugon.com
Browse files
update readme
parent
86aa557a
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
73 additions
and
30 deletions
+73
-30
README.md
README.md
+73
-30
alphafold2.png
alphafold2.png
+0
-0
alphafold2_1.png
alphafold2_1.png
+0
-0
No files found.
README.md
View file @
cde228fa
# Uni-Fold(An Open-Source Platform for Developing Protein Folding Models beyond AlphaFold)
# Uni-Fold
## 模型介绍
深势科技推出的蛋白质结构预测工具Uni-Fold,成功复现曾引起生物学界轰动的 AlphaFold2 的全规模训练,并开源训练代码与推理代码。相应解决方案已集成至深势科技自主研发的药物设计平台 Hermite,供广大用户测试使用。
## 论文
Uni-Fold 克服了AlphaFold2 未开源训练代码、硬件支持单一、模型不可商用等局限性,在训练与推理环节进行了 NVIDIA GPU上的适配、性能优化及功能完善等工作,为更多人参与推动领域进一步发展提供了基础。
Uni-Fold: An Open-Source Platform for Developing Protein Folding Models beyond AlphaFold
https://www.biorxiv.org/content/biorxiv/early/2022/08/06/2022.08.04.502811.full.pdf
## 模型结构
## 模型结构
模型核心是一个基于Transformer架构的神经网络,包括两个主要组件:Sequence to Sequence Model和Structure Model,这两个组件通过迭代训练进行优化,以提高其预测准确性。
Uni-Fold,参考https://github.com/dptech-corp/Uni-Fold

## Uni-Fold代码参考版本
版本:master
## 算法原理
通过从蛋白质序列和结构数据中提取信息,使用神经网络模型来预测蛋白质三维结构。
原始代码位置:https://github.com/dptech-corp/Uni-Fold.git

## 数据集
示例中验证的数据集来自:
```
bash scripts/download/download_all_data.sh /path/to/database/directory
```
## 预训练模型参数
```
wget https://github.com/dptech-corp/Uni-Fold/releases/download/v2.0.0/unifold_params_2022-08-01.tar.gz
tar -zxf unifold_params_2022-08-01.tar.gz
```
## 推理
##
#
环境配置
## 环境配置
提供
[
光源
](
https://www.sourcefind.cn/#/service-details
)
拉取的训练的docker镜像:
提供
[
光源
](
https://www.sourcefind.cn/#/service-details
)
拉取的训练的docker镜像:
*
推理镜像:
```
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:unifold-latest
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:unifold-latest
docker run -it -v /path/your_code_data/:/path/your_code_data/ --shm-size=32G --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name docker_name imageID bash
cd /root/Uni-Fold-main
cd /root/Uni-Fold-main
```
```
*
依赖安装:(如使用镜像,则无需再次安装)
安装requirement.txt中的工具,镜像中已经安装好,加载方式
安装requirement.txt中的工具,镜像中已经安装好,加载方式
```
```
export PATH=/root/software/hmmer/bin${PATH:+:${PATH}}
export PATH=/root/software/hmmer/bin${PATH:+:${PATH}}
...
@@ -53,6 +34,55 @@ export PATH=/root/software/kalign/bin${PATH:+:${PATH}}
...
@@ -53,6 +34,55 @@ export PATH=/root/software/kalign/bin${PATH:+:${PATH}}
export LD_LIBRARY_PATH=/root/software/hh-suite-master/lib${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
export LD_LIBRARY_PATH=/root/software/hh-suite-master/lib${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}
```
```
## 数据集
推荐使用AlphaFold2中的开源数据集,包括BFD、MGnify、PDB70、Uniclust、Uniref90等,数据集大小约2.62TB。数据集格式如下:
```
$DOWNLOAD_DIR/
bfd/
bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt_hhm.ffindex
bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt_hhm.ffdata
bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt_cs219.ffindex
...
mgnify/
mgy_clusters_2022_05.fa
params/
params_model_1.npz
params_model_2.npz
params_model_3.npz
...
pdb70/
pdb_filter.dat
pdb70_hhm.ffindex
pdb70_hhm.ffdata
...
pdb_mmcif/
mmcif_files/
100d.cif
101d.cif
101m.cif
...
obsolete.dat
pdb_seqres/
pdb_seqres.txt
small_bfd/
bfd-first_non_consensus_sequences.fasta
uniref30/
UniRef30_2021_03_hhm.ffindex
UniRef30_2021_03_hhm.ffdata
UniRef30_2021_03_cs219.ffindex
...
uniprot/
uniprot.fasta
uniref90/
uniref90.fasta
```
此处提供了一个脚本download_all_data.sh用于下载使用的数据集和模型文件:
```
bash scripts/download/download_all_data.sh /path/to/database/directory
```
## 推理
### 安装
### 安装
#### 安装Uni-Core-main(如使用镜像,则无需再次安装)
#### 安装Uni-Core-main(如使用镜像,则无需再次安装)
```
```
...
@@ -75,6 +105,19 @@ sh run_multimer.sh
...
@@ -75,6 +105,19 @@ sh run_multimer.sh
```
```
sh run_monomer.sh
sh run_monomer.sh
```
```
## result
无
## 精度
无
## 应用场景
### 算法类别
NLP
### 热点应用行业
医疗,科研,教育
## 源码仓库及问题反馈
## 源码仓库及问题反馈
*
https://developer.hpccube.com/codes/modelzoo/uni-fold
*
https://developer.hpccube.com/codes/modelzoo/uni-fold
...
...
alphafold2.png
0 → 100644
View file @
cde228fa
130 KB
alphafold2_1.png
0 → 100644
View file @
cde228fa
134 KB
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment