Skip to content

GitLab

  • Menu
Projects Groups Snippets
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • R ResNet50v1.5_pytorch
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 0
    • Issues 0
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Packages & Registries
    • Packages & Registries
    • Package Registry
    • Infrastructure Registry
  • Analytics
    • Analytics
    • CI/CD
    • Repository
    • Value stream
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • ModelZoo
  • ResNet50v1.5_pytorch
  • Issues
  • #1

"src/libtorio/ffmpeg/ffmpeg.cpp" did not exist on "bd319959cd504b718685e720cbd4f47408ee1835"
Closed
Open
Created Jan 04, 2024 by hehaijun@hehaijun

训练脚本报错

单卡训练脚本train_single_fp16.sh 报错 return self._apply(lambda t: t.cuda(device)) File "/home/user/miniconda3/envs/meg/lib/python3.9/site-packages/torch/cuda/init.py", line 229, in _lazy_init torch._C._cuda_init() RuntimeError: No HIP GPUs are available 多卡训练脚本train_multi_fp16.sh报错 RuntimeError: NCCL error in: /data/jenkins_workspace/workspace/pytorch_23.04_abi@4/torch/csrc/distributed/c10d/ProcessGroupNCCL.cpp:1269, invalid usage, NCCL version 2.13.4 ncclInvalidUsage: This usually reflects invalid usage of NCCL library. Last error: Duplicate GPU detected : rank 0 and rank 7 both on CUDA device 51000 I0104 13:53:26.121362 123515 ProcessGroupNCCL.cpp:837] [Rank 0] NCCL watchdog thread terminated normally

Assignee
Assign to
Time tracking