Issues: openucx/ucx
Error: Transport retry count exceeded on mlx5_0:1/RoCE
#6000 by afernandezody was closed Feb 1, 2021
Issues list
mlx5 connect on mlx5_1 failed: Connection timed out
Bug, #9971, opened Jun 24, 2024 by shinoharakazuya
UCX blocked after .sendStreamNonBlocking(sendBuffer, new SendCallback(sendBuffer, this));
Bug, #9964, opened Jun 17, 2024 by pereverges
UCX installation done with OFED doesn't recognize cuda, cuda_cpy etc.
Bug, #9950, opened Jun 12, 2024 by RamHPC
Performance regression in collectives due to UCX_PROTO_ENABLE
Bug, #9914, opened May 30, 2024 by angainor
cuda_copy_md.c:489 UCX WARN cuPointerSetAttribute error with CUDA VMM API
Bug, #9895, opened May 23, 2024 by MinassZhang
DBFS library installations are not supported on DBR 15 or above.
Bug, #9777, opened Mar 25, 2024 by rkkalluri
UCS/ARCH/BITOPS: gcc 12.3.0 fails to build x86_64 ucs_ffs32
Bug, #9774, opened Mar 21, 2024 by tvegas1
osu_mbw_mr for CUDA memory shows bad performance with UCX_PROTO_ENABLE=y
Bug, #9690, opened Feb 15, 2024 by dmitrygx
Question: does ucx support FPGA to AMDGPU (ROCm ) p2p transfer?
#9598, opened Jan 13, 2024 by littlewu2508
GPU Aware openMPI 5.0.1 + ROCM gives UCX ERROR : failed to register address
Bug, #9589, opened Jan 10, 2024 by denisbertini
Selection of Network Ressources and creating worker/endpoint pair
Bug, #9586, opened Jan 9, 2024 by 98luks
question about fine-grained transport selection for multi-node env
#9560, opened Dec 24, 2023 by qelk123