Add gpu_driven_amd with AMD GPU support #217

CalebZ9909 · 2025-07-28T05:15:07Z

Add AMD HIP for gpu_driven on AMD GPUs

YangZhou1997 · 2025-07-28T06:23:24Z

@CalebZ9909 Looks great. Can you put your code also in the gpu_driven folder so that we can track the diff and do the merging?

YangZhou1997 · 2025-07-29T03:16:42Z

Nice. I think my past wording was confusing. You should also do git rm -r gpu_driven_amd.

CalebZ9909 · 2025-07-29T03:20:34Z

Yes, I am done on my side. Ziming and you can check the code now. I am working on testing the code and will update next version soon.

MaoZiming · 2025-07-29T05:00:14Z

Thanks @CalebZ9909 , ideally we can define a flag (e.g. HIP_PLATFORM_AMD in rdma folder) so that we can toggle whether to run on AMD platform from the makefile. It should not overwrite all existing cuda commands with hip but keeping the option to toggle between the two

Maybe we can define some function name to either refer to the cuda version or the hip version depending on the macro. e.g. (either hipMemcpyPeerAsync or cudaMemcpyPeerAsync).

YangZhou1997 · 2025-07-29T05:01:34Z

I have some macro defination at https://github.com/uccl-project/uccl/blob/main/include/util/gpu_rt.h

CalebZ9909 · 2025-07-29T05:09:48Z

Sounds good! Yes, I can set up a flag and keep all CUDA-related files along with AMD files, which can be chosen to be executed by the flag in the makefile.

YangZhou1997 · 2025-07-29T05:12:29Z

I guess a better way is to use the unified name like gpuMemcpy in https://github.com/uccl-project/uccl/blob/main/include/util/gpu_rt.h, so that we just need to write a new MakefileHip for AMD. You can find an example in https://github.com/uccl-project/uccl/tree/main/p2p

CalebZ9909 · 2025-07-29T05:16:11Z

Sure, I will look into this!

Add gpu_driven_amd with AMD GPU support

ec606b9

replace gpu_driven

2f922e7

Delete gpu_driven_amd directory

cc0ecaf

YangZhou1997 requested a review from MaoZiming July 29, 2025 03:19

Debug for remote benchmark

ec91df9

HermesCui force-pushed the deepep_amd branch from ac9d770 to ec91df9 Compare July 29, 2025 19:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add gpu_driven_amd with AMD GPU support #217

Add gpu_driven_amd with AMD GPU support #217

CalebZ9909 commented Jul 28, 2025

Uh oh!

YangZhou1997 commented Jul 28, 2025 •

edited

Loading

Uh oh!

YangZhou1997 commented Jul 29, 2025 •

edited

Loading

Uh oh!

CalebZ9909 commented Jul 29, 2025

Uh oh!

MaoZiming commented Jul 29, 2025

Uh oh!

YangZhou1997 commented Jul 29, 2025

Uh oh!

CalebZ9909 commented Jul 29, 2025

Uh oh!

YangZhou1997 commented Jul 29, 2025

Uh oh!

CalebZ9909 commented Jul 29, 2025

Uh oh!

Uh oh!

Add gpu_driven_amd with AMD GPU support #217

Are you sure you want to change the base?

Add gpu_driven_amd with AMD GPU support #217

Conversation

CalebZ9909 commented Jul 28, 2025

Uh oh!

YangZhou1997 commented Jul 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

YangZhou1997 commented Jul 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CalebZ9909 commented Jul 29, 2025

Uh oh!

MaoZiming commented Jul 29, 2025

Uh oh!

YangZhou1997 commented Jul 29, 2025

Uh oh!

CalebZ9909 commented Jul 29, 2025

Uh oh!

YangZhou1997 commented Jul 29, 2025

Uh oh!

CalebZ9909 commented Jul 29, 2025

Uh oh!

Uh oh!

YangZhou1997 commented Jul 28, 2025 •

edited

Loading

YangZhou1997 commented Jul 29, 2025 •

edited

Loading