Add `GpuBuffer` class #423

chhwang · 2024-12-19T23:47:56Z

Renamed and moved mem alloc functions into the mscclpp::detail:: namespace (now mscclpp::detail::gpuCalloc*<T>())
Deprecated constructor-calling mem alloc functions (mscclpp::makeShared*<T>() and mscclpp::makeUnique*<T>())
Added a new mscclpp::GpuBuffer<T>() class that should be used in general for allocating communication buffers
Added a new mscclpp.utils.GpuBuffer Python class that inherits cupy.ndarray and allocates using mscclpp::gpuMemAlloc
Renamed mscclpp::memcpyCuda*<T>() functions into mscclpp::gpuMemcpy*<T>() for name consistency
A few fixes in NVLS memory allocation
Tackled minor compiler warnings

chhwang · 2024-12-19T23:48:32Z

/azp run

azure-pipelines · 2024-12-19T23:48:42Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

chhwang · 2024-12-20T00:52:50Z

/azp run

azure-pipelines · 2024-12-20T00:52:58Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

chhwang · 2024-12-20T00:53:19Z

/azp run

azure-pipelines · 2024-12-20T00:53:35Z

Azure Pipelines successfully started running 3 pipeline(s).

chhwang · 2024-12-20T13:06:05Z

/azp run

azure-pipelines · 2024-12-20T13:06:23Z

Azure Pipelines successfully started running 3 pipeline(s).

chhwang · 2025-01-02T19:25:50Z

/azp run

azure-pipelines · 2025-01-02T19:26:08Z

Azure Pipelines successfully started running 3 pipeline(s).

chhwang · 2025-01-02T22:13:08Z

/azp run

azure-pipelines · 2025-01-02T22:13:27Z

Azure Pipelines successfully started running 3 pipeline(s).

chhwang · 2025-01-03T02:09:20Z

/azp run mscclpp-ut

azure-pipelines · 2025-01-03T02:09:30Z

Azure Pipelines successfully started running 1 pipeline(s).

chhwang · 2025-01-03T03:30:36Z

/azp run mscclpp-ut

azure-pipelines · 2025-01-03T03:30:47Z

Azure Pipelines successfully started running 1 pipeline(s).

src/gpu_utils.cc

test/mscclpp-test/allreduce_test.cu

python/mscclpp_benchmark/mscclpp_op.py

chhwang · 2025-01-04T00:50:11Z

/azp run mscclpp-ut

azure-pipelines · 2025-01-04T00:50:21Z

Azure Pipelines successfully started running 1 pipeline(s).

chhwang · 2025-01-05T05:12:34Z

/azp run mscclpp-ut

azure-pipelines · 2025-01-05T05:12:44Z

Azure Pipelines successfully started running 1 pipeline(s).

chhwang · 2025-01-05T21:46:03Z

/azp run mscclpp-test

azure-pipelines · 2025-01-05T21:46:13Z

Azure Pipelines successfully started running 1 pipeline(s).

chhwang · 2025-01-06T01:26:56Z

/azp run mscclpp-ut

azure-pipelines · 2025-01-06T01:27:06Z

Azure Pipelines successfully started running 1 pipeline(s).

chhwang · 2025-01-06T03:38:52Z

/azp run mscclpp-ut

azure-pipelines · 2025-01-06T03:39:01Z

Azure Pipelines successfully started running 1 pipeline(s).

chhwang added 2 commits December 19, 2024 23:00

Tackle build warnings

75a2ac5

Add gpuMemAlloc function

b8a1360

chhwang marked this pull request as draft December 19, 2024 23:49

Base automatically changed from chhwang/build to main December 20, 2024 00:51

Merge branch 'main' into chhwang/malloc

1d1d07d

chhwang marked this pull request as ready for review December 20, 2024 00:53

chhwang requested review from Binyang2014 and caiomcbr December 20, 2024 11:06

chhwang linked an issue Dec 20, 2024 that may be closed by this pull request

[Bug] Proxy channel over CudaIPC on AMD GPUs #418

Closed

Refine gpu_utils.hpp

78da4a4

chhwang added 2 commits January 2, 2025 18:45

Fixes

fbd0f6c

Merge branch 'main' into chhwang/malloc

08815b9

fix

434898c

chhwang added 5 commits January 2, 2025 23:43

Fix a test

35b4598

Fix npkit

24d883e

Merge branch 'main' into chhwang/malloc

0862717

Python interface

a99522d

Merge branch 'main' into chhwang/malloc

fcf8392

lint

11c2bf9

chhwang mentioned this pull request Jan 3, 2025

[Bug] Proxy channel over CudaIPC on AMD GPUs #418

Closed

chhwang added 2 commits January 3, 2025 03:19

Fix names & make uncached alloc available only on AMD

4f80f7c

lint

a5c3653

Binyang2014 reviewed Jan 3, 2025

View reviewed changes

src/gpu_utils.cc Show resolved Hide resolved

test/mscclpp-test/allreduce_test.cu Show resolved Hide resolved

python/mscclpp_benchmark/mscclpp_op.py Show resolved Hide resolved

chhwang added 3 commits January 4, 2025 00:38

Fix NVLS memory allocation

afc9d20

Merge branch 'main' into chhwang/malloc

6a5ce05

lint

3abd219

Merge branch 'main' into chhwang/malloc

ec4452a

chhwang added 2 commits January 6, 2025 00:01

pipeline fix

ea0c4a3

C++ class

3f9c653

chhwang changed the title ~~Add gpuMemAlloc function~~ Add GpuBuffer class Jan 6, 2025

chhwang requested a review from Binyang2014 January 6, 2025 01:26

Fix

38b27c5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `GpuBuffer` class #423

Add `GpuBuffer` class #423

chhwang commented Dec 19, 2024 •

edited

Loading

chhwang commented Dec 19, 2024

azure-pipelines bot commented Dec 19, 2024

chhwang commented Dec 20, 2024

azure-pipelines bot commented Dec 20, 2024

chhwang commented Dec 20, 2024

azure-pipelines bot commented Dec 20, 2024

chhwang commented Dec 20, 2024

azure-pipelines bot commented Dec 20, 2024

chhwang commented Jan 2, 2025

azure-pipelines bot commented Jan 2, 2025

chhwang commented Jan 2, 2025

azure-pipelines bot commented Jan 2, 2025

chhwang commented Jan 3, 2025

azure-pipelines bot commented Jan 3, 2025

chhwang commented Jan 3, 2025

azure-pipelines bot commented Jan 3, 2025

chhwang commented Jan 4, 2025

azure-pipelines bot commented Jan 4, 2025

chhwang commented Jan 5, 2025

azure-pipelines bot commented Jan 5, 2025

chhwang commented Jan 5, 2025

azure-pipelines bot commented Jan 5, 2025

chhwang commented Jan 6, 2025

azure-pipelines bot commented Jan 6, 2025

chhwang commented Jan 6, 2025

azure-pipelines bot commented Jan 6, 2025

Add GpuBuffer class #423

Are you sure you want to change the base?

Add GpuBuffer class #423

Conversation

chhwang commented Dec 19, 2024 • edited Loading

chhwang commented Dec 19, 2024

azure-pipelines bot commented Dec 19, 2024

chhwang commented Dec 20, 2024

azure-pipelines bot commented Dec 20, 2024

chhwang commented Dec 20, 2024

azure-pipelines bot commented Dec 20, 2024

chhwang commented Dec 20, 2024

azure-pipelines bot commented Dec 20, 2024

chhwang commented Jan 2, 2025

azure-pipelines bot commented Jan 2, 2025

chhwang commented Jan 2, 2025

azure-pipelines bot commented Jan 2, 2025

chhwang commented Jan 3, 2025

azure-pipelines bot commented Jan 3, 2025

chhwang commented Jan 3, 2025

azure-pipelines bot commented Jan 3, 2025

chhwang commented Jan 4, 2025

azure-pipelines bot commented Jan 4, 2025

chhwang commented Jan 5, 2025

azure-pipelines bot commented Jan 5, 2025

chhwang commented Jan 5, 2025

azure-pipelines bot commented Jan 5, 2025

chhwang commented Jan 6, 2025

azure-pipelines bot commented Jan 6, 2025

chhwang commented Jan 6, 2025

azure-pipelines bot commented Jan 6, 2025

Add `GpuBuffer` class #423

Add `GpuBuffer` class #423

chhwang commented Dec 19, 2024 •

edited

Loading