You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently i have workaround that uses cudaMemcpyAsync instead of UCC when i try to do something like this, but for composability it would be nice for UCC to handle this case:
for (msg: messages)
if (myrank == msg.src)
ucc.send(msg.dst, ...)
if (myrank == msg.dst)
ucc.recv(msg.src, ...)
wait_all_ucc_requests()
The text was updated successfully, but these errors were encountered:
From the API perspecitve: active_set.size = 1; start = my_rank; stride = 1; should probably work. But it might not be implemented currectly in TL/UCP currently.
Currently we use active_set field to "emulate" p2p communication:
however I don't see a way to set up coll properly to "send" to myself for this code to work:
https://github.com/openucx/ucc/blob/master/src/utils/ucc_coll_utils.h#L270-L271
Currently i have workaround that uses
cudaMemcpyAsync
instead of UCC when i try to do something like this, but for composability it would be nice for UCC to handle this case:The text was updated successfully, but these errors were encountered: