Gpu Communication Library In Meta

GPU Communication Library in Meta-Scale AI Clusters

Presenter(s): James Hongyi Zeng, Senior Engineering Manager,

NCCL Explained: How NVIDIA's GPU Communication Library Powers Distributed Deep Learning

In this video, we break down NCCL (

Tutorial: GPU Communication Libraries for Accelerating HPC and AI Applications

Benjamin Glick Pouya Kousha, Arnav Goel (

Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025

Want to scale beyond the limits of a single

NCCLX: Collective Comms for 100k+ GPUs

In this AI Research Roundup episode, Alex discusses the paper: 'Collective

GPUs: Explained

Check out IBM Cloud for

Lecture 17: NCCL

Code and Slides: https://github.com/cuda-mode/lectures/tree/main/lecture_017.

Collective Communication for 100k+ GPUs -- Paper Study

https://arxiv.org/pdf/2510.20171 The NCCLX collective

Demystifying RDMA Protocols for GPU Data Centers | NVlink, Connectx, EFA, Infiniband, GPUDirect

RDMA (Remote Direct Memory Access) is the secret sauce behind fast

Getting Started with Distributed Multi-GPU Libraries for Scalable AI and HPC | NVIDIA GTC 2025

Scaling beyond a single

Nvidia CUDA in 100 Seconds

What is CUDA? And how does parallel computing on the

Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu

Zhiyi Hu, Siyuan Shen, Tommaso Bonato (ETH Zurich), Sylvain Jeaugey (

NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA

Simplifying AI Cluster Management with NVIDIA Base Command

AI clusters are difficult to manage. There are multiple hardware and software elements to coordinate and constant updates that ...

𝗖𝗮𝗻 𝗠𝗲𝘁𝗮 𝗡𝗲𝘄 𝗔𝗜 𝗦𝘂𝗽𝗲𝗿𝗰𝗼𝗺𝗽𝘂𝘁𝗲𝗿 𝗪𝗶𝗹𝗹 𝗦𝗲𝘁 𝗥𝗲𝗰𝗼𝗿𝗱𝘀 ??

RSC is also estimated to be 9x faster, at running the

ML Performance Reading Group Session 1: GPU Architecture, CUDA, NCCL

ML Performance research paper reading group session 1 meeting (2024/11/29). This was an intro session covering prerequisite ...

MultiGPU + NCCL from the authors

Speaker: Jeff Hammond.

Meta Expands AI Compute Deal, Nvidia GTC Kicks Off | Bloomberg Tech

Bloomberg's Caroline Hyde and Ed Ludlow discuss the rise in