Quick Overview: Presenter(s): James Hongyi Zeng, Senior Engineering Manager, Benjamin Glick Pouya Kousha, Arnav Goel ( Want to scale beyond the limits of a single

Gpu Communication Library In Meta - Detailed Overview & Context

Presenter(s): James Hongyi Zeng, Senior Engineering Manager, Benjamin Glick Pouya Kousha, Arnav Goel ( Want to scale beyond the limits of a single In this AI Research Roundup episode, Alex discusses the paper: 'Collective RDMA (Remote Direct Memory Access) is the secret sauce behind fast What is CUDA? And how does parallel computing on the

Zhiyi Hu, Siyuan Shen, Tommaso Bonato (ETH Zurich), Sylvain Jeaugey ( NCCL: High-Speed Inter-GPU Communication for Large-Scale Training - Sylvain Jeaugey, NVIDIA AI clusters are difficult to manage. There are multiple hardware and software elements to coordinate and constant updates thatย ... RSC is also estimated to be 9x faster, at running the ML Performance research paper reading group session 1 meeting (2024/11/29). This was an intro session covering prerequisiteย ... Bloomberg's Caroline Hyde and Ed Ludlow discuss the rise in

Photo Gallery

GPU Communication Library in Meta-Scale AI Clusters
NCCL Explained: How NVIDIA's GPU Communication Library Powers Distributed Deep Learning
Tutorial: GPU Communication Libraries for Accelerating HPC and AI Applications
Multi-GPU Communication Libraries for Scaling HPC and AI Workloads | NVIDIA GTC 2025
NCCLX: Collective Comms for 100k+ GPUs
GPUs: Explained
Lecture 17: NCCL
Collective Communication for 100k+ GPUs -- Paper Study
Demystifying RDMA Protocols for GPU Data Centers | NVlink, Connectx, EFA, Infiniband, GPUDirect
Getting Started with Distributed Multi-GPU Libraries for Scalable AI and HPC | NVIDIA GTC 2025
Nvidia CUDA in 100 Seconds
Demystifying NCCL An In depth Analysis of GPU Communication Protocols and Algorithms - Zhiyi Hu
Sponsored
Sponsored
View Main Result
Sponsored
Sponsored