UPDF AI

DISTMM: Accelerating Distributed Multimodal Model Training

Jun Huang,Zhen Zhang,2 作者,Yida Wang

2024 · DBLP: conf/nsdi/HuangZZQ024
Symposium on Networked Systems Design and Implementation · 引用 21 次

TLDR

A new pipeline execution primitive, called batch-sync instruction, and a corresponding schedule, called D IST MM-Pipe are proposed, which addresses the limitation of existing pipeline execution schedules for multimodal training with contrastive loss.