DISTMM: Accelerating Distributed Multimodal Model Training
Jun Huang, Zhen Zhang, 2 more authors, Yida Wang
2024 · DBLP: conf/nsdi/HuangZZQ024
Symposium on Networked Systems Design and Implementation · Cited 21 times
TLDR
A new pipeline execution primitive, called the batch-sync instruction, and a corresponding schedule, called DISTMM-Pipe, are proposed to address the limitations of existing pipeline execution schedules for multimodal training with contrastive loss.
