DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models

Nastaran Saadati, Minh Pham, Nasla Saleem, Joshua R. Waite, Aditya Balu, Zhanong Jiang, Chinmay Hegde, Soumik Sarkar

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Recent advances in decentralized deep learning algorithms have demonstrated cutting-edge performance on various tasks with large pretrained models. However, a pivotal prerequisite for achieving this level of competitiveness is the significant communication and computation overheads when updating these models, which prohibits the applications of them to real-world scenarios. To address this issue, drawing inspiration from advanced model merging techniques without requiring additional training, we introduce the Decentralized Iterative Merging-And-Training (DIMAT) paradigm-a novel decentralized deep learning framework. Within DIMAT, each agent is trained on their local data and periodically merged with their neighboring agents using advanced model merging techniques like activation matching until convergence is achieved. DIMAT provably converges with the best available rate for non-convex functions with various first-order methods, while yielding tighter error bounds compared to the popular existing approaches. We conduct a comprehensive empirical analysis to validate DIMAT's superiority over baselines across diverse computer vision tasks sourced from multiple datasets. Empirical results validate our theoretical claims by showing that DIMAT attains faster and higher initial gain in accuracy with independent and identically distributed (IID) and non-IID data, incurring lower communication overhead. This DIMAT paradigm presents a new op-portunity for the future decentralized learning, enhancing its adaptability to real-world with sparse and lightweight communication and computation.

    Original languageEnglish (US)
    Title of host publicationProceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
    PublisherIEEE Computer Society
    Pages27507-27517
    Number of pages11
    ISBN (Electronic)9798350353006
    DOIs
    StatePublished - 2024
    Event2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024 - Seattle, United States
    Duration: Jun 16 2024Jun 22 2024

    Publication series

    NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
    ISSN (Print)1063-6919

    Conference

    Conference2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
    Country/TerritoryUnited States
    CitySeattle
    Period6/16/246/22/24

    Keywords

    • Convergence
    • Decentralized Learning
    • Deep Learning
    • Model Merging
    • Non-Convex Optimization

    ASJC Scopus subject areas

    • Software
    • Computer Vision and Pattern Recognition

    Fingerprint

    Dive into the research topics of 'DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models'. Together they form a unique fingerprint.

    Cite this