LoRASC: Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning

Siwei Li, Yifan Yang, Yifei Shen, Fangyun Wei, Zongqing Lu, Lili Qiu, Yuqing Yang


Abstract
Efficient fine-tuning plays a fundamental role in modern large models, with low-rank adaptation emerging as a particularly promising approach. However, the existing variants of LoRA are hampered by limited expressiveness, a tendency to overfit, and sensitivity to hyperparameter settings. This paper presents LoRA Slow Cascade Learning (LoRASC), an innovative technique designed to enhance LoRA’s expressiveness and generalization capabilities while preserving its training efficiency. Our approach augments expressiveness through a cascaded learning strategy that enables a mixture-of-low-rank adaptation, thereby increasing the model’s ability to capture complex patterns. Additionally, we introduce a slow-fast update mechanism and cascading noisy tuning to bolster generalization. The extensive experiments on various language and vision datasets, as well as robustness benchmarks, demonstrate that the proposed method not only significantly outperforms existing baselines, but also mitigates overfitting, enhances model stability, and improves OOD robustness.
Anthology ID:
2024.findings-emnlp.748
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
12806–12816
Language:
URL:
https://aclanthology.org/2024.findings-emnlp.748/
DOI:
10.18653/v1/2024.findings-emnlp.748
Bibkey:
Cite (ACL):
Siwei Li, Yifan Yang, Yifei Shen, Fangyun Wei, Zongqing Lu, Lili Qiu, and Yuqing Yang. 2024. LoRASC: Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 12806–12816, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
LoRASC: Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning (Li et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-emnlp.748.pdf