Learn more, but bother less: parameter efficient continual learning

Name: Learn more, but bother less: parameter efficient continual learning
Start: 2025-12-03T12:00:00-07:00
End: 2025-12-03T12:30:00-07:00
Location: Online (Zoom)

Reza Rahimi Azghan

Abstract

Large Language Models (LLMs) have demonstrated profound capabilities due to their extensive pre-training on diverse corpora. However, LLMs often suffer from catastrophic forgetting when learning tasks sequentially. In this paper, we propose a novel parameter-efficient approach for continual learning in LLMs. The approach empirically investigates knowledge transfer from previously learned tasks to new tasks through low-rank matrix parameters, enhancing the learning of new tasks without significant interference. Our method applies sensitivity-based analysis to the low-rank matrix parameters to identify knowledge-specific parameters between sequential tasks, which are then used to initialize the low-rank matrix parameters of new tasks. To minimize forgetting, we further apply a gradient projection technique that keeps the low-rank subspaces of each new task orthogonal to those of previous tasks. Our experimental results on continual learning benchmarks validate the efficacy of the proposed method, which outperforms existing state-of-the-art methods in reducing forgetting, enhancing task performance, and preserving the model’s ability to generalize to unseen tasks.
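
The two ideas named in the abstract, sensitivity scoring and orthogonal gradient projection, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names are hypothetical, the sensitivity score |theta * gradient| is an assumed first-order form commonly used in pruning work, and flat Python lists stand in for the low-rank matrix parameters.

```python
from typing import List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def orthonormalize(vectors: List[Vector], eps: float = 1e-12) -> List[Vector]:
    """Gram-Schmidt: return an orthonormal basis for the span of `vectors`."""
    basis: List[Vector] = []
    for v in vectors:
        w = list(v)
        for b in basis:
            c = dot(w, b)
            w = [wi - c * bi for wi, bi in zip(w, b)]
        norm = dot(w, w) ** 0.5
        if norm > eps:  # skip directions already covered by the basis
            basis.append([wi / norm for wi in w])
    return basis


def project_out(grad: Vector, prev_directions: List[Vector]) -> Vector:
    """Remove from `grad` every component lying in span(prev_directions).

    The result is orthogonal to all previous-task directions, so a step
    along it does not move the parameters within that subspace.
    """
    g = list(grad)
    for b in orthonormalize(prev_directions):
        c = dot(g, b)
        g = [gi - c * bi for gi, bi in zip(g, b)]
    return g


def sensitivity(params: Vector, grads: Vector) -> Vector:
    """Assumed first-order sensitivity score |theta * dL/dtheta| per parameter."""
    return [abs(p * g) for p, g in zip(params, grads)]


def top_k_indices(scores: Vector, k: int) -> List[int]:
    """Indices of the k highest-scoring parameters, highest first."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]


# Hypothetical toy data: two directions kept from earlier tasks, one new gradient.
prev_directions = [[1.0, 0.0, 0.0], [1.0, 1.0, 0.0]]
new_grad = [3.0, 4.0, 5.0]
projected = project_out(new_grad, prev_directions)

# Hypothetical parameters and gradients from a previous task.
scores = sensitivity([2.0, -1.0, 0.5], [0.1, 3.0, -4.0])
selected = top_k_indices(scores, 2)
```

In this toy setting, `projected` keeps only the part of the new gradient outside the span of the earlier directions, and `selected` picks the parameters that a sensitivity-based initialization might carry over to the next task.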

Date

Dec 3, 2025 12:00 PM — 12:30 PM

Event

EMIL Spring'25 Seminars

Reza Rahimi Azghan

Grad Research Associate