Buy Crypto Markets Spot FuturesINTC Earn Event Center

This section reviews literature related to Instance-Incremental Learning (IIL), contrasting it with the more explored Class-Incremental LearningThis section reviews literature related to Instance-Incremental Learning (IIL), contrasting it with the more explored Class-Incremental Learning

Incremental Learning: Comparing Methods for Catastrophic Forgetting and Model Promotion

Author: Hackernoon

Source: Hackernoon

2025/11/05 02:00

4 min read

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Table of Links

Abstract and 1 Introduction

Related works
Problem setting
Methodology

4.1. Decision boundary-aware distillation

4.2. Knowledge consolidation
Experimental results and 5.1. Experiment Setup

5.2. Comparison with SOTA methods

5.3. Ablation study
Conclusion and future work and References

\

Supplementary Material

Details of the theoretical analysis on KCEMA mechanism in IIL
Algorithm overview
Dataset details
Implementation details
Visualization of dusted input images
More experimental results

2. Related works

This paper devotes to the instance-incremental learning which is an associated topic to the CIL but seldom investigated. In the following, related topics on class-incremental learning, continual domain adaptation, and methods based on knowledge distillation (KD) are introduced.

\ Class-incremental learning. CIL is proposed to learn new classes without suffering from the notorious catastrophic forgetting problem and is the main topic that most of works focused on in this area. Methods of CIL can be categorized into three types: 1) important weights regularization [1, 10, 19, 32], which constrains the important weights for old tasks and free those unimportant weights for new task. Freezing the weights limits the ability to learn from new data and always lead to a inferior performance on new classes. 2) Rehearsal or pseudo rehearsal method, which stores a small-size of typical exemplars [2, 4, 9, 22] or relies on a generation network to produce old data [23] for old knowledge retaining. Usually, these methods utilize knowledge distillation and perform over the weight regularization method. Although the prototypes of old classes are efficacy in preserving knowledge, they are unable to promote the model’s performance on hard samples, which is always a problem in real deployment. 3) Dynamic network architecture based method [8, 15, 30, 31], which adaptively expenses the network each time for new knowledge learning. However, deploying a changing neural model in real scenarios is troublesome, especially when it goes too big. Although most CIL methods have strong ability in learning new classes, few of them can be directly utilized in the new IIL setting in our test. The reason is that performance promotion on old classes is less emphasized in CIL.

\ Knowledge distillation-based incremental learning. Most of existing incremental learning works utilize knowledge distillation (KD) to mitigate catastrophic forgetting. LwF [12] is one of the earliest approaches that constrains the prediction of new data through KD. iCarl [22] and many other methods distill knowledge on preserved exemplars to free the learning capability on new data. Zhai et al. [33] and Zhang et al. [34] exploit distillation with augmented data and unlabeled auxiliary data at negligible cost. Different from above distillation at label level, Kang et al. [9] and Douillard [4] proposed to distill knowledge at feature level for CIL. Compared to the aforementioned researches, the proposed decision boundary-aware distillation requires no access to old exemplars and is simple but effective in learning new as well as retaining the old knowledge.

\ Comparison with the CDA and ISL. Rencently, some work of continual domain adptation (CDA) [7, 21, 27] and incremental subpopulation learning (ISL) [13] is proposed and has high similarity with the IIL setting. All of the three settings have a fixed label space. The CDA focus on solving the visual domain variations such as illumination and background. ISL is a specific case of CDA and pays more attention to the subcategories within a class, such as Poodles and Terriers. Compared to them, IIL is a more general setting where the concept drift is not limited to the domain shift in CDA or subpopulation shifting problem in ISL. More importantly, the new IIL not only aims to retain the performance but also has to promote the generalization with several new observations in the whole data space.

:::info Authors:

(1) Qiang Nie, Hong Kong University of Science and Technology (Guangzhou);

(2) Weifu Fu, Tencent Youtu Lab;

(3) Yuhuan Lin, Tencent Youtu Lab;

(4) Jialin Li, Tencent Youtu Lab;

(5) Yifeng Zhou, Tencent Youtu Lab;

(6) Yong Liu, Tencent Youtu Lab;

(7) Qiang Nie, Hong Kong University of Science and Technology (Guangzhou);

(8) Chengjie Wang, Tencent Youtu Lab.

:::

:::info This paper is available on arxiv under CC BY-NC-ND 4.0 Deed (Attribution-Noncommercial-Noderivs 4.0 International) license.

:::

CHZ +28%! Will History Repeat?

0-fee opening long & short. Be ready for any move!

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

Tags:

#Hat #Tim #Chi #INU #Vera

24/7 Live News

Ripple Prime launches soon, impacting $112 trillion assets. XRP involved in settlement layer selection.

Author: Ripple Bull Winkle | Crypto Researcher 🚀🚨09:01

Iran tensions impact crypto market sentiment.

Author: DEG07:46

Ripple Prime joins DTCC tokenization group, impacting $114 trillion market.

Author: Ripple Bull Winkle | Crypto Researcher 🚀🚨07:43

BTC lost 2-week ascending channel, awaiting retest and potential dump.

Author: 𝗖𝗛𝗔𝗜𝗡 𝗠𝗜𝗡𝗗 ⛓🧠05:19

SEC approves DTCC to tokenize securities. Ripple's technology follows. Impact on XRP market potential.

Author: Ripple Bull Winkle | Crypto Researcher 🚀🚨05:15

Crypto Prices

Bitcoin

BTC

$64,050.01

$64,050.01$64,050.01

-0.27%

Ethereum

ETH

$1,731.62

$1,731.62$1,731.62

+0.05%

Solana

SOL

$73.56

$73.56$73.56

-0.78%

USDCoin

USDC

$1.00098

$1.00098$1.00098

0.00%

Ethena USDe

USDE

$0.9999

$0.9999$0.9999

-0.02%

World Cup Combo: Aim for 200x

Combine up to 20 World Cup matches in one order

Incremental Learning: Comparing Methods for Catastrophic Forgetting and Model Promotion

Table of Links

2. Related works

You May Also Like

Trump’s enemies kept 'up at night' by one 'unprecedented' advantage he has left

Bitcoin Prediction From February Comes Back Into Focus As BTC Trades Near $65K Zone

Yamal opens World Cup account in style as Spain thrash Saudi Arabia 4-0 to top group

Trending News

Hormuz ship traffic evaporates after Iran again locks down strategic waterway

Taiko Urges Immediate Bridge Withdrawals After $1M Exploit Compromises Chain Security

Why the aviation industry’s next crisis has already taken off — Ahmad Ibrahim

Gotta catch ‘em all... even in Kinarut: Pokémon card craze officially reaches small-town Sabah

Taiko ERC20 Vault Exploit Leads to Estimated $1 Million Loss, Blockaid Reports

24/7 Live News

Quick Reads

BlackRock Launches BITA, a Bitcoin Income ETF: Investors Can Benefit from Bitcoin While Receiving Monthly Cash Flow

SpaceX Secondary Shares Jump 50% as Private Market Demand Surges — Early Sellers Regret Exit Timing

Glamsterdam Enters Final Phase as the quest to Push Ethereum L1 Toward 10000 TPS Draws Closer

Benched Until 2030: How the US CBDC Freeze Extends a Four-Year Runway to Tether and Circle

From Perp DEX to Spot ETF: How a First-Month $900M ETF Run Establishes $HYPE as a Tier-1 Asset

Crypto Prices