Using Multiple Heads to Subsize Meta-memorization Problem

研究成果: Conference contribution同行評審

1 引文 斯高帕斯(Scopus)

摘要

The memorization problem is a meta-level overfitting phenomenon in meta-learning. The trained model prefers to remember learned tasks instead of adapting to new tasks. This issue limits many meta-learning approaches to generalize. In this paper, we mitigate this limitation issue by proposing multiple supervisions through a multi-objective optimization process. The design leads to a Multi-Input Multi-Output (MIMO) configuration for meta-learning. The model has multiple outputs through different heads. Each head is supervised by a different order of labels for the same task. This leads to different memories, resulting in meta-level conflicts as regularization to avoid meta-overfitting. The resulting MIMO configuration is applicable to all MAML-like algorithms with minor increments in training computation, the inference calculation can be reduced through early-exit policy or better performance can be achieved through low cost ensemble. In experiments, identical model and training settings are used in all test cases, our proposed design is able to suppress the meta-overfitting issue, achieve smoother loss landscapes, and improve generalisation.

原文English
主出版物標題Artificial Neural Networks and Machine Learning - ICANN 2022 - 31st International Conference on Artificial Neural Networks, Proceedings
編輯Elias Pimenidis, Mehmet Aydin, Plamen Angelov, Chrisina Jayne, Antonios Papaleonidas
發行者Springer Science and Business Media Deutschland GmbH
頁面496-507
頁數12
ISBN(列印)9783031159367
DOIs
出版狀態Published - 2022
事件31st International Conference on Artificial Neural Networks, ICANN 2022 - Bristol, United Kingdom
持續時間: 6 9月 20229 9月 2022

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13532 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference31st International Conference on Artificial Neural Networks, ICANN 2022
國家/地區United Kingdom
城市Bristol
期間6/09/229/09/22

指紋

深入研究「Using Multiple Heads to Subsize Meta-memorization Problem」主題。共同形成了獨特的指紋。

引用此