Category-Wise Fine-Tuning for Image Multi-label Classification with Partial Labels

Chak Fong Chong, Xu Yang, Tenglong Wang, Wei Ke, Yapeng Wang

Research output: Conference contribution, peer-reviewed

1 Citation (Scopus)

Abstract

Image multi-label classification datasets are often partially labeled: for each sample, the labels of only some categories are known. One popular approach to training convolutional neural networks on such data, called Negative mode, treats all unknown labels as negative. However, it introduces wrong labels unevenly across categories, degrading binary classification performance on different categories to varying degrees. Ignore mode, which ignores the contributions of unknown labels, may be less effective than Negative mode, but it guarantees that no additional wrong labels are introduced into the data, which Negative mode cannot. In this paper, we propose Category-wise Fine-Tuning (CFT), a new post-training method that can be applied to a model trained with Negative mode to improve its performance on each category independently. Specifically, CFT uses Ignore mode to fine-tune the logistic regressions (LRs) in the classification layer one by one, which reduces the performance loss caused by the wrong labels that Negative mode introduces during training. In particular, a Genetic Algorithm (GA) and binary cross-entropy are used in CFT to fine-tune the LRs. We evaluate our methods on the CheXpert competition dataset and achieve state-of-the-art results, to our knowledge. A single model submitted to the competition server for official evaluation achieves an mAUC of 91.82% on the test set, the highest single-model score on the leaderboard and in the literature. Moreover, our ensemble achieves an mAUC of 93.33% on the test set, superior to the best on the leaderboard and in the literature (93.05%); because the competition had recently closed, we evaluated the ensemble on a local machine after the test set was released for download. We also evaluate our methods on partially labeled versions of the MS-COCO dataset.
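The difference between the two training modes, and the idea of fine-tuning one category's logistic regression on its known labels only, can be illustrated with a minimal NumPy sketch. Everything here is an assumption made for illustration: the label encoding (1 positive, 0 negative, -1 unknown), the function names, and especially the optimizer. The abstract states that CFT uses a Genetic Algorithm and binary cross-entropy; this sketch substitutes plain gradient descent on binary cross-entropy and is not the authors' implementation.

```python
import numpy as np

UNKNOWN = -1  # assumed encoding: 1 = positive, 0 = negative, -1 = unknown


def bce_per_label(probs, targets, eps=1e-7):
    """Element-wise binary cross-entropy."""
    p = np.clip(probs, eps, 1.0 - eps)
    return -(targets * np.log(p) + (1.0 - targets) * np.log(1.0 - p))


def negative_mode_loss(probs, labels):
    """Negative mode: every unknown label is treated as a negative label."""
    targets = np.where(labels == 1, 1.0, 0.0)
    return bce_per_label(probs, targets).mean()


def ignore_mode_loss(probs, labels):
    """Ignore mode: unknown labels contribute nothing to the loss."""
    known = labels != UNKNOWN
    targets = np.where(labels == 1, 1.0, 0.0)
    losses = bce_per_label(probs, targets)
    return losses[known].sum() / max(int(known.sum()), 1)


def finetune_category(features, labels_c, w, b, lr=0.1, steps=200):
    """Fine-tune ONE category's logistic regression in Ignore mode.

    features: (n, d) outputs of the frozen backbone (hypothetical input).
    labels_c: (n,) labels of a single category, with unknowns marked -1.
    Uses gradient descent on BCE, a stand-in for the GA named in the abstract.
    """
    known = labels_c != UNKNOWN
    x, y = features[known], labels_c[known].astype(float)
    if len(y) == 0:  # nothing known for this category: leave it unchanged
        return w, b
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(x @ w + b)))
        grad = p - y  # derivative of BCE with respect to the logit
        w = w - lr * (x.T @ grad) / len(y)
        b = b - lr * grad.mean()
    return w, b
```

Applying `finetune_category` to each category in turn, with the backbone frozen, mirrors the one-by-one structure described above: each logistic regression sees only the labels that are actually known for its own category.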

Original language: English
Title of host publication: Neural Information Processing - 30th International Conference, ICONIP 2023, Proceedings
Editors: Biao Luo, Long Cheng, Zheng-Guang Wu, Hongyi Li, Chaojie Li
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 332-345
Number of pages: 14
ISBN (Print): 9789819981441
DOIs
Publication status: Published - 2024
Event: 30th International Conference on Neural Information Processing, ICONIP 2023 - Changsha, China
Duration: 20 Nov 2023 to 23 Nov 2023

Publication series

Name: Communications in Computer and Information Science
Volume: 1965 CCIS
ISSN (Print): 1865-0929
ISSN (Electronic): 1865-0937

Conference

Conference: 30th International Conference on Neural Information Processing, ICONIP 2023
Country/Territory: China
City: Changsha
Period: 20/11/23 to 23/11/23
