TY - GEN
T1 - THTL
T2 - 31st International Conference on Neural Information Processing, ICONIP 2024
AU - Fang, Xinyi
AU - Luo, Yuqi
AU - Simões, Marco
AU - Yang, Xu
AU - Wang, Yapeng
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Early and accurate identification of benign or malignant laryngeal lesions is crucial for early intervention in laryngeal cancer. To achieve this goal, we propose a two-step heterogeneous transfer learning (THTL) framework based on traditional transfer learning. This framework utilizes heterogeneous medical images from coronary angiography as an intermediate domain, which while differing in semantics and imaging instruments, shares vascular features with the target domain (laryngeal blood vessel images) to a certain extent. We verified our proposed THTL on three different scales of target domains: two extremely challenging ones using only 2% and 10% of the original laryngeal blood vessel dataset for training and validation, and the other larger one using 90%. Compared to traditional transfer learning, THTL achieves better results across all scales. Specifically, THTL improves accuracy, malignant precision, and malignant recall by 3.45%, 13.1%, and 18.73%, and by 9.93%, 11.62%, and 36.76%, respectively, for two limited data scenarios, and exceeds the state-of-the-art accuracy in the target domain scale of 90%. Additionally, to gain deeper insights, a number of comprehensive experiments are conducted on fine-tuning techniques, domain color impacts, and the intermediate domain’s size and complexity, providing valuable information for subsequent studies in applying transfer learning for early laryngeal cancer identification.
AB - Early and accurate identification of benign or malignant laryngeal lesions is crucial for early intervention in laryngeal cancer. To achieve this goal, we propose a two-step heterogeneous transfer learning (THTL) framework based on traditional transfer learning. This framework utilizes heterogeneous medical images from coronary angiography as an intermediate domain, which while differing in semantics and imaging instruments, shares vascular features with the target domain (laryngeal blood vessel images) to a certain extent. We verified our proposed THTL on three different scales of target domains: two extremely challenging ones using only 2% and 10% of the original laryngeal blood vessel dataset for training and validation, and the other larger one using 90%. Compared to traditional transfer learning, THTL achieves better results across all scales. Specifically, THTL improves accuracy, malignant precision, and malignant recall by 3.45%, 13.1%, and 18.73%, and by 9.93%, 11.62%, and 36.76%, respectively, for two limited data scenarios, and exceeds the state-of-the-art accuracy in the target domain scale of 90%. Additionally, to gain deeper insights, a number of comprehensive experiments are conducted on fine-tuning techniques, domain color impacts, and the intermediate domain’s size and complexity, providing valuable information for subsequent studies in applying transfer learning for early laryngeal cancer identification.
KW - Larynx Cancer
KW - Medical Image Classification
KW - Transfer Learning
UR - https://www.scopus.com/pages/publications/105009402897
U2 - 10.1007/978-981-96-6963-9_8
DO - 10.1007/978-981-96-6963-9_8
M3 - Conference contribution
AN - SCOPUS:105009402897
SN - 9789819669622
T3 - Communications in Computer and Information Science
SP - 105
EP - 119
BT - Neural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
A2 - Mahmud, Mufti
A2 - Doborjeh, Maryam
A2 - Wong, Kevin
A2 - Leung, Andrew Chi Sing
A2 - Doborjeh, Zohreh
A2 - Tanveer, M.
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 2 December 2024 through 6 December 2024
ER -