TY - JOUR
T1 - Challenges and Opportunities in Tomato Leaf Disease Detection with Limited and Multimodal Data
T2 - A Review
AU - Hu, Yingbiao
AU - Li, Huinian
AU - Yang, Chengcheng
AU - Chen, Ningxia
AU - Pan, Zhenfu
AU - Ke, Wei
N1 - Publisher Copyright:
© 2026 by the authors.
PY - 2026/2
Y1 - 2026/2
N2 - Tomato leaf diseases cause substantial yield and quality losses worldwide, yet reliable detection in real fields remains challenging. Two practical bottlenecks dominate current research: (i) limited data, including small samples for rare diseases, class imbalance, and noisy field images, and (ii) multimodal heterogeneity, where RGB images, textual symptom descriptions, spectral cues, and optional molecular assays provide complementary but hard-to-align evidence. This review summarizes recent advances in tomato leaf disease detection under these constraints. We first formalize the problem settings of limited and multimodal data and analyze their impacts on model generalization. We then survey representative solutions for limited data (transfer learning, data augmentation, few-/zero-shot learning, self-supervised learning, and knowledge distillation) and multimodal fusion (feature-, decision-, and hybrid-level strategies, with attention-based alignment). Typical model–dataset pairs are compared, with emphasis on cross-domain robustness and deployment cost. Finally, we outline open challenges—including weak generalization in complex field environments, limited interpretability of multimodal models, and the absence of unified multimodal benchmarks—and discuss future opportunities toward lightweight, edge-ready, and scalable multimodal systems for precision agriculture.
AB - Tomato leaf diseases cause substantial yield and quality losses worldwide, yet reliable detection in real fields remains challenging. Two practical bottlenecks dominate current research: (i) limited data, including small samples for rare diseases, class imbalance, and noisy field images, and (ii) multimodal heterogeneity, where RGB images, textual symptom descriptions, spectral cues, and optional molecular assays provide complementary but hard-to-align evidence. This review summarizes recent advances in tomato leaf disease detection under these constraints. We first formalize the problem settings of limited and multimodal data and analyze their impacts on model generalization. We then survey representative solutions for limited data (transfer learning, data augmentation, few-/zero-shot learning, self-supervised learning, and knowledge distillation) and multimodal fusion (feature-, decision-, and hybrid-level strategies, with attention-based alignment). Typical model–dataset pairs are compared, with emphasis on cross-domain robustness and deployment cost. Finally, we outline open challenges—including weak generalization in complex field environments, limited interpretability of multimodal models, and the absence of unified multimodal benchmarks—and discuss future opportunities toward lightweight, edge-ready, and scalable multimodal systems for precision agriculture.
KW - domain generalization
KW - few-shot learning
KW - lightweight models
KW - limited data learning
KW - mathematical modeling
KW - multimodal fusion
KW - precision agriculture
KW - self-supervised learning
KW - tomato leaf disease detection
UR - https://www.scopus.com/pages/publications/105030190268
U2 - 10.3390/math14030422
DO - 10.3390/math14030422
M3 - Review article
AN - SCOPUS:105030190268
SN - 2227-7390
VL - 14
JO - Mathematics
JF - Mathematics
IS - 3
M1 - 422
ER -