BioAutoML-NAS: An End-to-End AutoML Framework for Multimodal Insect Classification via Neural Architecture Search on Large-Scale Biodiversity Data

๐Ÿ“… 2025-10-07
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Insect classification is critical for agricultural and ecological research but faces challenges including morphological complexity, severe class imbalance, and large-scale data. To address these, this work pioneers the application of neural architecture search (NAS) to multimodal insect classification, proposing a joint modeling framework that fuses visual features and metadata. We introduce an alternating bilevel optimization strategy coupled with a zero-operation pruning mechanism to automatically construct high-performance, sparse networks. Furthermore, multi-cell stacking and sparsity constraints jointly balance representational capacity and computational efficiency. On the BIOSCAN-5M benchmark, our method achieves 96.81% accuracy and 97.05% F1-scoreโ€”surpassing state-of-the-art transfer learning, Transformer-based, and AutoML approaches by 8โ€“16 percentage points. It also demonstrates strong generalization on Insects-1M, confirming its practicality and robustness across diverse insect datasets.

Technology Category

Application Category

๐Ÿ“ Abstract
Insect classification is important for agricultural management and ecological research, as it directly affects crop health and production. However, this task remains challenging due to the complex characteristics of insects, class imbalance, and large-scale datasets. To address these issues, we propose BioAutoML-NAS, the first BioAutoML model using multimodal data, including images, and metadata, which applies neural architecture search (NAS) for images to automatically learn the best operations for each connection within each cell. Multiple cells are stacked to form the full network, each extracting detailed image feature representations. A multimodal fusion module combines image embeddings with metadata, allowing the model to use both visual and categorical biological information to classify insects. An alternating bi-level optimization training strategy jointly updates network weights and architecture parameters, while zero operations remove less important connections, producing sparse, efficient, and high-performing architectures. Extensive evaluation on the BIOSCAN-5M dataset demonstrates that BioAutoML-NAS achieves 96.81% accuracy, 97.46% precision, 96.81% recall, and a 97.05% F1 score, outperforming state-of-the-art transfer learning, transformer, AutoML, and NAS methods by approximately 16%, 10%, and 8% respectively. Further validation on the Insects-1M dataset obtains 93.25% accuracy, 93.71% precision, 92.74% recall, and a 93.22% F1 score. These results demonstrate that BioAutoML-NAS provides accurate, confident insect classification that supports modern sustainable farming.
Problem

Research questions and friction points this paper is trying to address.

Automating insect classification using multimodal data and neural architecture search
Addressing class imbalance and dataset scale challenges in biodiversity analysis
Optimizing network architecture for efficient and accurate insect identification
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses neural architecture search for image feature extraction
Fuses image embeddings with metadata for multimodal classification
Employs bi-level optimization and zero operations for efficiency
A
Arefin Ittesafun Abian
Department of Computer Science and Engineering, United International University, Dhaka, Bangladesh
D
Debopom Sutradhar
Department of Computer Science and Engineering, United International University, Dhaka, Bangladesh
M
Md Rafi Ur Rashid
Department of Computer Science and Engineering, Penn State University, University Park, PA, USA
R
Reem E. Mohamed
Faculty of Science and Information Technology, Charles Darwin University, Sydney, NSW, Australia
M
Md Rafiqul Islam
Faculty of Science and Technology, Charles Darwin University, Casuarina, 0909, NT, Australia
A
Asif Karim
Faculty of Science and Technology, Charles Darwin University, Casuarina, 0909, NT, Australia
Kheng Cher Yeo
Kheng Cher Yeo
Charles Darwin University
S
Sami Azam
Faculty of Science and Technology, Charles Darwin University, Casuarina, 0909, NT, Australia