Abstract
Plant identification has applications in ethnopharmacology and agriculture. Since leaves are one of a distinguishable feature of a plant, they are routinely used for identification. Recent developments in deep learning have made it possible to accurately identify the majority of samples in five publicly available leaf datasets. However, each dataset captures the images in a highly controlled environment. This paper evaluates the performance of EfficientNet models, B1 to B6, and several other convolutional neural network (CNN) architectures when applied to a combination of the LeafSnap, Middle European Woody Plants 2014, Flavia, Swedish, and Folio datasets. To normalize the impact of imbalance resulting from combining the original datasets, oversampling, undersampling, and transfer learning techniques were used to construct an end-to-end CNN classifier. Greater emphasis was placed on metrics that are appropriate for a diverse-imbalanced dataset, rather than stressing high performance on any one of the original datasets. The B6 model of EfficientNet achieved highly accurate results, with an F-score of 0.9938 on the combined dataset.
Viewing Instructions
The downloadable file is very large and will take additional time to download.
You may wish to download and view these individual folders of images.
There are 364 folders of 42,420 images.
Recommended Citation
Gajjar, Viraj K.; Nambisan, Anand; and Kosbar, Kurt Louis, "Plant Identification in a Combined-Imbalanced Leaf Dataset -- Images" (2021). Research Data. 8.
https://scholarsmine.mst.edu/research_data/8
Contact Information
Viraj Gajjar, vgf4c@mst.edu
Ph.D. candidate, Electrical Engineering Department
Missouri University of Science and Technology
Anand Nambisan, akn36d@mst.edu
Ph.D. candidate, Electrical Engineering Department
Missouri University of Science and Technology
Dr. Kurt Kosbar, kosbar@mst.edu
Associate Professor, Electrical Engineering Department
Missouri University of Science and Technology
Department(s)
Electrical and Computer Engineering
Document Type
Data
Document Version
Final Version
File Format
text
File Size
22 GB
Language(s)
English
Rights
© 2021 Missouri University of Science and Technology, All rights reserved.
Python code for Stratified k-fold cross-validation
Comments
Recommended citations