SmallFishBD: An extensive image dataset of common native small fish species in Bangladesh for identification and classificationMendeley Data
Uloženo v:
| Název: | SmallFishBD: An extensive image dataset of common native small fish species in Bangladesh for identification and classificationMendeley Data |
|---|---|
| Autoři: | Md Hasanul Ferdaus, Rizvee Hassan Prito, Masud Ahmed, Syeda Raisha Abedin Ohona, Khandaker Golam Morshed, Israt Jahan Jarin, Mohammad Manzurul Islam, Nishat Tasnim Niloy, Md Sawkat Ali, Maheen Islam, Taskeed Jabid, Md Mizanur Rahoman |
| Zdroj: | Data in Brief, Vol 63, Iss , Pp 112193- (2025) |
| Informace o vydavateli: | Elsevier, 2025. |
| Rok vydání: | 2025 |
| Sbírka: | LCC:Computer applications to medicine. Medical informatics LCC:Science (General) |
| Témata: | Aquaculture, Computer vision, Ecological monitoring, Image processing, Machine learning, Marine Biology, Computer applications to medicine. Medical informatics, R858-859.7, Science (General), Q1-390 |
| Popis: | This data article presents a comprehensive image dataset of ten native small fish species commonly found in Bangladesh: Bele (Glossogobius giuris), Chanda Nama (Chanda nama), Chela (Salmostoma bacaila), Guchi (Mastacembelus pancalus), Kachki (Corica soborna), Mola (Amblypharyngodon mola), Kata Phasa (Stolephorus tri), Pabda (Ompok pabda), Puti (Puntius sophore), and Tengra (Mystus vittatus). The dataset was carefully curated to facilitate the study and research in fish species identification, classification, and biodiversity monitoring. Specimens of these species were collected from various fish markets in the capital city Dhaka. Different varieties of fish are supplied to Dhaka city from diverse geographical locations in Bangladesh. Thus, the dataset ensures a representative sampling of local aquatic biodiversity.To maintain uniformity across samples, images were captured using a smartphone camera under a standardized and controlled environment. Each specimen was placed against a neutral background with consistent lighting conditions. This limits environmental variability and enhances image quality for analytical use. The dataset contains high-resolution original images that were augmented using standard data augmentation techniques. This augmentation introduced variations such as rotations, flipping, and brightness adjustments. This expands the dataset and improves its utility for training robust machine learning (ML) and deep learning (DL) models in computer vision applications.The dataset has significant reuse potential across multiple domains. It serves as a critical resource for researchers and industry experts to develop automated systems for fish species identification and classification, particularly in the context of the rich aquatic biodiversity in Bangladesh. Furthermore, the dataset can facilitate ecological and environmental studies and research by supporting the monitoring of native fish species distribution and population dynamics. Its structured format facilitates integration into ML/DL pipelines that can foster advancements in fisheries management, sustainable aquaculture, conservation biology, and economic and cultural studies. Thus, the dataset represents a significant step towards integrating technological advancements and ecological sustainability. This article outlines the utility of the data, the dataset structure, the data collection methodology, and the applied augmentation processes to ensure transparency and reproducibility for future research endeavors. |
| Druh dokumentu: | article |
| Popis souboru: | electronic resource |
| Jazyk: | English |
| ISSN: | 2352-3409 |
| Relation: | http://www.sciencedirect.com/science/article/pii/S235234092500914X; https://doaj.org/toc/2352-3409 |
| DOI: | 10.1016/j.dib.2025.112193 |
| Přístupová URL adresa: | https://doaj.org/article/42fb93e232434d679823c268cd175275 |
| Přístupové číslo: | edsdoj.42fb93e232434d679823c268cd175275 |
| Databáze: | Directory of Open Access Journals |
| Abstrakt: | This data article presents a comprehensive image dataset of ten native small fish species commonly found in Bangladesh: Bele (Glossogobius giuris), Chanda Nama (Chanda nama), Chela (Salmostoma bacaila), Guchi (Mastacembelus pancalus), Kachki (Corica soborna), Mola (Amblypharyngodon mola), Kata Phasa (Stolephorus tri), Pabda (Ompok pabda), Puti (Puntius sophore), and Tengra (Mystus vittatus). The dataset was carefully curated to facilitate the study and research in fish species identification, classification, and biodiversity monitoring. Specimens of these species were collected from various fish markets in the capital city Dhaka. Different varieties of fish are supplied to Dhaka city from diverse geographical locations in Bangladesh. Thus, the dataset ensures a representative sampling of local aquatic biodiversity.To maintain uniformity across samples, images were captured using a smartphone camera under a standardized and controlled environment. Each specimen was placed against a neutral background with consistent lighting conditions. This limits environmental variability and enhances image quality for analytical use. The dataset contains high-resolution original images that were augmented using standard data augmentation techniques. This augmentation introduced variations such as rotations, flipping, and brightness adjustments. This expands the dataset and improves its utility for training robust machine learning (ML) and deep learning (DL) models in computer vision applications.The dataset has significant reuse potential across multiple domains. It serves as a critical resource for researchers and industry experts to develop automated systems for fish species identification and classification, particularly in the context of the rich aquatic biodiversity in Bangladesh. Furthermore, the dataset can facilitate ecological and environmental studies and research by supporting the monitoring of native fish species distribution and population dynamics. Its structured format facilitates integration into ML/DL pipelines that can foster advancements in fisheries management, sustainable aquaculture, conservation biology, and economic and cultural studies. Thus, the dataset represents a significant step towards integrating technological advancements and ecological sustainability. This article outlines the utility of the data, the dataset structure, the data collection methodology, and the applied augmentation processes to ensure transparency and reproducibility for future research endeavors. |
|---|---|
| ISSN: | 23523409 |
| DOI: | 10.1016/j.dib.2025.112193 |
Full Text Finder
Nájsť tento článok vo Web of Science