CSpace
Supervised machine learning improves general applicability of eDNA metabarcoding for reservoir health monitoring
Hu, Huan1,2; Wei, Xing-Yi1,2; Liu, Li2,3; Wang, Yuan-Bo1,2; Jia, Huang-Jie2; Bu, Ling-Kang2; Pei, De-Sheng4
2023-11-01
摘要Effective and standardized monitoring methodologies are vital for successful reservoir restoration and management. Environmental DNA (eDNA) metabarcoding sequencing offers a promising alternative for bio-monitoring and can overcome many limitations of traditional morphological bioassessment. Recent attempts have even shown that supervised machine learning (SML) can directly infer biotic indices (BI) from eDNA metabarcoding data, bypassing the cumbersome calculation process of BI regardless of the taxonomic assignment of eDNA sequences. However, questions surrounding the general applicability of this taxonomy-free approach to monitoring reservoir health remain unclear, including model stability, feature selection, algorithm choice, and multi-season biomonitoring. Here, we firstly developed a novel biological integrity index (Me-IBI) that integrates multitrophic interactions and environmental information, based on taxonomy-assigned eDNA metabarcoding data. The Me-IBI can better distinguish the actual health status of the Three Gorges Reservoir (TGR) than physicochemical assessments and have a clear response to human activity. Then, taking this reliable Me-IBI as a supervised label, we compared the impact of selecting different numbers of features and SML algorithms on the stability and predictive performance of the model for predicting ecological conditions in multiple seasons using taxonomy-free eDNA metabarcoding data. We discovered that even with a small number of features, different SML algorithms can establish a stable model and obtain excellent predictive performance. Finally, we proposed a four-step strategy for standardized routine biomonitoring using SML tools. Our study firstly explores the general applicability problem of the taxonomy-free eDNA-SML approach and establishes a solid foundation for the large-scale and standardized biomonitoring application.
关键词Three Gorges Reservoir Physicochemical assessment Water quality index eDNA metabarcoding Index of biotic integrity Supervised machine learning
DOI10.1016/j.watres.2023.120686
发表期刊WATER RESEARCH
ISSN0043-1354
卷号246页码:12
通讯作者Pei, De-Sheng(peids@cqmu.edu.cn)
收录类别SCI
WOS记录号WOS:001099766500001
语种英语