基于斯皮尔曼相关分析与随机森林模型的乌梁素海富营养化预警研究

Early warning study of eutrophication in Ulansuhai Lake based on Spearman correlation analysis and random forest model

  • 摘要: 为解决传统水体富营养化预警模型预测精度不足、难以满足精准监测需求的问题,收集2011—2020年乌梁素海的水质数据,包含水温、化学需氧量、总氮、总磷以及叶绿素a浓度等关键指标,以叶绿素a表征水体富营养化水平,探究叶绿素a浓度时空变化规律,构建了融合斯皮尔曼相关分析与随机森林的水体富营养化预警模型,对乌梁素海水体富营养化状态进行预测,采用准确率、召回率、F1值、ROC曲线和混淆矩阵等多种评估指标验证模型预测的精确性和稳定性。结果表明:乌梁素海2011—2020年叶绿素a浓度波动明显,年均值呈“下降—上升—回落”的趋势。模型在训练集上的准确率达到95.89%,能够准确区分富营养化与非富营养化状态(类别1与类别0)。测试集中总计发生80次类别0判断,16次类别1预警,其中90次与实际水体状态相符,测试准确度达93.75%,召回率为85.71%,F1值为0.8,证明了模型在实际场景中的预警能力,混淆矩阵和ROC曲线分析进一步验证了模型的高效性和可靠性。在训练集训练过程中经过超参数调优,模型的交叉验证平均准确率达到91.32%。特征重要性分析表明,氨氮、总磷等水质指标在水体富营养化状态预测中具有重要影响。该模型为水体富营养化的精准监测和湖泊管理提供了有效预警工具,具备实际应用潜力。

     

    Abstract: Timely and accurate prediction of water eutrophication is of great significance to lake management, but the traditional early warning model of water eutrophication is difficult to meet the demand for accurate monitoring of the eutrophication status of water bodies, and the prediction accuracy still needs to be improved. We collected the water quality data of Ulansuhai Lake from 2011 to 2020, including water temperature, chemical oxygen demand, total nitrogen, total phosphorus, chlorophyll-a concentration, and other key indicators. Chlorophyll a was used to characterize the eutrophication of the water body, and the temporal and spatial patterns of chlorophyll a concentration were explored. A model integrating Spearman's correlation analysis and random forest was constructed to predict the eutrophication status of water bodies in Ulansuhai Lake. The accuracy and stability of the model were verified by using various assessment indexes such as accuracy, recall rate, F1 value, ROC curve, and confusion matrix. The results showed that chlorophyll a concentration in Ulansuhai Lake fluctuated significantly from 2011 to 2020, and the annual average value showed a trend of "decreasing-rising-decreasing". The accuracy of the model in the training set was 95.89%, and it was able to accurately distinguish between eutrophication and non-eutrophication states (category 1 and category 0). A total of 80 category 0 judgments and 16 category 1 warnings occurred in the test set, of which 90 matched the actual water body status, with a test accuracy of 93.75%, a recall rate of 85.71%, and an F1 value of 0.8, which proved the model's ability to provide early warnings in real-world scenarios, and the confusion matrix and ROC curve analyses further verified the model's efficiency and reliability. The model was also tuned with hyperparameters during the training of the training set, resulting in an average cross-validation accuracy of 91.32%. The feature importance analysis showed that the water quality indicators, such as ammonia nitrogen and total phosphorus, played a significant role in the prediction of the eutrophication status of water bodies. This study provides an effective early warning tool for accurate monitoring of water body eutrophication and lake management, and has practical application value.

     

/

返回文章
返回