基于斯皮尔曼与随机森林的乌梁素海富营养化预警研究

Early warning study of eutrophication in the Ulansuhai Lake based on Spearman and random forests

  • 摘要: 及时且精准地预测水体富营养化对湖泊管理具有重要意义,但传统水体富营养化预警模型难以满足对水体富营养化状态精准监测的需求,预测精度仍需提升。本文收集了2011—2020年乌梁素海的水质数据,包含水温、化学需氧量、总氮、总磷以及叶绿素a浓度等关键指标,其中以叶绿素a表征水体富营养化,探究叶绿素a浓度时空变化规律,构建了融合斯皮尔曼相关分析与随机森林的水体富营养化预警模型,对乌梁素海水体富营养化状态进行预测,采用准确率、召回率、F1值、ROC曲线和混淆矩阵等多种评估指标验证模型预测的精确性和稳定性。结果表明:乌梁素海2011—2020年叶绿素a浓度波动明显,年均值呈“下降—上升—回落”的趋势。模型在训练集上的准确率达到95.89%,能够准确区分富营养化与非富营养化状态(类别1与类别0)。测试集中总计发生80次类别0判断,16次类别1预警,其中90次与实际水体状态相符,测试准确度达93.75%,召回率为85.71%,F1值为0.8,证明了模型在实际场景中的预警能力,混淆矩阵和ROC曲线分析进一步验证了模型的高效性和可靠性。在训练集训练过程中还经过超参数调优,模型的交叉验证平均准确率达到91.32%。通过特征重要性分析可知,氨氮、总磷等水质指标在水体富营养化状态预测中具有重要影响。本研究为水体富营养化的精准监测和湖泊管理提供了有效的预警工具,并具有较高的实际应用价值。

     

    Abstract: Timely and accurate prediction of water eutrophication is of great significance to lake management, but the traditional early warning model of water eutrophication is difficult to meet the demand for accurate monitoring of the eutrophication status of water bodies, and the prediction accuracy still needs to be improved. This paper collects the water quality data of Ulansuhai Lake from 2011 to 2020, including water temperature, chemical oxygen demand, total nitrogen, total phosphorus, and chlorophyll a concentration and other key indicators, in which chlorophyll a characterizes the eutrophication of the water body, explores the temporal and spatial patterns of chlorophyll a concentration, and constructs a water body eutrophication early warning model integrating the Spearman's correlation analysis and the Random Forest, and then provides an early warning model for the water body eutrophication of the lake. A model integrating Spearman's correlation analysis and random forest was constructed to predict the eutrophication status of water bodies in the Ulansuhai Lake, and the accuracy and stability of the model were verified by using various assessment indexes such as accuracy, recall, F1 value, ROC curve and confusion matrix. The results showed that the chlorophyll a concentration in the Ulansuhai Lake fluctuated significantly from 2011 to 2020, and the annual average value showed a trend of “decreasing-rising-decreasing”. The accuracy of the model in the training set was 95.89%, and it was able to accurately distinguish between eutrophication and non-eutrophication states (category 1 and category 0). A total of 80 category 0 judgments and 16 category 1 warnings occurred in the test set, of which 90 matched the actual water body status, with a test accuracy of 93.75%, a recall rate of 85.71%, and an F1 value of 0.8, which proved the model's ability to provide early warnings in real-world scenarios, and the confusion matrix and ROC curve analyses further verified the model's efficiency and reliability. The model was also tuned with hyper-parameters during the training of the training set, and the average accuracy of cross-validation of the model reached 91.32%. The feature importance analysis shows that the water quality indicators such as ammonia nitrogen and total phosphorus have an important influence in the prediction of eutrophication status of water bodies. This study provides an effective early warning tool for accurate monitoring of water body eutrophication and lake management, and has practical application value.

     

/

返回文章
返回