img

Wechat

Adv search

Journal of Desert Research ›› 2024, Vol. 44 ›› Issue (6): 70-78.DOI: 10.7522/j.issn.1000-694X.2024.00049

Previous Articles     Next Articles

Ensemble models for identifying automatically aeolian saltating tracks driven by datasets

Hongji Zhou1(), Fanmin Mei1(), Mengji Pu1, Chuan Lin1, Jin Su2, Jinguang Chen3   

  1. 1.School of Environmental and Chemical Engineering /, Xi'an Polytechnic University,Xi'an 710600,China
    2.School of Science /, Xi'an Polytechnic University,Xi'an 710600,China
    3.School of Computer Science, Xi'an Polytechnic University,Xi'an 710600,China
  • Received:2024-03-11 Revised:2024-05-12 Online:2024-11-20 Published:2024-12-06
  • Contact: Fanmin Mei

Abstract:

It is very vital for tracking sand particle to establish automatic identification of saltating tracks. Thus, the four ensemble models, including the Extremely randomized trees, the Random forests, the XGBoost, and the Gradient Boosting Decision Tree driven by the datasets we constructed, were proposed for identifying saltating tracks. Firstly, all the models perform well in spite of the dataset without very good discriminability, suggesting these models own an advantage when dealing with nonlinear relationships. Secondly, the Extremely randomized trees model holds the highest accuracy (0.9035), precision (0.9030), recall (0.9035), F1 score (0.8995), MCC (0.7378), and AUC score (0.9179), and time cost while the XGBoost model has the best balance between the higher scores and lower time cost. It implies that the former is most feasible for identifying offline saltating tracks and that the latter is prospective for tracking sand particle online. Finally, the improved datasets, which incorporate standard deviation of instant horizontal and vertical velocities, significantly enhance the predictive performances of Extremely randomized trees. This study effectively reduces the time cost of manual trajectory verification and broadens the application of machine learning in saltation.

Key words: aeolian saltating, extremely randomized trees, XGBoost, random forest, gradient boosting decision tree

CLC Number: