中国邮电高校学报(英文) ›› 2024, Vol. 31 ›› Issue (3): 72-79.doi: 10.19682/j.cnki.1005-8885.2024.1002
• Artificial Intelligence • 上一篇 下一篇
沈雨田1,陈建1,戴敏1,张思瑞2,徐晶1,王青3
Shen Yutian, Chen Jian, Dai Min, Zhang Sirui, Xu Jing, Wang Qing
摘要: In order to improve the accuracy of used car price prediction, a machine learning prediction model based on the
retention rate is proposed in this paper. Firstly, a random forest algorithm is used to filter the variables in the data.
Seven main characteristic variables that affect used car prices, such as new car price, service time, mileage and so
on, are filtered out. Then, the linear regression classification method is introduced to classify the test data into high
and low retention rate data. After that, the extreme gradient boosting ( XGBoost) regression model is built for the
two datasets respectively. The prediction results show that the comprehensive evaluation index of the proposed
model is 0. 548, which is significantly improved compared to 0. 488 of the original XGBoost model. Finally,
compared with other representative machine learning algorithms, this model shows certain advantages in terms of
mean absolute percentage error (MAPE), 5% accuracy rate and comprehensive evaluation index. As a result, the
retention rate-based machine learning model established in this paper has significant advantages in terms of the
accuracy of used car price prediction.