
Unveiling Factors Affecting Price of Second-hand Cars Using Multiple Linear Regression Models and Random Forest
- 1 Adcote School Shanghai, Shanghai, China
* Author to whom correspondence should be addressed.
Abstract
With the development of global economy and evolution of consumer concepts, automobiles, as an important means of transportation and consumer product, have seen a continuous increase in market demand. In recent years, with the growing number of vehicles in use, the second-hand car market has gradually become an important part of the automotive circulation sector. This study investigates the factors influencing second-hand car pricing in the current market environment. By combining multiple linear regression and Random Forest analysis, the author examines the significance and impact of various factors on the final selling price of second-hand vehicles. The data were collected from Kaggle and supplemented by relevant academic literature, covering variables such as power, transmission, mileage, engine type, and fuel type. The author seeks to reveal the characteristics and trends of current second-hand car pricing, provide guidance for marketing strategies, offer a basis for relevant policy-making, and explore the long-term impact of second-hand car pricing on the entire consumer market and social-economic development. As the results, the vehicle's power has the greatest impact on the final transaction price, which is the pricing in the second-hand car market, and it is positively correlated.
Keywords
Second-hand cars, vehicle's power, multiple linear regression, random forest
[1]. Wang Zhishan. (2024). The second-hand car market is poised to enter a rapid growth track. Business Observer (20), 6-9.
[2]. Tian Meng & Chang Kaidi. (2025). Analysis of the used passenger car market from January to November 2024. Auto Vertical and Horizontal (01), 110-113.
[3]. Zhao Qiuyuan. (2024). Analysis of influencing factors of used car value and appraisal value case. Automobile Repair & Maintenance (12),106-108.
[4]. Ming Jianping & Ouyang Ronghua. (2024). Research on the value evaluation of used cars of pure electric vehicles based on the improved replacement cost method. Automotive Practical Technology (20), 124-128.
[5]. Li Chen & Guan Yongjun. (2024). Value evaluation of new energy used vehicles based on gradient boosting tree model. Business Observer (29), 21-25.
[6]. Zhou Yuan & He Botao. (2024). Used car price prediction method based on weighted combination model. Computer and Digital Engineering (05), 1449-1452 1458.
[7]. Ning Yuantian, Zhu Qin, Wei Jinming, Yin Pengfei & Zhu Honglin. (2024). Research on used car price evaluation based on random forest regression model. Journal of Guilin Institute of Aerospace Technology (01), 82-93.
[8]. Jin Zhirong. (2023). Research on Used Car Pricing Strategy Based on Random Forest. Science & Technology Economic Market (06), 89-91.
[9]. Bai Weizhong. (2023). Research on Business Model Innovation in Domestic Second-hand Car Market under the Background of "Internet". Times Auto (23), 166-168.
[10]. Zheng Aiping, Li Binbin and Guo Chuanhao. (2023). “Used car transaction price prediction analysis based on linear regression and neural network model”. Intelligent Computers and Applications (09),103-110.
[11]. Soffritti, G., & Galimberti, G. (2010). Multivariate linear regression with non-normal errors: A solution based on mixture models. Statistics and Computing, 21(4), 523–536.
[12]. Sheather, S. J. (2009). A modern approach to regression with R. Springer.
[13]. Sangeetha, J. Margaret, & K. Joy Alfia. (2024). Financial stock market forecast using evaluated linear regression based machine learning technique. Measurement: Sensors, 31, 100950.
Cite this article
Luo,W. (2025). Unveiling Factors Affecting Price of Second-hand Cars Using Multiple Linear Regression Models and Random Forest. Theoretical and Natural Science,105,22-28.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of the 3rd International Conference on Mathematical Physics and Computational Simulation
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).