A Study of Robot Ground Classification Based on SMOTE Oversampling Technique and XGBoost

Zongbo Yu

doi:10.54254/2755-2721/2025.KL22356

1. Introduction

With the industrial development and the improvement of robots’ autonomous mobility, their applications in industrial inspection, agricultural automation, medical aid and other fields are becoming more and more widespread [1-3]. However, ground classification under complex terrain is still one of the core challenges in robot navigation, which directly affects its travelling stability and path planning[4]. In order to improve the accuracy of ground type classification and route planning, it is crucial to propose an algorithm that effectively classifies robot ground types.

In past studies, ground type classification methods can be mainly classified into traditional machine learning methods based on mathematics and deep learning methods based on neural networks [5]. Huang et al. [6] proposed a classification method based on Support Vector Machines (SVMs) and Binary Tree Clustering to classify the grasping patterns of robotic multi-fingered hands, Li et al. [7] classified China’s Southern hilly land terrain and found that the classification accuracy of random forest is better than SVM and KNN algorithms; Yang et al. [8] compared nine machine learning methods and selected XGBoost integrated learning algorithm with the highest eigenvalue score to construct a strong convective weather classification and identification model to classify and identify different weather, and examined the good prediction performance; Cheng [9] based on a Depth-separated convolutional neural network to extract flower features, using a miniature embedded device as the robot’s “eyes”, to achieve the robot’s automated classification technology for flowers; Xu et al. [10] used a deep recurrent neural network in the long and short-term memory model for collaborative robot dynamics model to compensate for the error, and the compensated collaborative robot dynamics model has good performance for the actual moment. Robot dynamics model has a better prediction of actual moments. Liu [11] compared the effect of both on large sample and small sample image scene classification using traditional machine learning methods and deep learning methods respectively, and found that the traditional machine learning has a better solution on small sample dataset, while the deep learning framework has a higher recognition accuracy on large sample dataset.

However, in practical applications, deep machine learning is not suitable for real-time applications due to high arithmetic consumption and high training costs. The classification of traditional maths-based machine learning is not accurate enough and features are not extracted sufficiently. In addition we found that the samples are not always balanced in the dataset [12] and most of the previous work does not consider this in detail, we need synthetic minority oversampling technique (SMOTE) to bridge the gap between the minority and the majority of samples [13].

Based on the above analysis, this paper proposes a robot ground type classification method based on SMOTE oversampling technique and XGBoost. Specifically, we first inimize the unbalanced dataset with the SMOTE method, after which we actively extract common statistical features and then train the classification with the XGBoost algorithm, which uses the characteristics of the gradient boosting tree to enable efficient and accurate classification on different types of ground data, and the computational cost is much lower than that of the neural network model, which is efficient and practical.

2. Methodology

XGBoost (eXtreme Gradient Boosting) is an efficient implementation of Gradient Boosting, proposed by Chen and Guestrin in 2016 [14] , which is an integrated learning method that gradually improves the model by constructing a set of weak learners (usually decision trees) mainly to improve the model performance to achieve the final strong classifier, using gradient-based optimisation to minimise the loss function, and combining with a regularisation strategy to improve the generalisation ability of the model. Compared with the traditional Gradient Boosting Tree (GBDT), XGBoost provides significant improvements in performance and computational efficiency.

2.1. XGBoost

XGBoost (eXtreme Gradient Boosting) is an efficient implementation of Gradient Boosting, proposed by Chen and Guestrin in 2016[14] , which is an integrated learning method that gradually improves the model by constructing a set of weak learners (usually decision trees) mainly to improve the model performance to achieve the final strong classifier, using gradient-based optimisation to minimise the loss function, and combining with a regularisation strategy to improve the generalisation ability of the model. Compared with the traditional Gradient Boosting Tree (GBDT), XGBoost provides significant improvements in performance and computational efficiency.

2.1.1. Design Ideas

XGBoost is constructed based on an additive model and a forward-stepping algorithm, which aims to minimise the following objective function:

\( L(ϕ)=\sum _{i=1}^{n}l({y_{i}},\hat{{y_{i}}})+\sum _{k=1}^{K}Ω({f_{k}}) \) (1)

Among them:

• \( l({y_{i}},\hat{{y_{i}}}) \) is the prediction error (e.g. log loss) of the training sample \( ({x_{i}},{y_{i}}) \) .

• \( \hat{{y_{i}}}=\sum _{k=1}^{K}{f_{k}}({x_{i}}) \) is the predicted value for the ith sample and \( \hat{{y_{i}}}=\sum _{k=1}^{K}{f_{k}}({x_{i}}) \) is the prediction for the kth tree.

• \( Ω(f)=γT+\frac{1}{2}λ\sum _{j=1}^{T}w_{j}^{2} \) is the regularisation term of the model complexity, where T is the number of leaf nodes of the tree and \( {w_{j}} \) is the weight of the jth leaf node.

2.1.2. Objective function optimisation

In the tth iteration, XGBoost fits a new tree \( {f_{t}} \) to minimise the following second-order expansion of the objective function on top of the already constructed \( t-1 \) tree:

\( {L^{(t)}}=\sum _{i=1}^{n}[{g_{i}}{f_{t}}({x_{i}})+\frac{1}{2}{h_{i}}{f_{t}}{({x_{i}})^{2}}]+Ω({f_{t}}) \) (2)

Among them:

• \( {g_{i}}=\frac{∂l({y_{i}},\hat{y_{i}^{(t-1)}})}{∂\hat{y_{i}^{(t-1)}}} \) is the first order derivative (gradient).

• \( {h_{i}}=\frac{{∂^{2}}l({y_{i}},\hat{y_{i}^{(t-1)}})}{∂\hat{y_{i}^{(t-1)2}}} \) is the second order derivative (diagonal element of the Hessian matrix).

By introducing the second-order derivative information, XGBoost achieves the approximate optimisation of the objective function, which significantly improves the optimisation efficiency and model performance.

2.1.3. Selection of the splitting point

XGBoost uses a greedy algorithm to select split points with the goal of maximising the objective function gain (Gain). Gain is defined as follows:

\( Gain=\frac{1}{2}[\frac{G_{L}^{2}}{{H_{L}}+λ}+\frac{G_{R}^{2}}{{H_{R}}+λ}-\frac{{({G_{L}}+{G_{R}})^{2}}}{{H_{L}}+{H_{R}}+λ}]- \) (3)

Among them:

• \( {G_{L}} \) , \( {H_{L}} \) and, \( {G_{R}}{H_{R}} \) are the first and second order gradient sums of the left and right child nodes respectively.

• λ and γ are regularisation parameters to control the complexity of the model.

The gain calculation process ensures that each split maximises the optimisation of the objective function while avoiding overfitting.

2.1.4. Regularisation

XGBoost introduces two regularisation parameters λ and γ in the objective function:

• λ controls the L2 regularisation of the leaf node weights and is used to suppress overfitting due to too large weights.

• γ controls the complexity of splitting, and the splitting operation is performed only if the splitting gain exceeds γ.

This regularisation mechanism enhances the generalisation ability of the model and enables it to achieve good performance on both training and test data.

2.2. SMOTE oversampling technique

SMOTE (Synthetic Minority Over-sampling Technique) is an oversampling technique for dealing with class imbalance problem [15], which balances the distribution of classes in a dataset by synthesising new samples of the minority class, the algorithm proceeds as follows.

2.2.1. Selecting a small sample of classes and calculating neighbours

First, the algorithm selects a target sample from the minority samples in the dataset: assuming that the set of minority samples in the dataset is \( {S_{minority}} \) , for each target minority sample \( {x_{i}}∈{S_{minority}} \) , the SMOTE algorithm calculates the distance of that sample from all other minority samples:

\( d({x_{i}},{x_{j}})=\sqrt[]{\sum _{k=1}^{n}{(x_{i}^{k}-x_{j}^{k})^{2}}} \) (4)

Where, \( x_{i}^{k} \) and \( x_{j}^{k} \) are the values of samples \( {x_{i}} \) and \( {x_{j}} \) on the kth feature and n is the total number of features.

2.2.2. Generating new samples

For each target sample \( {x_{i}} \) , a sample \( {x_{{i^{ \prime }}}} \) is randomly selected from its k nearest neighbour samples and a new sample \( {x_{new}} \) is generated by linear interpolation:

\( {x_{new}}={x_{i}}+λ×({x_{{i^{ \prime }}}}-{x_{i}}) \) (5)

Among them:

• \( λ \) is a randomly chosen value for \( λ∈[0,1] \) .

• \( {x_{i}} \) is the target sample.

• \( {x_{{i^{ \prime }}}} \) is a sample of selected neighbours.

This formula indicates that the new sample \( {x_{new}} \) is located between the target sample \( {x_{i}} \) and its neighbour \( {x_{{i^{ \prime }}}} \) and the interpolation factor \( λ \) controls where the new sample is generated.

2.2.3. Repeatedly generating samples and synthesising the training set

The SMOTE algorithm keeps repeating the previous step until it generates a sufficient number of minority class samples in order to reach a predetermined balancing goal. By generating new samples several times, SMOTE takes the newly generated synthetic samples together with the original minority class samples to form a new training set which will have a more balanced class distribution.

3. Experiments

3.1. Experimental hardware and software environment

The experiments were conducted in a GPU environment of NVIDIA GeForce RTX 4060, the programming language was python and the deep learning framework was pytorch version 2.5.1.

3.2. Data processing

3.2.1. Introduction to the dataset

The dataset of this experiment contains 3810 groups of signals collected from the robot, each group of signals has 10 channels and each channel has 128 sampling points, i.e. each group of signals is a 128*10 signal matrix. The output is the category of the ground to which each group of signals belongs, and the nine categories and their corresponding quantities are shown in Fig. 1.

/word/media/image1.png

Figure 1: Distribution of number of sample categories

3.2.2. SMOTE strategy data balancing

From Fig. 1, it can be found that the number of samples in each category is unevenly distributed. To solve this problem, we adopt the data balancing (SMOTE) strategy [16] to balance the number of samples in the training set, generating enough samples from few categories to make the number of samples in each category equal, Figure 2 shows the proportion of each category in the dataset before and after balancing.

/word/media/image2.png

Figure 2: Comparison of sample category distributions.

3.2.3. Feature extraction

Feature extraction is the core part of data preprocessing. In order to compress matrix information and save training cost, we extract multiple statistical features from each group of signals, such as mean, standard deviation, maximum, minimum, skewness and kurtosis. These features can effectively describe the trend, volatility, distribution characteristics and peaks of the signal, which are the core characteristics of the signal, and the extraction of these features is conducive to the learning of the subsequent model.

Finally, we divided the dataset into 80% training set and 20% testing set.

3.3. Hyperparameter selection

The hyperparameters of the XGBoost classifier are set as follows:

• Learning_rate: set to 0.1 to control the weight of each tree to avoid the model converging too quickly.

• Maximum tree depth (max_depth): set to 6 to limit the depth of a single tree to prevent overfitting.

• Number of base learners (n_estimators): set to 100, indicating that up to 100 trees are constructed.

• Evaluation metric (eval_metric): set to mlogloss, the multicategorical log-loss, which measures the predictive performance of the model.

• Random seed (random_state): set to 42, used to ensure the reproducibility of the experimental results.

3.4. Experimental indicators

In order to effectively assess the performance of the model, several experimental metrics were used in this experiment to judge the classification effect, specifically, the stipulation:

TP (True Positives): True cases, which are predicted to be positive and actually are;

FP (False Positives): False Positive cases, where the prediction is positive but the actual case is negative;

FN (False Negatives): False Negatives, which are predicted to be negative but are actually positive;

TN (True Negatives): True Negatives, which are predicted to be negative and actually are.

Categorical accuracy indicates the ratio of the number of correct predictions to the total number of positive and negative examples:

\( accuracy=\frac{TP+TN}{TP+FP+FN+TN} \) (6)

The precision rate is the proportion of correct predictions in a sample of positive cases where the prediction is judged on the basis of the outcome of the prediction:

\( precision=\frac{TP}{TP+FP} \) (7)

Recall is the proportion of correctly predicted positive cases out of the total sample of actual positive cases, judged on the basis of actual samples:

\( recall=\frac{TP}{TP+FN} \) (8)

The F1 score is a metric that neutralises precision and recall. Where P is the precision rate and R is the recall rate:

\( F1=\frac{2PR}{P+R} \) (9)

3.5. Comparative experiments

As shown in Table 1, we compare our method with svm and convolutional neural network models , and find that our method outperforms the remaining two methods in all common metrics, proving the superiority of our proposed algorithm.

Table 1: Comparison of results for XGBoost, SVM and Convolutional Neural Network.

	Classification accuracy	Accuracy	Recall rate	F1score
XGBoost+SMOTE	0.91	0.91	0.91	0.91
SVM	0.59	0.58	0.59	0.58
CNN	0.77	0.78	0.77	0.77

4. Conclusion

In this paper, we use the public dataset provided by the CareerCon 2019 competition in the kaggle website as the experimental dataset, and propose a robot ground type classification method based on SMOTE oversampling technique and XGBoost. The experimental results show that the recognition ability of the model is stable regardless of minority class ground types or majority class ground types, and the accuracy and F1 score in terms of the overall classification performance are 91% and 0.91, respectively, which are higher than 59% and 0.58 of SVM and 77% and 0.77 of CNN.In addition, it is also found that reasonable feature engineering and data balancing processing have a significant influence. This study can provide reliable support for autonomous robot navigation, path planning, and environment sensing, and can be extended to other classification tasks with data imbalance. Future research can further optimise the feature extraction method and combine it with other deep learning models to improve the ground recognition capability in complex environments.

References

[1]. Arents J, Greitans M. Smart industrial robot control trends, challenges and opportunities within manufacturing[J]. Applied Sciences, 2022, 12(2): 937.

[2]. Ning W, Yuxiao H A N, Yaxuan W, et al. Research progress of agricultural robot full coverage operation planning[J]. Nongye Jixie Xuebao/Transactions of the Chinese Society of Agricultural Machinery, 2022.

[3]. Guo Y, Chen W, Zhao J, et al. Medical robotics: opportunities in China[J]. Annual Review of Control, Robotics, and Autonomous Systems, 2022, 5(1): 361-383.

[4]. Karur K, Sharma N, Dharmatti C, et al. A survey of path planning algorithms for mobile robots[J]. Vehicles, 2021, 3(3): 448-468.

[5]. Li H. Machine learning methods [M]. Tsinghua University Press, 2022.

[6]. Yuanjie Huang,Congqing Wang. Support vector machine-based pre-grasping pattern classification for robotic multi-fingered hands[J]. Mechanical Engineering and Automation, 2006,(04):94-96.

[7]. Hengkai LI, Lijuan WANG, Songsong XIAO. Random forest classification of land use in southern hilly mountains based on multi-source data[J]. Transactions of the Chinese Society of Agricultural Engineering, 2021, 37(7).

[8]. He YANG, Hongbo MA, Weinan SUN, et al. Classification and identification of strong convective weather in Jilin Province based on XGBoost algorithm[J]. Meteorological Disaster Defense,2023,30(02):28-33.

[9]. Cheng T. Robotic flower sorting system based on depth-separated convolutional neural network[D]. Hunan University of Technology,2019.

[10]. XU Z, ZHANG G, WANG H, et al. Error compensation of collaborative robot dynamics based on deep recurrent neural network[J]. Chinese Journal of Engineering, 2021, 43(7): 995-1002.

[11]. Wang P, Fan E, Wang P. Comparative analysis of image classification algorithms based on traditional machine learning and deep learning[J]. Pattern recognition letters, 2021, 141: 61-67.

[12]. Siwei XU, Ming ZHOU, Rui ZOU, et al. A study on the effect of sampling ratio on classification results in unbalanced datasets[J]. Intelligent Computers and Applications,2024,14(09):111-117.DOI:10.20169/j.issn.2095-2163.240917.

[13]. Aihua Li, Wanxin Li, Sifan Chen, et al. Research on SMOTE-BO- XGBoost integrated credit scoring model for unbalanced data[J/OL] China Management Science,1-10[2025-2-13]Chen, Tianqi and Carlos Guestrin. "XGBoost: A Scalable Tree Boosting System." Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016): n. pag.

[14]. Chen T, Guestrin C. Xgboost: A scalable tree boosting system[C]//Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016: 785-794.

[15]. Chawla, N., K. Bowyer, Lawrence O. Hall and W. Philip Kegelmeyer. "SMOTE: Synthetic Minority Over-sampling Technique." ArXiv abs/1106.1813 (2002): n. pag.

[16]. ZHANG T, DING L. A new resampling method based on SMOTE for imbalanced data set [J][J]. Computer Applications and Software, 2021, 38(9): 273-279.

Cite this article

Yu,Z. (2025). A Study of Robot Ground Classification Based on SMOTE Oversampling Technique and XGBoost. Applied and Computational Engineering,149,52-59.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of CONF-MSS 2025 Symposium: Automation and Smart Technologies in Petroleum Engineering

ISBN：978-1-80590-061-0(Print) / 978-1-80590-062-7(Online)

Editor：Mian Umer Shafiq

Conference website: https://2025.confmss.org

Conference date: 21 March 2025

Series: Applied and Computational Engineering

Volume number: Vol.149

ISSN：2755-2721(Print) / 2755-273X(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).