Forecasting USA Unemployment Rate Base on ARIMA Model

Research Article
Open access

Forecasting USA Unemployment Rate Base on ARIMA Model

Dihan Zhang 1*
  • 1 Poole Management, North Carolina State University, Raleigh, United States    
  • *corresponding author dzhang22@ncsu.edu
Published on 1 December 2023 | https://doi.org/10.54254/2754-1169/49/20230486
AEMPS Vol.49
ISSN (Print): 2754-1169
ISSN (Online): 2754-1177
ISBN (Print): 978-1-83558-145-2
ISBN (Online): 978-1-83558-146-9

Abstract

This paper presents a detailed analysis of unemployment rate forecasting, a critical subject for various stakeholders including policymakers, businesses, and individuals. Amid significant economic events such as the global financial crisis and COVID-19 pandemic, the need for precise unemployment forecasts has become crucial. The research utilizes an Autoregressive Integrated Moving Average (ARIMA) model to analyze US unemployment rate data from 2000 to 2023, sourced from the Federal Reserve Economic Data (FRED). The paper identifies seasonality patterns, executes appropriate data transformations, and incorporates the Box-Jenkins methodology for ARIMA model identification. The findings reveal the model's resilience, demonstrating accurate forecasts despite significant disruptions. These insights offer valuable contributions in understanding labor market dynamics, facilitating informed decision-making and strategic planning. The paper highlights the robustness of ARIMA models, and their potential to adapt to rapid changes in the economic landscape, thereby proving invaluable in forecasting unemployment rates.

Keywords:

unemployment rate, forecasting, ARIMA model, time-series analysis, labor market dynamics

Zhang,D. (2023). Forecasting USA Unemployment Rate Base on ARIMA Model. Advances in Economics, Management and Political Sciences,49,67-76.
Export citation

1. Introduction

The accurate forecasting of the unemployment rate is of utmost importance for policymakers, businesses, and individuals seeking to navigate the complexities of the labor market [1]. Fluctuations in the unemployment rate hold substantial implications for various stakeholders, necessitating precise forecasts to inform decision-making and strategic planning. In recent years, the United States has experienced significant economic events, notably the global financial crisis of 2007-2008 and the subsequent COVID-19 pandemic, which have had profound effects on labor markets [2]. These crises resulted in unprecedented disruptions, including business closures, extensive job losses, and a sharp increase in unemployment rates. Policymakers faced the considerable challenge of formulating effective measures to mitigate economic downturns and foster recovery.

During times of crisis, policymakers rely heavily on accurate unemployment rate forecasts to gauge the severity of economic downturns, allocate resources for relief measures, and design targeted interventions to promote economic growth and reduce unemployment. Likewise, businesses depend on precise forecasts to inform critical decisions related to human resource planning, talent acquisition, and operational strategies. Understanding anticipated shifts in labor supply and demand enables companies to align their workforce strategies, optimize resource allocation, and enhance their competitive positioning [3]. Furthermore, individuals stand to gain significant benefits from unemployment rate forecasts as they make career decisions, evaluate job security, and plan for potential employment fluctuations. The ability to anticipate labor market conditions empowers job seekers to make informed choices concerning job search strategies, geographic mobility, and skill development efforts [4]. Access to accurate forecasts equips individuals to navigate labor market complexities, increasing the likelihood of securing stable employment and achieving long-term career success.

This research paper focuses on forecasting the unemployment rate in the United States, utilizing the widely recognized Autoregressive Integrated Moving Average (ARIMA) model. Insights are drawn from extensive historical data spanning from January 1, 2000, to May 1, 2023. By incorporating the historical context of significant events, such as the global financial crisis and the COVID-19 pandemic, this paper aims to provide policymakers, businesses, and individuals with reliable forecasts that inform decision-making, strategic planning, and labor market navigation.

2. Methodology

2.1. Data Collection

The research uses the American unemployment rate as the target in this paper to forecast. The unemployment rate is a significant economic statistic that indicates the percentage of people in the labor force who are jobless and actively looking for work. The unemployment rate is an essential indicator of a country's economic strength and residents' well-being since it gives insight into the health of an economy, the availability of job opportunities, and the general situation of the labor market [5]. High unemployment rates result in economic contraction, income inequality, reduced consumer confidence, increased social costs, and higher government expenditures. Conversely, low unemployment rates are associated with a stronger economy, increased consumer confidence, higher wages, reduced income inequality, and decreased government expenditures.

2.2. Data Selection

The data selection for this research encompasses the United States' unemployment rate data for the period 2000 to 2023, sourced from the Federal Reserve Economic Data (FRED) [6]. This period is chosen due to its encapsulation of diverse economic phases including a time of relative stability in 2000 post the late 1990s recession, the 2007-2008 financial crisis marked by a significant surge in unemployment rates, and the ensuing recovery period. Moreover, it covers the impact of COVID-19 pandemic. The selected timeframe thus spans a wide spectrum of unemployment rate variations, allowing for a robust and nuanced exploration of the temporal patterns and influencing factors of U.S. unemployment trends.

2.3. Methodological and Statistical Approach

This research undertakes a comprehensive methodological and statistical approach to analyze and forecast the U.S. unemployment rate, incorporating both foundational and advanced statistical models.

2.3.1. Preliminary Data Assessment

The research initiates with an examination of the time series data's white noise and stationarity characteristics. Utilizing the Augmented Dickey-Fuller (ADF) test, potential non-stationarity in the time series data, indicated by the presence of unit roots, is evaluated. In addition, the Ljung-Box test is employed to determine the presence of white noise characteristics within the time series. These crucial preliminary tests aid in shaping the subsequent analytical approach by identifying the inherent properties of the dataset.

2.3.2. Data Transformation

Given the initial non-stationary nature of the dataset, the research applies a series of transformations, specifically differencing and logarithmic transformations, to convert the data into a stationary sequence. Achieving stationarity is a critical prerequisite for successful application of time series modeling techniques.

2.3.3. Moving Average (MA) Model and Autoregressive (AR) Model

The Moving Average (MA) and Autoregressive (AR) models serve as the building blocks of this analysis. The MA model provides a smoothing technique that highlights longer-term trends or cycles in the time series by averaging out short-term fluctuations. The AR model incorporates past data points within the same time series as predictors, enabling the model to capture forms of temporal dependency.

2.3.4. Autoregressive Integrated Moving Average (ARIMA) Model

The ARIMA model, which integrates the principles of the AR and MA models and includes a differencing step, is applied to handle the time series data. This model effectively accommodates data with a trend, making it particularly useful when autocorrelation and trend are simultaneously present in the series.

2.3.5. Box-Jenkins Methodology

Underpinning the entire analytical process is the Box-Jenkins methodology. This method provides a structured framework for identifying, fitting, checking, and using integrated ARIMA models. The procedure encompasses model identification using the Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF), model fitting, and model validation through diagnostic tests.

2.3.6. Seasonality Examination and Forecasting

The research further incorporates an examination of seasonality patterns inherent in the unemployment rate data, enhancing the forecasting's robustness. Upon validating the chosen model, the forecast is generated and subsequently assessed using a Box test on the prediction residuals. This step ensures the forecasts' accuracy and the model's effectiveness in minimizing systematic patterns or biases. The amalgamation of these various statistical models and methodological approaches enables a rigorous and robust analysis of the U.S. unemployment rate, contributing significantly to the reliability and relevance of the research outcomes.

3. Results

Upon the initial assessment of the time series, the Augmented Dickey-Fuller (ADF) test, as presented in Table 1, indicated that the analyzed data was non-stationary due to a p-value of 0.4611, significantly above the established significance level of 0.05. This result necessitated a transformation of the dataset to a stationary form before proceeding with further analysis.

Table 1: ADF test of the initial data.

Alternative hypothesis

Stationary

Lag order

6

p-value

0.4611

The first transformation process involved differencing the data, leading to an ADF test that yielded a p-value of 0.01, as shown in Table 2. This result suggested stationarity within the differenced dataset.

Table 2: ADF test of the differencing data.

Alternative hypothesis

Stationary

Lag order

6

p-value

0.01

However, a subsequent Ljung-Box test, as shown in Table 3, produced a p-value of 0.5118, signifying the presence of white noise in the series, and hence, indicating that additional transformation was required.

Table 3: Box test of the differencing data.

Box Test

X- squared

0.43031

p-value

0.5118

As a response, a logarithmic transformation was implemented on the original dataset, followed by another differencing process, as shown in Table 4. The transformed dataset then underwent an ADF test, which resulted in a p-value of 0.01. This result suggests that the dataset is now stationary.

Table 4: ADF test of the logged and differencing data.

Alternative hypothesis

Stationary

Lag order

6

p-value

0.01

Subsequently, a Box test was conducted, yielding a p-value of 0.085, as shown in Table 5. This result indicates the existence of a non-white noise process, suggesting that the logarithmic transformation and differencing effectively removed white noise characteristics and rendered the series stationary.

Table 5: Box test of the logged and differencing data.

Box Test

X- squared

2.9601

p-value

0.08534

Following the successful transformations and establishment of stationarity, the data series was then modeled using the Auto ARIMA algorithm. (Table 6) This process identified the ARIMA (0,0,2) (0,0,2) [12] model as the most appropriate fit for the dataset, thereby capturing the inherent temporal dependencies and seasonal patterns. The first "0" denotes the order of the 'Autoregressive' (AR) part (p). It indicates how many past values from the time series are used. In this case, no past values are being used. The second "0" is for the 'Integrated' (I) part (d), representing the order of differencing required to make the time series stationary. In this context, the series didn't require any differencing. The "2" is for the 'Moving Average' (MA) part (q). It indicates the number of lagged forecast errors that are included in the model. In this context, it means that two lagged forecast errors are used. The seasonal component (P, D, Q) [s] of the model, represented as (0,0,2) [12], shows that the data has a seasonal component repeating every 12 periods (likely indicating monthly data with yearly seasonality). The (0,0,2) parameters suggest that no autoregressive or differencing terms are used for the seasonal component, but two past seasonal forecast errors are incorporated in the model [7].

Table 6: ARIMA model.

ARIMA (0,0,2) (0,0,2) [12]

Coefficients

ma1

ma2

sma1

sma2

0.1014

-0.1711

0.1848

0.1144

s.e.

0.0608

0.0697

0.0598

0.0544

AIC

-516.57

AICc

-516.35

BIC

-498.4

Seasonality patterns within the model were then explored. During the month of January, the unemployment rate tends to be higher. One reason for this could be the conclusion of the holiday season, including Christmas and New Year's, during which time businesses often slow down operations and reduce hiring. Additionally, colder weather during this period might discourage job-seeking activities, further contributing to the elevated unemployment rates (in Figure 1 and Figure 2).

/word/media/image1.png

Figure 1: Seasonal pattern.

/word/media/image2.png

Figure 2: Seasonality.

From February to April, an increase in job-seeking activities is observed, aligning with the observed decrease in the unemployment rate during this timeframe. This could be attributed to businesses resuming normal operations after the holiday season, leading to increased job openings. Furthermore, improving weather conditions during this period might encourage individuals to actively seek employment.

However, the months of June and July exhibit relatively higher unemployment rates. This could be due to various factors, such as seasonal industries, like tourism, undergoing temporary downturns. Additionally, some job seekers might temporarily stop seeking employment due to holiday breaks or warmer weather conditions.

/word/media/image3.png

Figure 3: Decomposition.

The significant decrease in the unemployment rate observed in September 2019, reaching 3.5% (the lowest rate since December 1969), might be due to a combination of factors such as favorable economic conditions, increased job opportunities in certain sectors, and potential policy changes [8]. However, without specific data related to these factors, it is difficult to conclusively attribute this decrease to any single factor (Figure 3).

/word/media/image4.png

Figure 4: Graph with forecasting.

A forecast generated based on these insights was subjected to a Box test, the result of which is depicted in Figure 4. Furthermore, these forecasts were subjected to a Ljung-Box test to evaluate the residuals for white noise characteristics, which means the model is well-done (Table 7).

Table 7: Box test of the forecasting residuals.

Box Test

X- squared

0.019014

p-value

0.8903

Considering the extreme variables of COVID-19 pandemic, the research built a new model by using the same method (Table 8). Following the strategic removal of data collected during and after the pandemic outbreak, predictions were generated using the refined dataset. Notably, these new predictions were remarkably consistent with the forecasts derived solely from the pre-2020 data, despite the significant disruptions caused by the 2020 events (Figure 5).

Table 8: New ARIMA model.

ARIMA (5,0,1) (1,1,2) [12]

Coefficients

ar1

ar2

ar3

ar4

ar5

ma1

sar1

sma1

sma2

0.3508

0.2048

-0.1356

-0.0014

0.3785

-0.4140

-0.8278

-0.0121

-0.6126

s.e.

0.1254

0.0661

0.0703

0.0684

0.0723

0.1291

0.2080

0.2296

0.2051

AIC

-924.97

AICc

-923.95

BIC

-890.72

/word/media/image5.png

Figure 5: Comparing of the forecasts.

This consistency suggests that the original model effectively captures underlying patterns and dynamics of the time series, even in the face of unprecedented disturbances. The fact that the original model's predictions did not greatly deviate during the 2020 disruption underscores its capacity to adapt to change and adjust forecasts accordingly.

Such resilience bolsters confidence in the reliability and accuracy of the model's predictions, enhancing their usefulness for informed decision-making. The exclusion of post-outbreak data and the achievement of comparable predictions helps to offset potential biases and anomalies brought about by the pandemic's extraordinary circumstances. This approach allows for more reliable and consistent forecasts, as indicated by the model's proven performance.

4. Discussion

The findings from the ARIMA model used to forecast unemployment rates demonstrate the considerable potential of time series analysis in labor market predictions. This research highlights the importance of initial assessments, data transformation, modeling, and validation procedures to achieve accurate forecasts.

An interesting observation from the research was the effect of seasonal fluctuations on unemployment rates, with increases observed in January and during the summer months. This finding indicates that any forecasts of unemployment rates need to account for such regular annual patterns. Despite the widespread disruptions caused by the COVID-19 pandemic, the model was remarkably consistent with the forecasts derived solely from pre-2020 data, pointing to its robustness and resilience. This underscores the potential of ARIMA models to adapt to change and adjust forecasts accordingly, which is crucial in an increasingly dynamic and uncertain global economy [9].

The model's ability to provide accurate forecasts despite the unprecedented disruptions caused by the pandemic could be attributed to the way the ARIMA model incorporates past values and errors to predict future trends. By doing so, the model can adjust to new data and provide accurate forecasts, even in the face of sudden changes in trends.

However, it is important to remember that while the ARIMA model can effectively capture past trends and dynamics in the data, it does not explicitly consider other potential macroeconomic influences. Future research could consider expanding the scope of the ARIMA model to incorporate external variables such as GDP, inflation rates, or labor market policies, which may provide even more accurate and robust forecasts of unemployment rates [10].

5. Conclusion

The endeavor to predict the unemployment rate accurately is a significant and complex task, driven by the compelling need to navigate the labyrinthine nature of labor markets, particularly in the face of major economic events. This research arose from the motivation to provide robust forecasts of unemployment rates to enable more informed decision-making for a wide spectrum of stakeholders, including policymakers, businesses, and individuals. Given the substantial impacts of fluctuations in unemployment rates, precise predictions are essential for strategic planning, policy formulation, human resource management, and career decision-making.

The research methodology adopted for this research involved the use of the Autoregressive Integrated Moving Average (ARIMA) model, a widely recognized statistical tool for time series analysis. Utilizing this approach, along with the extensive dataset from the Federal Reserve Economic Data (FRED) spanning 2000 to 2023, this paper generated insightful forecasts by capturing the inherent temporal dependencies and seasonal patterns in the unemployment rates. Furthermore, the research findings underscore the resilience of the ARIMA model, as evidenced by its ability to generate accurate forecasts despite significant disruptions such as the global financial crisis and the COVID-19 pandemic. This capacity to adapt to rapidly changing economic conditions reinforces the ARIMA model's suitability for employment forecasting. This research bears significant importance for various stakeholders. Policymakers can leverage these insights to formulate more effective measures to tackle economic downturns and foster recovery. Businesses can utilize these predictions to inform human resource planning, talent acquisition, and operational strategies. Individuals, on the other hand, can make more informed career decisions and plan for potential employment fluctuations.

However, the research is not without limitations. Although the ARIMA model can effectively capture the past trends and dynamics in the data, it does not explicitly consider other potential macroeconomic influences. Future research, therefore, could expand the scope of the ARIMA model to incorporate external variables such as GDP, inflation rates, or labor market policies. By integrating these factors, the forecasting model could provide even more accurate and robust predictions of unemployment rates, further enhancing its utility and relevance in labor market analysis.


References

[1]. Davidescu, A. A., Apostu, S.-A., & Paul, A. (2021). Comparative analysis of different univariate forecasting methods in modelling and predicting the Romanian unemployment rate for the period 2021–2022. Entropy, 23(3), 325.

[2]. Job retention schemes during the COVID-19 lockdown and beyond. (2020). OECD Policy Responses to Coronavirus (COVID-19).

[3]. Siami‐Namini, S., Lyford, C., & Trindade, A. A. (2020). The effects of monetary policy shocks on income inequality across U.S. states. Economic Papers: A Journal of Applied Economics and Policy, 39(3), 204–221.

[4]. The importance of workforce and Labor Market Information. URL: https://www.dol.gov/sites/dolgov/files/ETA/wioa/pdfs/Informational_Handout.pdf, last accessed 2023/07/25.

[5]. Pratap, P., Dickson, A., Love, M., Zanoni, J., Donato, C., Flynn, M. A., & Schulte, P. A. (2021). Public health impacts of underemployment and unemployment in the United States: Exploring Perceptions, gaps and opportunities. International Journal of Environmental Research and Public Health, 18(19), 10021.

[6]. Federal Reserve Bank of St. Louis. (n.d.). Welcome to Fred, your trusted source for economic data since 1991. FRED. URL: https://fred.stlouisfed.org/, last accessed 2023/07/25.

[7]. Lawal, O. A., & Teh, J. (2023). Assessment of Dynamic Line Rating Forecasting methods. Electric Power Systems Research, 214, 108807.

[8]. U.S. Bureau of Labor Statistics. https://www.bls.gov/opub/mlr/2020/article/job-market-remains-tight-in-2019-as-the-unemployment-rate-falls-to-its-lowest-level-since-1969.htm, last accessed 2023/07/25.

[9]. Ahmad, M., Khan, Y. A., Jiang, C., Kazmi, S. J., & Abbas, S. Z. (2021). The impact of <scp>covid</scp> ‐19 on unemployment rate: An intelligent based unemployment rate prediction in selected countries of Europe. International Journal of Finance &amp; Economics, 28(1), 528–543.

[10]. Nguyen, P.-H., Tsai, J.-F., Kayral, I. E., & Lin, M.-H. (2021). Unemployment rates forecasting with grey-based models in the post-covid-19 period: A case study from Vietnam. Sustainability, 13(14), 7879.


Cite this article

Zhang,D. (2023). Forecasting USA Unemployment Rate Base on ARIMA Model. Advances in Economics, Management and Political Sciences,49,67-76.

Data availability

The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.

Disclaimer/Publisher's Note

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

About volume

Volume title: Proceedings of the 2nd International Conference on Financial Technology and Business Analysis

ISBN:978-1-83558-145-2(Print) / 978-1-83558-146-9(Online)
Editor:Javier Cifuentes-Faura
Conference website: https://www.icftba.org/
Conference date: 8 November 2023
Series: Advances in Economics, Management and Political Sciences
Volume number: Vol.49
ISSN:2754-1169(Print) / 2754-1177(Online)

© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open access policy for details).

References

[1]. Davidescu, A. A., Apostu, S.-A., & Paul, A. (2021). Comparative analysis of different univariate forecasting methods in modelling and predicting the Romanian unemployment rate for the period 2021–2022. Entropy, 23(3), 325.

[2]. Job retention schemes during the COVID-19 lockdown and beyond. (2020). OECD Policy Responses to Coronavirus (COVID-19).

[3]. Siami‐Namini, S., Lyford, C., & Trindade, A. A. (2020). The effects of monetary policy shocks on income inequality across U.S. states. Economic Papers: A Journal of Applied Economics and Policy, 39(3), 204–221.

[4]. The importance of workforce and Labor Market Information. URL: https://www.dol.gov/sites/dolgov/files/ETA/wioa/pdfs/Informational_Handout.pdf, last accessed 2023/07/25.

[5]. Pratap, P., Dickson, A., Love, M., Zanoni, J., Donato, C., Flynn, M. A., & Schulte, P. A. (2021). Public health impacts of underemployment and unemployment in the United States: Exploring Perceptions, gaps and opportunities. International Journal of Environmental Research and Public Health, 18(19), 10021.

[6]. Federal Reserve Bank of St. Louis. (n.d.). Welcome to Fred, your trusted source for economic data since 1991. FRED. URL: https://fred.stlouisfed.org/, last accessed 2023/07/25.

[7]. Lawal, O. A., & Teh, J. (2023). Assessment of Dynamic Line Rating Forecasting methods. Electric Power Systems Research, 214, 108807.

[8]. U.S. Bureau of Labor Statistics. https://www.bls.gov/opub/mlr/2020/article/job-market-remains-tight-in-2019-as-the-unemployment-rate-falls-to-its-lowest-level-since-1969.htm, last accessed 2023/07/25.

[9]. Ahmad, M., Khan, Y. A., Jiang, C., Kazmi, S. J., & Abbas, S. Z. (2021). The impact of <scp>covid</scp> ‐19 on unemployment rate: An intelligent based unemployment rate prediction in selected countries of Europe. International Journal of Finance &amp; Economics, 28(1), 528–543.

[10]. Nguyen, P.-H., Tsai, J.-F., Kayral, I. E., & Lin, M.-H. (2021). Unemployment rates forecasting with grey-based models in the post-covid-19 period: A case study from Vietnam. Sustainability, 13(14), 7879.