Dr.
ArushiMehta1✉EmailArushim671@gmail.com1A
A
Jamia Millia IslamiaIndia Arushi Mehta
Jamia Millia Islamia, India
Arushim671@gmail.com
1. Author Contributions:
Dr. Arushi Mehta conceptualized the study, developed the deep learning framework, conducted the empirical analysis, and wrote the manuscript. All work, including data processing, model development, hyperparameter tuning, and interpretation of results, was performed by the sole author.
A
5. Data Availability Statement:
The currency data used in this study were sourced from publicly accessible databases, including central bank and financial data repositories. Further details can be provided upon request.
6. Code Availability:
The Python scripts and deep learning model configurations used for forecasting are available from the author upon reasonable request.
Declarations
of Originality:
The author affirms that the work is original, has not been published previously, and is not currently under consideration for publication elsewhere. All sources used have been appropriately acknowledged.
Abstract
This study evaluates the predictive performance of Wavelet-LSTM models in forecasting exchange rates for BRICS currencies (INR, CNY, RUB, BLR, and ZAR). The data period studied is from March 2013 to February 2024. The analysis reveals significant variability in model performance, with the Chinese Yuan (CNY) exhibiting robust predictive accuracy, as indicated by low errors and high R² values on both training and test datasets. Conversely, the Indian Rupee (INR) and South African Rand (ZAR) show poor test set performance, highlighting overfitting and limited generalization capabilities. Moderate performance for the Russian Ruble (RUB) and Brazilian Real (BLR) suggests potential for improvement through fine-tuning. These findings underscore the need for strategies such as regularization, cross-validation, and enhanced data preprocessing to address overfitting and improve generalization. A tailored, currency-specific modelling approach is recommended to account for diverse exchange rate dynamics. Future research should explore advanced techniques, including ensemble learning, to enhance model robustness and applicability across varying economic contexts.
Keywords –
Neural Networks
Forex
Forecasting
BRICS
Wavelet
JEL Classification -
C45
F31
C53
O57
Introduction
The prediction of exchange rates in BRICS countries (Brazil, Russia, India, China, and South Africa) holds significant importance in the current global geopolitical environment, given their critical roles in global trade, investment flows, and economic alliances. The geopolitical risk is key determinant of variations in financial markets and investment related decisions (Pastor and Veronesi, 2013). BRICS countries are recipient of global investment from the major countries like Japan, USA, and Germany (Sui and Sun, 2016), are affected by the investment flows from abroad, and exchange rate regime followed. Exchange rate fluctuations directly impact trade balances, affecting export competitiveness and import costs. In a world where BRICS countries are deepening their trade ties and economic cooperation, predicting exchange rates allows businesses to manage risks and optimize profits. BRICS group consists 42 percent of world population, 23 per cent of GDP, 30 per cent of territory, and 18 per cent of trade (Salisu et al., 2022a). Additionally, the geopolitical tensions, such as Russia’s involvement in the Ukraine war and rising U.S.-China rivalries, have added uncertainty to exchange rate movements. Accurate predictions help governments and businesses mitigate these risks, maintaining stability in trade and pricing.
Amid the ongoing internationalisation of BRICS currencies, de-dollarization and discussions around a potential common BRICS currency, accurate forecasting can strengthen the feasibility of its introduction (Coquidé et al. 2023; Kondratov, 2021; Das and Roy, 2023; Ahmed et al., 2025; Atif et al., 2022; Ahmed, 2025c). Moreover, exchange rates influence inflation by affecting the costs of imported goods, which is particularly sensitive for BRICS nations like Brazil and South Africa. Predicting exchange rate trends enables central banks to craft monetary policies that control inflation and stabilize economies. BRICS members are also exploring currency cooperation, including discussions on a common currency, making exchange rate forecasts vital for assessing the feasibility and impact of such initiatives. Investors rely on these predictions to assess the risks and returns of foreign direct investment (FDI) and portfolio investments. Currency volatility can deter investment or cause capital flight, making stable forecasts essential for attracting and retaining capital inflows. For resource-dependent BRICS countries, such as Russia, Brazil, and South Africa, exchange rate forecasts are also critical for predicting commodity prices and revenues, especially in response to global demand shifts. Governments and central banks rely heavily on exchange rate predictions for policy formulation and economic planning, influencing interest rates, trade tariffs, and fiscal strategies. As BRICS nations seek to position themselves in a more multipolar world, the ability to accurately predict exchange rates has become both an economic and geopolitical necessity (Salisu et al., 2021b).
Traditional econometric methods, such as ARIMA, VAR, and GARCH, have been widely used to forecast exchange rates. However, these methods face several challenges such as linear assumption, as the movement of forex rates is complex, and nonlinear due to factors such as interest rates, inflation, geopolitical events, and market sentiment. The traditional model flexibility with big data is limited, and often fail to account for external shocks (Ramos-Pérez et al. 2019; Liu, 2019; Pradeepkumar and Ravi, 2017). The assumption of stationarity, that statistical properties are constant over the time, is also limitation of traditional models, along with the model specification such as lag length, autocorrelation etc.
In recent, the literature around time series forecasting using Artificial Intelligence (AI) has grown significantly. Machine learning ML, and DL models, such as ANNs, random forests, and SVMs, offer several advantages over traditional econometric methods in predicting exchange rates. The ML models excel at capturing complex and nonlinear linkages and can detect hidden patterns in data (Srivastava et al. 2023; Rubio et al 2023; Kamalov et al. 2022). Further, the machine learning models can process the large datasets, incorporating multiple factors. The ML and DL based models can also adapt to non-stationary data. The RNNs or LSTM technique are efficient in handling the time series with changing patterns. The AI based models can be trained on high-frequency data, enhancing the ability to handle the external shocks. Further, the Automatic feature selection, robustness and flexibility, and overall improved forecast accuracy. Due to this, the ML techniques are increasingly becoming a preferred approach for exchange rate forecasting in the evolving global financial environment (Zhang et al. 2024).
A promising approach to address this complexity is the utilization of Wavelet-LSTM networks. This hybrid model integrates the advantages of wavelet analysis, which facilitates the decomposition of time series data into distinct frequency components, with the robust sequential learning capabilities of LSTM neural networks (Tamilselvi et al. 2024; Chen et al. 2019). By employing this methodology, researchers can effectively capture both long-term trends and short-term fluctuations in BRICS forex rates, potentially enhancing forecast accuracy. The wavelet decomposition process contributes to the management of non-stationary data and the reduction of noise, while the LSTM network demonstrates proficiency in learning intricate temporal dependencies. This approach may prove particularly valuable for policymakers, investors, and financial institutions operating within BRICS countries, as it has the potential to yield more reliable predictions of forex rate movements in these volatile markets. Forecasting foreign exchange (forex) rates in BRICS economies presents a significant challenge due to the dynamic nature of these emerging markets. The complexity of these economies, characterized by rapid growth, evolving financial systems, and susceptibility to global economic shifts, necessitates sophisticated forecasting techniques. A promising approach to address this complexity is the utilization of Wavelet LSTM networks. This hybrid model integrates the advantages of wavelet analysis, which facilitates the decomposition of time series data into distinct frequency components, with the robust sequential learning capabilities of LSTM neural networks.
This methodology captures both long-term trends and short-term fluctuations in BRICS forex rates can be captured, potentially enhancing forecast accuracy. The wavelet decomposition process contributes to the management of non-stationary data and the reduction of noise, which are common challenges in forex rate analysis. This step allows for a more nuanced examination of the underlying patterns in the time series data, separating high-frequency fluctuations from low-frequency trends. Such decomposition can reveal hidden structures within the data that might be obscured in traditional time series analysis methods. The LSTM network, a specialized form of recurrent neural network, demonstrates proficiency in learning intricate temporal dependencies. This capability is particularly valuable in forex rate prediction, where complex interactions between various economic factors, geopolitical events, and market sentiments influence exchange rates over different time horizons. LSTM networks can effectively capture and remember relevant information over extended periods, making them well-suited for identifying long-term dependencies in forex rate movements. The combination of wavelet analysis and LSTM networks in this hybrid approach offers several advantages. Firstly, it allows for the simultaneous consideration of multiple time scales, from short-term fluctuations to long-term trends, providing a more comprehensive view of forex rate dynamics. Secondly, the noise reduction achieved through wavelet decomposition can lead to more stable and reliable input features for the LSTM network, potentially improving the overall predictive performance.
This approach is valuable particularly, for policymakers, investors, and financial institutions operating within BRICS countries, as it has the potential to yield more reliable predictions of forex rate movements in these volatile markets. Policymakers can utilize these forecasts to inform monetary policy decisions and manage currency stability. Investors and traders can leverage the predictions to optimize their portfolio strategies and manage risk exposure in BRICS currencies. Financial institutions can enhance their risk management practices and improve their foreign exchange operations based on more accurate forecasts. Moreover, the application of Wavelet LSTM networks to BRICS forex rate forecasting opens up avenues for further research and development in the field of financial time series analysis. Researchers can explore variations of this hybrid model, such as incorporating additional economic indicators or experimenting with different wavelet transformation techniques to further refine the forecasting accuracy. The success of this approach in BRICS economies could also pave the way for its application in other emerging markets or even developed economies, contributing to the broader field of economic forecasting and financial modelling.
Literature Review
The use of machine learning and deep learning has grown significantly in the prediction of the time series variables(Ahmed, 2025a, 2025b, 2025d, 2025e). The use of ML and DL has advantages of accuracy and capturing the non-linearity and complexity of the time series variables, irrespective of the stationarity and normality, the necessary assumptions of the many traditional methods of forecasting. The review study by, Junior et al. (2023) provides, comprehensive background of the previous research in the forex market forecasting, whereas the Liu and Wang (2024), reviewed ML and DL models used in the time series prediction.
The selection of model and evaluation is crucial in accurate prediction. To probe this question, Junior et al. (2023) reviewed the literature of forex forecasting methodologies. In reviewing of 60 studies from 2010 to 2021, authors found use of LSTM and ANN models to be most commonly used algorithm of machine learning. Further, authors found that MSE, RMSE, MAE, and MAPE were the most commonly used evaluation metrics. Authors concluded that there is significant scope of research in the exchange rate prediction. As per Fletcher (2012), the SVM, Relevance Vector Machine (RVM) and Neural Network shows improved results when exogenous variables, affecting the basket of forex returns are included. Panda et al. (2020) reviewed the advance exchange rate prediction of studies from 2000 to 2019. The author found the use of different models such as ANN, FLANN, HMM, SVM, and AR models.
Islam et al. (2020) examined the studies which have studied the advancements in forex prediction. Based on the analysis of 39 studies from select publishers, author found the use of neural networks, GRU and LSTM to be most prevalent in the forecasting of exchange rates. In their review, Sezer et al. (2020) indicated increased use of DL methods in time series forecasting. Berradi et al. (2020) favoured the deep learning techniques over the traditional ones in the forecasting of time series. The study further found that RNNs are mostly used model in the predictions. Ryll and Siedens (2019) evaluated the machine learning models from 150 research papers, and found them to be outperforming the traditional models. Further, among the findings, the RNN models perform better than the feed forward neural network as well as SVM. The findings hinted at the temporal effect in prediction across the assets and location, and can be utilised in forecasting of time series. Henriques et al. (2019) recognised the difficulty in forecasting the financial time series due to chaotic, non-linear and dynamic nature of them. Further in the findings of bibliometric analysis, SVM and neural networks are the dominantly used ML models, and data is mostly used from north American markets. Authors suggested the relevancy of the research field, and required focus on the data of emerging markets.
Abedin et al. (2021) proposed a model by combining the bagging ridge regression and Bi-LSTM model to predict the forex rate of 21 currencies before the Covid and after the Covid. Further, the authors compared the models with the traditional models such as SVM, Decision Tree, random forest as well as DL methods of LSTM and Bi-LSTM. The proposed model was found to be more accurate, however, the accuracy varied before and during the pandemic. Dautel et al. (2020) compared the LSTM and GRU with the traditional recurrent network architecture and feedforward network in directional forecasting. The authors found the DL methods to be suitable in prediction, but also pointed out the difficulty in tuning and implementing the architectures. Rabbi et al. (2022) forecasted the currencies of 22 countries by using the various ML and DL techniques. Out of the SVR, and RVR, and LSTM. The LSTM is found to be the more accurate as compared to the others. Datta et al. (2021) proposed different ML and DL models in prediction of 22 currencies. The study employed the lasso regression, decision tree, ridge regression, and the Bi-LSTM in prediction, and found the Bi-LSTM to be better performing as compared to the other models. The MAE, MSE, RMSE, and MAPE were used to evaluate the model accuracy.
Ince and Trafalis (2006) proposed forecasting model combining the parametric and non-parametric test such as ARIMA and VAR, with the SVR and ANN. The study found the SVR technique to be accurate in performance. Wang et al. (2022) proposed the CNN-TLSTM model in prediction of Canadian dollar. The CNN model selects the features whereas the TLSTM does the prediction of time series. In comparison of MLP, CNN, RNN, LSTM and CNN-LSTM, the CNN-LSTM model is found to be better predictor of the Chinese Yuan forex rate. Wang and Chen (2021) combine the two-stage feature extraction model and Adaboost based reinforcement ensemble learning framework. The resultant deep RNN, is found to be a better in accuracy. Islam and Hossain (2021) combined the LSTM and GRU to predict the four commonly used currencies, EUR, GBP, CAD, and CHF. The author also compared the model with the standalone LSTM and GRU. on the basis of MSE, RMSE, MAE, and R-Squared, the study found the combined LSTM-GRU model to be best performing.
On the basis of the analysis of above, it can be noticed that literature has been in the favour of deep learning in forecasting of the time series data. Further, it can be observed that, the use of combination of traditional and deep learning models is lacking in time series forecasting. The wavelet LSTM model is widely used in forecasting of various type of variables, such as wind power, fault in power grids, solar irradiance, electricity price prediction, stock market prediction, fault detection in rotating machinery, air quality, electricity consumption (Liu et al., 2019; Branco et al., 2022; Wang et al., 2018; Chang et al., 2019; Jalayer et al., 2021; Zeng et al., 2022; Chi, 2022). However, the use wavelet LSTM in forex rate forecasting has not been observed in the literature. To the best of authors knowledge, no study has applied the combined approach of Wavelet-LSTM in forex rate prediction. This study probes the predictive accuracy of this ensemble model this paper. This study aims to investigate the predictive accuracy of a Wavelet Transformed LSTM model in forecasting forex rates for the BRICS countries. By combining the signal processing capabilities of wavelets with the powerful temporal learning abilities of LSTM, this novel approach may provide valuable insights and improved forecasting performance compared to existing methods.
Methods and Materials
Data
The data consists the crude oil prices and forex rate of five BRICS countries. The variables are OIL – Brent crude oil prices, INR-Indian Rupee, CNY- Chinese Yuan, RUBLE- Russian Ruble, and ZARSA denotes South African Rand.
A
Table 1
| | OIL | INR | CNY | RUBLE | BLR | ZARSA |
|---|
Mean | 72.075 | 69.629 | 6.6150 | 62.636 | 3.9085 | 14.152 |
Std. Error | 0.4392 | 0.1372 | 0.0064 | 0.2997 | 0.0207 | 0.0480 |
Median | 68.695 | 68.660 | 6.6061 | 64.092 | 3.8211 | 14.261 |
Mode | 70.710 | 64.860 | 7.1942 | 32.985 | 5.1706 | 14.485 |
Std. Dev | 23.688 | 7.3999 | 0.3452 | 16.164 | 1.1214 | 2.5887 |
Sam. Var | 561.14 | 54.758 | 0.1192 | 261.28 | 1.2575 | 6.7015 |
Kurtosis | -0.7537 | -0.6437 | -1.1339 | 0.4219 | -1.2346 | -0.5920 |
Skewness | 0.2884 | 0.1669 | 0.1397 | -0.2314 | -0.0354 | 0.0014 |
Range | 124.06 | 30.391 | 1.3063 | 113.13 | 3.9445 | 11.347 |
Minimum | 9.1200 | 53.035 | 6.0412 | 29.863 | 1.9447 | 8.4580 |
Maximum | 133.18 | 83.426 | 7.3475 | 143.00 | 5.8892 | 19.805 |
Source – Author’s Work
The descriptive statistics for the five BRICS currencies (INR, CNY, RUBLE, BLR, and ZARSA) and oil prices reveal important insights into their central tendencies, dispersion, and distribution shapes. The mean values show that oil prices averaged $72.08, while the Indian Rupee (INR) averaged 69.63, the Chinese Yuan (CNY) 6.62, the Russian Ruble (RUBLE) 62.64, the Brazilian Real (BLR) 3.91, and the South African Rand (ZARSA) 14.15 per US Dollar. Among these currencies, the CNY exhibited the least fluctuation around its mean with a small standard error of 0.0064, indicating its relative stability, while the RUBLE had the largest standard error of 0.2998, signifying greater variability in its average exchange rate.
Standard deviation (a measure of dispersion) and variance (the square of standard deviation) further highlight the variability of these currencies. The RUBLE shows the highest standard deviation (16.16) and variance (261.29), confirming its status as the most volatile currency in the group. In contrast, the CNY has the smallest standard deviation (0.345) and variance (0.119), underscoring its stability. The INR and ZARSA show moderate variability, with standard deviations of 7.40 and 2.59, respectively. When examining kurtosis, the most currencies, including the INR, CNY, BLR, and ZARSA, have negative kurtosis values, indicating flatter distributions with fewer extreme values. The RUBLE, with a slightly positive kurtosis (0.42), indicates a higher likelihood of extreme values or outliers in its data. Skewness, which measures asymmetry in the data, is generally close to zero for most currencies, with the INR, CNY, and ZARSA showing slightly positive skewness, suggesting a nearly symmetric distribution but with a slight tilt towards higher values. On the other hand, the RUBLE and BLR exhibit slight negative skewness, meaning their distributions have more frequent lower values.
Wavelet – LSTM
Wavelet Long Short-Term Memory (Wavelet LSTM) is a hybrid model that combines wavelet transform techniques with Long Short-Term Memory (LSTM) neural networks to enhance time-series forecasting. The wavelet transform decomposes the original time-series data into different frequency components, allowing the model to focus on both short-term fluctuations and long-term trends separately. By capturing both high and low-frequency information, Wavelet LSTM improves the model's ability to predict complex time-series patterns more effectively than traditional LSTM models, which may struggle with multi-scale data.
The process typically involves applying the discrete wavelet transform (DWT) to decompose the input time-series
into approximation (A) and detail (D) coefficients at different levels of decomposition. These components are then fed into separate LSTM models, which are specialized in learning temporal dependencies from each frequency band. The outputs of the LSTM models are then aggregated, often by reconstructing the signal through an inverse wavelet transform (IWT), to provide the final forecast.
Mathematically, the wavelet decomposition of the time-series
can be expressed as:
Where
represents the low-frequency components (approximation), and
represents the high-frequency components (details). These components are then processed individually through LSTM layers to predict future values:
Finally, the predicted components are combined to form the overall forecast. The Wavelet LSTM approach is particularly useful in scenarios with complex, noisy, or multi-scale data, such as financial markets, where different time frames may show distinct patterns.
Evaluating models goes beyond simply checking their accuracy; it's about assessing their reliability in making predictions. The accuracy and reliability are crucial in making forecast. There are various metrics which evaluate the models, including MSE, RMSE, and R-squared. Various studies have used the MSE, RMSE, and R-Square for the evaluation of forecasting accuracy of the machine learning models (Alqahtani & Abdelhafez, 2023; Aziz et al., 2022; Chen, 2023; Kandil et al., 2023; Muganda & Kasamani, 2023; Nanthiya et al., 2023; Prakash & Singh, 2022; Wang et al., 2022; Ahmed and Kaur, 2025).
In machine learning, MSE and RMSE gauge how accurate a model's predictions are by capturing the average difference among forecasted and observed values. MSE acts like a trainer during the learning process, guiding the model to minimize this error. It excels at penalizing the large mistakes because it penalizes huge differences more heavily. However, MSE is measured in squared units, different from the target variable's unit, making interpretation a bit complex. To address this, RMSE simply takes the root square of MSE, presenting the error in the same units as the target variable for easier understanding. Both metrics are popular for their focus on large errors but are also sensitive to outliers, so keep the target variable's unit in mind when evaluating the error values. The R squared focuses on the part of variance explained in the dependent variable by the model. It essentially quantifies how well the model fits the data, indicating the degree to which it can explain the target variable's variance. R-squared is an easy-to-understand measure of model fit, and varies from zero to one, where one signifies a perfect fit. While it's scale-independent, adding more predictors can artificially inflate R-squared. Importantly, R-squared about the model's predictive power; it simply helps us understand the proportion of variance explained by the factors considered in the model.
Results and Findings
The graph in Fig. 1 illustrates a comparison between the actual and predicted INR exchange rates of test data using a Wavelet-LSTM model. The data was split in ratio of 80:20. The test data predictions are based on the trained data by the used algorithm. The blue line represents the actual exchange rate, showing real-world fluctuations over time, marked by periods of sharp increases and decreases. In contrast, the orange line represents the model's predictions, which, while capturing the general trend, appear much smoother and show fewer sharp fluctuations than the actual data. From graph, it can be observed that is the underestimation of volatility by the Wavelet-LSTM model. Although the model successfully tracks the overall trend, it struggles to predict the more rapid and extreme changes in the exchange rate. The predicted line, while aligned with the overall direction of movement, fails to capture the sharp spikes or dips that are evident in the actual data. This suggests that the model captures long-term trends effectively but may need further refinement to accurately model short-term volatility. Overall, the Wavelet-LSTM model performs well in forecasting general trends but leaves room for improvement in handling rapid and complex fluctuations in exchange rates.
The graph in Fig. 2 indicate the predicted and actual values of Chinese Yuan. The blue line represents the actual CNY exchange rate, showing its real fluctuations over time. In this case, the model appears to perform well, closely following the actual exchange rate movements with a good degree of accuracy. The model tracks trends and volatility, with the predicted line closely mirroring the peaks, troughs, and general direction of the actual data. While there are some minor discrepancies, particularly around the sharper changes in the exchange rate, the predicted values align well with the overall pattern of the actual exchange rate. This suggests that the Wavelet-LSTM model is effectively capturing both short-term fluctuations and long-term trends in the CNY exchange rate. In summary, the Wavelet-LSTM model demonstrates strong predictive performance in this scenario, offering accurate forecasts that closely match the actual exchange rate, indicating that it successfully handles both trends and volatility.
The graph in Fig. 3 compares the actual and predicted RUB (Russian Ruble) exchange rates using a Wavelet-LSTM model. It can be noted, one key observation is that the predicted values track the overall trend of the actual exchange rate relatively well, especially in the earlier phases (up to around the 40th observation). The model captures the upward trend in the exchange rate accurately, though it tends to smooth out some of the sharper fluctuations. In the later phase of the graph, particularly after the 60th observation, the predicted values start to deviate more from the actual values, missing some of the rapid rises and dips. Overall, while the model is effective at predicting the general trend and long-term movements, it struggles with the more volatile behavior of the Ruble in this case. The predicted line is smoother, which indicates that while the model captures overall direction well, it underestimates the magnitude of short-term fluctuations.
The graph in Fig. 4 compares the Actual Brazilian Real Exchange Rate with the Predicted Brazilian Real Exchange Rate using the Wavelet-LSTM method. The horizontal axis represents time, while the vertical axis shows the exchange rate values ranging from 7.6 to 8.8. Throughout the graph, the predicted values closely follow the actual exchange rates, indicating the effectiveness of the Wavelet-LSTM model in forecasting. There are several points where both lines peak and dip simultaneously, suggesting accurate predictions. However, there are also instances where the lines diverge, highlighting areas where the model’s predictions were less accurate. This analysis is crucial for economists and investors who rely on accurate exchange rate forecasts for decision-making.
The graph in Fig. 5, compares the Actual South African Rand Exchange Rate with the Predicted South African Rand Exchange Rate using the Wavelet-LSTM method. The horizontal axis represents time, spanning from 0 to approximately 80 units, while the vertical axis shows the exchange rate values ranging from 25 to 31. Throughout the graph, the predicted values closely follow the actual exchange rates, indicating the effectiveness of the Wavelet-LSTM model in forecasting. The actual exchange rate line has sharper turns and more pronounced peaks, while the predicted rate line appears smoother but follows a similar overall trend. This analysis is crucial for economists and investors who rely on accurate exchange rate forecasts for decision-making.
Source – Author’s Work
Source – Author’s Work
Evaluation Metrics
The Table 2 provides evaluation metrics for predicting exchange rates of five BRICS currencies (INR, CNY, RUBLE, BLR, and ZAR) using Wavelet-LSTM model and evaluates the performance on both test and training datasets. The key metrics include Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R-squared (R²) for both test and train sets, offering insights into the model's accuracy and generalization.
For the Indian Rupee (INR), the test MSE is 2009.1 and RMSE is 44.820, which represent the squared and root mean differences between the predicted and actual values. These relatively high error values indicate significant deviations in the model's predictions for the INR. Moreover, the negative R² (-0.5081) on the test set shows that the model performs worse than a simple mean-based prediction, indicating poor fit. However, in the training set, the model performs better with an MSE of 2909.6, an RMSE of 53.940, and a strong R² of 0.9628, suggesting overfitting as the model fits well on the training data but poorly on unseen test data. For the Chinese Yuan (CNY), the model performs well on both test and train sets. The test MSE is 0.0310 and RMSE is 0.1761, indicating very low errors, and the R² of 0.8432 shows that the model explains a large portion of the variance in the data. The performance improves even further on the training set with an MSE of 0.0141, RMSE of 0.1191, and an R² of 0.9771, indicating excellent predictive power with minimal error and high explanatory strength.
For the Russian Ruble (RUBLE), the model shows relatively large error values on both the test and train sets due to the high MSE (111,517 on the test set and 203,398 on the train set), resulting in high RMSE values (1056.0 for test and 450.99 for train). Despite the large errors, the R² values of 0.8284 (test) and 0.9708 (train) indicate that the model captures a significant portion of the variance in the data, particularly in the training set, which again suggests possible overfitting. For the Brazilian Real (BLR), the model performs decently, with a test MSE of 0.4518 and RMSE of 0.6722, meaning that the errors are moderate. The R² value of 0.7186 on the test set indicates that the model explains around 72% of the variance, while in the training set, with an MSE of 1.1258 and RMSE of 1.0610, the model performs even better, reflected in a high R² of 0.9814. This shows that the model fits the training data well and generalizes reasonably to the test data. For the South African Rand (ZARSA), the model performs poorly on the test set, with an MSE of 259.73 and RMSE of 16.120, and an R² of -0.0968, indicating that the model is not capturing the relationship between the variables effectively. However, on the training set, the performance is much better, with an MSE of 85.700, RMSE of 9.2570, and an R² of 0.9492, which highlights a strong fit on the training data but a significant drop in accuracy when applied to the test set.
In summary, the model tends to perform better on the training data than on the test data across most currencies, suggesting overfitting issues, particularly for INR and ZARSA. The CNY model shows strong predictive performance on both test and training datasets, making it the most reliable among the currencies.
Table 2
| | MSE _Test | RMSE _Test | R2_Test | MSE_ Train | RMSE_ Train | R2_Train |
|---|
INR | 2009.1 | 44.823 | -0.5080 | 2909.6 | 53.941 | 0.9627 |
CNY | 0.0310 | 0.1761 | 0.8432 | 0.0141 | 0.1190 | 0.9771 |
RBL | 111517 | 1056.0 | 0.8284 | 203398 | 450.99 | 0.9707 |
BLR | 0.4518 | 0.6721 | 0.7186 | 1.1257 | 1.0610 | 0.9813 |
ZAR | 259.73 | 16.116 | -0.0968 | 85.698 | 9.2573 | 0.9492 |
Source – Author’s Work
Conclusion and Implications
The analysis of predictive performance for the exchange rates of BRICS currencies (INR, CNY, RUB, BLR, and ZAR) reveals distinct trends in the model's ability to generalize and accurately forecast. The evaluation metrics, including MSE, RMSE, and R², provide valuable insights and highlight key areas for future model improvements and practical applications.
The model exhibits overfitting for several currencies, particularly INR and ZAR. While the training set results show strong performance with high R² values, there is a significant drop in test set performance, indicating limited generalization capabilities. This disparity underscores the need for improved regularization and validation techniques to enhance the model’s ability to generalize effectively. In contrast, the Chinese Yuan (CNY) stands out as a benchmark, showing strong performance on both training and test datasets. The model produces low errors and high R² values, suggesting it effectively captures the dynamics of CNY exchange rates and provides a reliable benchmark for robust predictive performance. The model’s predictive accuracy varies considerably across different currencies. For INR and ZAR, poor performance on the test sets suggests that the model struggles to generalize, pointing to potential issues with data structure, feature selection, or model complexity. RUB and BLR show moderate performance, indicating that fine-tuning is required to improve generalization while maintaining strong training set results. The CNY, however, shows reliable predictions, demonstrating that the model is well-suited to certain exchange rate behaviours, likely influenced by smoother patterns or higher-quality data.
The findings have several implications for future improvements. First, the presence of overfitting suggests the need for strategies like reducing model complexity, employing regularization techniques, or incorporating cross-validation to improve generalization and prevent over-reliance on training data. Additionally, the poor performance for INR and ZAR may be attributed to noisier or less predictable data, suggesting that enhancing data preprocessing, incorporating additional explanatory variables, or using domain-specific insights could improve predictive accuracy. The varied performance across currencies indicates that a one-size-fits-all approach may not be optimal. Developing currency-specific models tailored to the unique characteristics of each exchange rate could yield better results. In practical terms, stakeholders relying on exchange rate forecasts can have confidence in the model's robustness for CNY, whereas further caution and refinement are necessary for other currencies, particularly INR and ZAR. Future research should explore advanced methods, such as ensemble learning or deep learning approaches, to address overfitting and improve model robustness across all currencies. By addressing the identified limitations and leveraging the insights from CNY’s performance, the model’s applicability can be broadened, ultimately enhancing its utility in forecasting exchange rates in diverse economic contexts.