NEURO-ADAPTIVE MODEL FOR FINANCIAL FORECASTING


Romanian Journal of Economic Forecasting – 3/2007
Institute of Economic Forecasting


Iulian NASTAC
Emilian DOBRESCU
Elena PELINESCU


Source of information: http://www.ipe.ro/rjef/rjef3_07/rjef3_07_2.pdf




ABSTRACT

    The paper advances an original artificial intelligence-based mechanism for specific economic predictions. The aim is to forecast the exchange rate of the euro versus the Romanian currency using a large set of financial data. The possible influence of specific forecasting indicators (such as the Sibiu Futures Stock Exchange market) on the evolution of the exchange rate in Romania is also analyzed. The time series under discussion are inherently non-stationary, which implies that their distribution changes over time. Recent data points may therefore provide more important information than data points from the distant past. We propose a new adaptive retraining mechanism to take this characteristic into account. The algorithm establishes how a viable structure of an artificial neural network (ANN) at a previous moment of time can be retrained in an efficient manner, in order to support modifications of a complex input-output function of a financial forecasting system in which all the inputs and outputs vary dynamically and different time delays might occur. A “remembering process” for the knowledge acquired in previous learning phases is used to enhance the accuracy of the predictions.
    The results show that the first training (which includes the search for the optimal architecture) always takes a relatively long time, but that the system can then be retrained very easily, since there are no changes in the structure. The advantage of the retraining procedure is that some relevant aspects are preserved (“remembered”) not only from the immediately previous training phase, but also from the one before it, and so on. A kind of “slow forgetting process” also occurs; thus it is much easier for the ANN to remember specific aspects of the previous training than of the first training.
    The experiments reveal the high importance of the retraining phase as an upgrading/updating process, as well as the effect of ignoring it. The test error decreased as successive retraining phases were performed and the neural system accumulated experience.

    Keywords: Neural networks, exchange rate, adaptive retraining, delay vectors, iterative simulation

2. THE MODEL ARCHITECTURE

    Time delay, or dead time, is frequently encountered in financial systems. It is well known that feedback control in the presence of time delay raises particular difficulties, since a delay limits how quickly new information can influence the system.
    Figure 1 shows our idea of training a feed-forward ANN so that it becomes a predictor. We use delayed values of more than 30 input variables (see the final part of this section) to simulate the current state of the EUR/ROL exchange rate. For learning purposes, the network inputs comprise many blocks with several time-delayed values of the financial system inputs, and fewer blocks with delayed values of the system output. The ANN target output is the current value of the corresponding EUR/ROL exchange rate. Therefore, the system tries to match the current value of the output by properly adjusting a function of the past values of the inputs and of the output (Figure 1).
    At the current moment, t, the output (see Figure 1) is affected by the P inputs at different previous time steps
(t - i_d1, …, t - i_dn), and also by the outputs at other previous time steps (t - o_d1, …, t - o_dm), respectively. We denote by In_Del and Out_Del two delay vectors that include the delays that we take into account:

In_Del = [i_d1, i_d2, ..., i_dn]          (1)

Out_Del = [o_d1, o_d2, ..., o_dm]          (2)


     where n > m.

    For In_Del, we use various delay vectors with n = 7, 8 or 9 elements, whose values lie within a range of twenty days. For Out_Del, we employ different combinations with m = 3, 4 or 5 elements, covering about one week. The distribution of the vector elements is preferably (but not compulsorily) chosen to resemble a Gamma distribution. The elements of each vector are in ascending order; consequently, the maximum values of the delay vectors are i_dn and o_dm, respectively. The recurrent relation performed by the model is as follows:

y(t + 1) = F(X(t + 1 - In_Del(i)), y(t - Out_Del(j)))      (3)

    where X is the input vector, i = 1, ..., n and j = 1, ..., m.
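
    As an illustration of how the delayed input-output pairs described in this section can be assembled, the following Python sketch builds training samples from a multivariate series; the concrete delay values, array names and synthetic data are illustrative only and are not those of the actual model.

import numpy as np

# Illustrative delay vectors (ascending, roughly Gamma-like spacing):
# n = 7 input delays within about twenty days, m = 3 output delays within a week.
In_Del  = np.array([1, 2, 3, 5, 8, 13, 20])
Out_Del = np.array([1, 2, 5])

def build_training_pairs(X, y, in_del=In_Del, out_del=Out_Del):
    """X: (T, P) matrix of P input indicators; y: (T,) exchange-rate series.
    Each sample concatenates the delayed inputs X[t - i_d] and the delayed
    outputs y[t - o_d] used to predict the target y[t]."""
    start = int(max(in_del.max(), out_del.max()))    # earliest usable time step
    samples, targets = [], []
    for t in range(start, len(y)):
        delayed_inputs  = X[t - in_del, :].ravel()   # n * P delayed input values
        delayed_outputs = y[t - out_del]             # m delayed output values
        samples.append(np.concatenate([delayed_inputs, delayed_outputs]))
        targets.append(y[t])
    return np.asarray(samples), np.asarray(targets)

# Example with synthetic data: 300 days of 5 indicators and a synthetic rate series.
rng = np.random.default_rng(0)
S, T = build_training_pairs(rng.normal(size=(300, 5)), rng.normal(size=300))
print(S.shape, T.shape)   # (280, 38) and (280,): 7*5 delayed inputs + 3 delayed outputs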
    We use feed-forward ANNs with two hidden layers in order to achieve a good approximation function, based on our preliminary research, where we obtained better results with two hidden layers than with one, while maintaining a similar ratio (approx. 5/1) between the number of training samples and the total number of weights. The ANN models, depicted in Figure 1, use training sets of V - i_dn input-output pairs for model adaptation (see the next section), where V = 2240 is the initial time-step interval employed for training. Once we have established all the influences on the output at moment t, we apply Principal Component Analysis (PCA) (Jackson, 1991) to reduce the dimensionality of the input space and to de-correlate the inputs. Before applying PCA, we preprocess the inputs and outputs by replacing the missing data with the previously available values and then applying normalization. Data preprocessing prepares the raw data for the forecasting model and turns it into a format that can be processed more easily and effectively. Finally, we apply the reverse of the normalization, in order to de-normalize the simulated outputs. Data preprocessing and postprocessing are essential steps of the knowledge discovery process in real-world applications and, if correctly carried out, they greatly improve the network's ability to capture valuable information (Hagan et al., 1996; Basheer et al., 2000).
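
    A minimal Python sketch of this preprocessing chain (forward-filling of missing data, normalization, PCA reduction and later de-normalization) is given below; the zero-mean/unit-variance scaling, the retained-variance threshold and the synthetic data are assumptions for illustration, not the settings used in the paper.

import numpy as np
from sklearn.decomposition import PCA

def forward_fill(data):
    """Replace missing values (NaN) with the previously available value."""
    data = data.copy()
    for col in range(data.shape[1]):
        for t in range(1, data.shape[0]):
            if np.isnan(data[t, col]):
                data[t, col] = data[t - 1, col]
    return data

def normalize(data):
    mean, std = data.mean(axis=0), data.std(axis=0)
    return (data - mean) / std, mean, std

def denormalize(scaled, mean, std):
    """Reverse of the normalization, applied to the simulated outputs."""
    return scaled * std + mean

# Synthetic stand-in for the indicator matrix, with a few missing points.
rng = np.random.default_rng(0)
raw = rng.normal(size=(200, 5))
raw[rng.random(raw.shape) < 0.02] = np.nan
raw[0] = 0.0                                   # ensure the first row is observed
inputs, in_mean, in_std = normalize(forward_fill(raw))
pca = PCA(n_components=0.99)                   # retained variance is an assumed threshold
reduced_inputs = pca.fit_transform(inputs)     # lower-dimensional, de-correlated inputs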
    Our approach involves a number P of variables (more than 30). The statistical data have different frequencies, such as:
  • daily frequency (forex exchange rate, future exchange rate for one month and the BET index);
  • quarterly frequency (GDP, the share of consolidated budget in GDP);
  • monthly frequency (CPI, interest rate, exports and imports of goods and services, etc).
    In order to use all these data with different frequencies, we decided to transform them into higher-frequency (daily) data, on the basis of a natural mechanism of the market operators' behavior, which implies keeping the information unchanged between two updating time steps. For example, if we have only annual data, we keep the value unchanged during the 365 or 366 days of the year. For the days without transactions, we keep the previous transaction figure, in order to have a complete data series.
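
    The following pandas sketch illustrates this frequency alignment: a lower-frequency series is held constant until its next update, and non-trading days inherit the previous transaction value. The series names and values are purely illustrative.

import pandas as pd

# Quarterly indicator (e.g., GDP-type data) held constant over each day of the quarter.
quarterly = pd.Series([100.0, 102.5, 104.1],
                      index=pd.to_datetime(["2005-03-31", "2005-06-30", "2005-09-30"]))
quarterly_daily = quarterly.resample("D").ffill()

# Daily market series with gaps on non-trading days, forward-filled with the
# previous transaction figure to obtain a complete data series.
fx = pd.Series([3.61, 3.63, 3.65],
               index=pd.to_datetime(["2005-06-01", "2005-06-02", "2005-06-06"]))
fx_daily = fx.asfreq("D").ffill()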
    In our model we used two different kinds of data:
  • Statistical data that were classified as:
    • General data that characterize the macroeconomic development of Romania (9 indicators);
    • Specific data that are directly linked to the exchange rate evolution (11 indicators);
    • External data that refer to significant indicators of external market evolution, focused on the European Union market and the US market (6 indicators).
  • Forecasting data that were also classified as:
    • General data that characterize the macroeconomic development of Romania (10 indicators);
    • Specific data that are directly related to the exchange rate evolution (4 indicators);
    • External data could also be used, but we decided to leave them for a further investigation that is beyond the scope of this paper.
    All the previous indicators used in the model are presented in Table 1.
    • Additionally, we introduced a Month indicator L (the days of January are denoted by 1, the days of February by 2 and so on).
  • We also tested the influence of three other supplementary inputs, which represent the “Sibiu Futures exchange rate of one month” for the EUR/ROL, USD/ROL and EUR/USD exchange rates.
    The period covered by the analysis runs from the beginning of 2000 to the end of 2006. The possible connection between the exchange rate and the other mentioned variables has been checked using Granger causality tests (Granger, 1969; Granger, 1988), computed for different numbers of lags (starting from 26). As expected for a daily analysis, such interdependence is clearly revealed in the case of 1-2 lags, so that an artificial neural network based on the previously mentioned variables can be considered economically consistent.
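
    For reference, a Granger causality check of this kind can be reproduced with statsmodels as in the sketch below; the synthetic series and the lag count are placeholders, not the data or lag range of this study.

import numpy as np
import pandas as pd
from statsmodels.tsa.stattools import grangercausalitytests

rng = np.random.default_rng(0)
df = pd.DataFrame({
    "eur_rol":   rng.normal(size=500).cumsum(),   # stand-in for the exchange rate
    "bet_index": rng.normal(size=500).cumsum(),   # stand-in for a candidate indicator
})

# Tests whether lagged values of the second column help predict the first column.
results = grangercausalitytests(df[["eur_rol", "bet_index"]], maxlag=2)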

5. CONCLUSIONS

        The ability of ANNs to extract significant information from their training data provides a valuable framework for representing the relationships present in the structure of the data. This allows both interpolation among the a priori defined points and extrapolation outside the range bounded by the extreme points of the training set.
        The evaluation of the test error shows that the adaptive retraining technique can gradually improve, on average, the achieved results. Our practical experience reveals that the first training (which includes the search for the optimal architecture) always takes a relatively long time, but that the system can then be retrained very easily, as there are no changes in the structure. The great advantage of the retraining technique is that some relevant aspects are preserved (remembered) not only from the immediately previous training phase, but also from the one before it, and so on. A kind of slow forgetting process also occurs; thus it is much easier for the ANN to remember specific aspects of the previous training than of the first training. This means that information accumulated during earlier trainings is slowly forgotten, and the learning process adapts to the newest evolution of the financial process.
        In the presented applications, the optimum shifting time for the next retraining is one day. In this way, the model can be quickly updated using the retraining procedure. Nevertheless, the graphs of the predictions show that the system can still provide correct values without retraining for several days, but there is a major risk of missing unexpected changes in the financial environment.
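
        A schematic Python sketch of this daily retraining loop is given below: the network fitted at one step is reused, with its weights preserved, as the starting point for the next step after the training window is shifted by one day. scikit-learn's MLPRegressor with warm_start=True merely stands in for the SCG-trained ANN of the paper, the architecture search of the first training is not shown, and all window sizes, hyper-parameters and data are illustrative.

import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
samples = rng.normal(size=(2300, 38))          # delayed-input feature vectors (synthetic)
targets = rng.normal(size=2300)                # exchange-rate values to be predicted (synthetic)

window = 2240                                  # initial training interval (V in Section 2)
net = MLPRegressor(hidden_layer_sizes=(20, 10), warm_start=True,
                   early_stopping=True, max_iter=500, random_state=0)

predictions = []
for shift in range(30):                        # retrain once per day for 30 days
    X_train = samples[shift: shift + window]
    y_train = targets[shift: shift + window]
    net.fit(X_train, y_train)                  # continues from the previously learned weights
    next_day = samples[shift + window].reshape(1, -1)
    predictions.append(net.predict(next_day)[0])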
        We remark that some supplementary parameters (like the Sibiu Futures exchange rate of one month) did not improve the values of the ERR as we initially expected. The reason for this outcome is very probably the inconsistent quality of these three supplementary indicators, which sometimes exceeds the limits of acceptable precision; moreover, the real market did not take their influence into account. Our next approach will involve the one-month prediction, in order to allow a direct comparison with the Sibiu Futures Stock Exchange market. The results of forecasting the exchange rate one day ahead suggest that this technique could be extended to longer forecasting horizons (a week, a month, 3 months, 6 months or more) without difficulty, and we intend to present these simulations in another paper.
        The final remark refers to the basic training algorithm. Although the SCG algorithm is not the fastest one, its great advantage is that it works very efficiently for networks with a large number of weights. The SCG is something of a compromise: it does not require large computational memory, and yet it still has good convergence and is very robust. Furthermore, we always apply the early stopping method (validation stop) during the training process, in order to avoid the over-fitting phenomenon. It is well known that, for early stopping, one must be careful not to use an algorithm that converges too rapidly; the SCG is well suited for the validation-stop method. Nevertheless, it is quite easy to replace the SCG algorithm with another one, since the adaptive retraining technique is flexible and independent of the basic training algorithm.
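
        As a closing illustration, the validation-stop rule can be written independently of the underlying optimizer, as in the sketch below; train_one_epoch and validation_error are hypothetical callbacks (for instance, one SCG iteration and the error on a held-out validation set), and the patience value is an assumption.

import copy

def train_with_validation_stop(model, train_one_epoch, validation_error,
                               max_epochs=1000, patience=10):
    """Stop when the validation error has not improved for `patience` epochs
    and return the best model observed so far."""
    best_err = float("inf")
    best_model = copy.deepcopy(model)
    stalled = 0
    for epoch in range(max_epochs):
        train_one_epoch(model)                 # one optimization pass (e.g., an SCG step)
        err = validation_error(model)
        if err < best_err:
            best_err, best_model, stalled = err, copy.deepcopy(model), 0
        else:
            stalled += 1
            if stalled >= patience:
                break                          # validation stop
    return best_model, best_err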