This report synthesizes the findings from four financial prediction projects, each leveraging machine learning (ML), statistical, and neural network models to predict market prices (Close, Open, High, and Low). The assets analyzed include:
- Bitcoin
- EUR/USD Exchange Rate
- Gold
- S&P 500 Index (GSPC)
- Identify the most effective models across various financial domains.
- Highlight common patterns, challenges, and insights for AI applications in financial markets.
- Provide recommendations for practical implementation.
- Statistical Models: ARIMA, SARIMA
- Traditional Machine Learning Models: SVR, Random Forest, Gradient Boosting (CatBoost, XGBoost, LightGBM), k-Nearest Neighbors (kNN), Decision Trees
- Neural Networks: GRU, RNN (LSTM), BiLSTM, Transformer Models, WaveNet
- Hybrid and Other Models: Prophet, Temporal Convolutional Networks (TCN)
- R-squared: Measures goodness of fit (higher is better).
- Mean Squared Error (MSE): Measures prediction accuracy (lower is better).
- Mean Absolute Error (MAE): Evaluates average prediction errors (lower is better).
- Median Absolute Error (MedAE): Captures median deviation of errors (lower is better).
- Bitcoin: SVR and ARIMA excelled consistently with R-squared values above 0.98 across all price types.
- EUR/USD: SVR and ARIMA emerged as top contenders, with high R-squared and low MSE values.
- Gold: ARIMA and SVR demonstrated strong performance, particularly for Open and High price predictions.
- GSPC: SVR and ARIMA ranked highest, offering a balance of accuracy and interpretability.
- Models such as GRU, BiLSTM, and RNN (LSTM) underperformed compared to simpler statistical and traditional ML models in all cases.
- Complex architectures like Transformer Models and WaveNet exhibited significant challenges, including overfitting, negative R-squared values, and large errors.
- Neural networks are highly sensitive to dataset size and preprocessing quality, which often limited their effectiveness in these studies.
- Gradient Boosting methods (e.g., CatBoost, XGBoost) and Random Forest consistently ranked in the top tier but fell short of SVR and ARIMA.
- These models provided a balance of accuracy and computational efficiency, especially in larger datasets like EUR/USD and GSPC.
- Statistical models like ARIMA proved robust for time-series data, outperforming machine learning and neural networks in many scenarios.
- ML models like SVR provided a reliable alternative, combining simplicity with competitive accuracy.
- Transformer Models and WaveNet were the least effective across all datasets, with negative R-squared values and excessive prediction errors.
- Prophet, while useful for trend analysis, lagged in accuracy for point-wise price predictions.
- Dataset-Specific Performance: Simpler models (ARIMA, SVR) consistently outperformed complex ones in structured financial datasets.
- Neural Network Limitations: These models struggled due to inadequate feature representation, overfitting, or insufficient data volume.
- Hybrid Approaches: Combining ARIMA with ML methods could leverage the strengths of both for enhanced prediction capabilities.
- Top Recommendations: SVR and ARIMA are the most reliable models for financial market predictions, excelling across diverse assets and price metrics.
- Secondary Recommendations: Ensemble methods (CatBoost, XGBoost) provide a robust alternative with reasonable accuracy and computational cost.
- Caution with Neural Networks: Deploy neural networks only when large, high-quality datasets and sufficient computational resources are available.
- Hybrid Models: Explore combinations of ARIMA and ML techniques to optimize performance.
- Improved Preprocessing: Focus on feature engineering and normalization for neural network models.
- Expanded Datasets: Larger, richer datasets could unlock the potential of deep learning models.
- Use SVR or ARIMA for quick, accurate results in financial prediction tasks.
- Consider ensemble models like CatBoost or XGBoost for tasks requiring robust predictions with interpretability.
- Avoid deploying neural networks unless dataset size and computational resources justify their complexity.
This report underscores the effectiveness of traditional and ensemble methods in financial prediction tasks while highlighting the challenges of leveraging neural networks. It provides actionable insights for researchers and practitioners aiming to enhance forecasting accuracy across various financial domains.
The code for all the models used in this project is available on my GitHub repository: GitHub - taleblou. You are free to use it for your own projects.
You can access this research through the following links:
-
Predict-Price: This platform focuses on price and trend prediction using advanced AI models. It provides actionable insights for financial forecasting.
-
MagicalPrediction: This site offers daily predictions and signals derived from a variety of AI models, tailored for accurate market analysis.
-
MagicalAnalysis: A resource for daily technical analysis-based signals, designed for traders and analysts seeking data-driven insights.
This project is licensed under the MIT License.
For questions or contributions, please contact my.
Feel free to explore and contribute to this project!