Overview
Key Features
- Multi-Factor Prediction: Integrated historical race data, weather conditions, driver performance metrics, and qualifying results
- Real-Time Dashboard: Built an interactive web application for exploring predictions and race analytics
- High Accuracy: Achieved 68.5% prediction accuracy across multiple racing seasons
- Historical Analysis: Comprehensive analysis of racing trends and performance patterns
Data Sources
- Race Results: Historical Formula 1 race results spanning multiple seasons
- Weather Data: Real-time and historical weather conditions for race circuits
- Driver Performance: Individual driver statistics, career performance, and recent form
- Circuit Analysis: Track-specific characteristics and historical performance data
- Qualifying Results: Starting positions and qualifying session performance
Technologies Used
- Python: Core development for data processing and machine learning
- Pandas & NumPy: Data manipulation and statistical analysis
- Scikit-learn: Machine learning model development and evaluation
- Dashboard Framework: Interactive web application for data visualization
- APIs: Integration with live racing and weather data sources
Machine Learning Approach
- Feature Engineering: Created comprehensive features from raw racing data including driver form, circuit characteristics, and weather impact
- Model Selection: Tested multiple algorithms including Random Forest, Gradient Boosting, and Neural Networks
- Ensemble Methods: Combined multiple models to improve prediction accuracy
- Validation: Used time-series cross-validation to ensure model robustness