Data Layer
Relies on high-quality data, including historical prices, financial statements, market indicators of S&P 100 components, etc., which may come from public data sources like Yahoo Finance, Alpha Vantage, or professional APIs. Data acquisition, cleaning, and preprocessing are key steps.
Feature Engineering
Constructs multiple types of features: technical aspects (moving averages, RSI, MACD, etc.), fundamental aspects (P/E ratio, P/B ratio, ROE, etc.), macro indicators, and complex features (volatility, liquidity, sector rotation signals, etc.). Feature selection directly affects the model's predictive ability.
Machine Learning Models
Uses models such as gradient boosting trees (XGBoost, LightGBM), random forests, and support vector machines to output stock ranking scores. A higher score indicates greater potential for future performance.
Portfolio Optimization
Selects stocks and allocates weights based on rankings, possibly using methods like mean-variance optimization and risk parity to balance returns and risks, while considering practical constraints such as transaction costs and liquidity limits.