Tabular data is the most common data form in industry. From financial risk control to medical diagnosis, from e-commerce recommendation to supply chain optimization, almost all industries rely on structured data for decision-making. However, when evaluating machine learning models on tabular data, researchers often face the following challenges:
- Data Leakage Issue: Information crossover between training and test sets inflates model performance metrics
- Unfair Comparison: Different models use different preprocessing flows or hyperparameter search strategies, leading to incomparable results
- Reproducibility Difficulty: Lack of standardized experimental procedures makes it hard for others to verify existing results
- Hardware Dependency: Many benchmarks default to using GPUs, ignoring the actual needs of CPU environments
The original intention of SmartML's design is to eliminate these obstacles and establish a truly fair, transparent, and reproducible evaluation system.