Section 01
OpenEnv Data Wrangler: A Standardized Test Environment for LLM Data Engineering Capability Evaluation
OpenEnv Data Wrangler is an OpenEnv-compliant evaluation environment designed to test large language models (LLMs) on complex data engineering and Pandas data processing tasks. It addresses the industry challenge of objectively and standardly assessing LLMs' real-world data engineering capabilities, filling the gap in specialized benchmarks for this domain while ensuring reproducibility and comparability of results.