Section 01
[Main Post/Introduction] RealBench: Bringing Code Generation Evaluation Back to Real Software Development Scenarios
The new benchmark RealBench introduces UML design diagrams and natural language requirements, bridging the gap between existing code generation benchmarks and real enterprise-level development scenarios, and revealing the capabilities and limitations of LLMs in real software development. Keywords: code generation, LLM, benchmark, software development, UML, enterprise application, AI programming assistant.