The design philosophy of the CGAP system is to decompose the case transfer process into three explicit reasoning stages, each with clear inputs, outputs, and verification standards.
The first stage is Scenario Modeling. The system first performs structured modeling of the target city's current situation, extracting key urban feature dimensions such as population density, transportation network, land use, economic industries, and ecological environment. The core task of this stage is to convert unstructured urban descriptions into intermediate representations that can be used for calculation and comparison.
The second stage is Case Retrieval. Based on the results of scenario modeling, the system retrieves potential reference cases from the case library. Unlike the simple similarity matching of traditional RAG, CGAP's retrieval considers multi-dimensional feature matching and introduces a transferability scoring mechanism, prioritizing cases that are similar to the target city in key dimensions and have achieved success in specific planning goals.
The third stage is Difference Analysis. This is the most distinctive part of CGAP. The system not only finds similar cases but also deeply analyzes the structural differences between the source case city and the target city, identifying which successful elements can be directly transferred, which need to be adjusted according to local conditions, and which are completely inapplicable. The results of difference analysis are output in a structured way, providing clear basis for planning decisions.
A key innovation of CGAP lies in its systematic use of Intermediate Representations. In traditional LLM applications, information usually flows at the natural language level, relying on the model's context understanding ability for implicit reasoning. This approach is prone to information loss and broken reasoning chains when dealing with complex tasks.
CGAP defines structured intermediate representation formats for each reasoning stage. The scenario modeling stage outputs standardized urban feature vectors; the case retrieval stage outputs a list of cases with metadata; the difference analysis stage outputs structured transfer suggestions. These intermediate representations not only improve the accuracy of information transmission but also allow each stage of the system to be independently verified and optimized.
More importantly, the intermediate representation mechanism enables CGAP to integrate multiple information sources and reasoning tools. For example, scenario modeling can combine Geographic Information System (GIS) data, statistical data, and text descriptions; case retrieval can integrate vector databases and rule engines; difference analysis can call specialized comparison algorithms and domain knowledge bases. This modular architecture greatly improves the system's flexibility and scalability.