Section 01
Innovative Application Research of Open-Source Large Language Models in Software Metadata Entity Disambiguation (Introduction)
This article introduces a study that uses open-source large language models to solve the problem of software metadata entity disambiguation. Key contents include: constructing a multi-annotator benchmark dataset, comparing three reasoning strategies (direct prompting, self-consistency, multi-step agent-based), exploring feasible paths for high-precision entity resolution in noisy and heterogeneous data environments, and providing academic institutions and enterprises with new reproducible and controllable data governance ideas.