Section 01
EDGAR Dataset Guide: A Large Language Model-Driven Tool for Automated Extraction of Geopolitical Events
The EDGAR dataset released by the HCSS Data Lab uses large language models to automatically extract geopolitical events from English news. It adopts 16 event types defined by the PLOVER ontology, extends trilateral roles to capture multilateral interactions, provides structured event data for international relations research, and uses the CC BY 4.0 open license to support academic research and secondary development.