Section 01
[Introduction] OpenRTLSet: An Open-Source Verilog Dataset for Hardware Design with Large Language Models
OpenRTLSet is an open-source Verilog dataset for hardware design with large language models. It releases over 130,000 samples, generates natural language descriptions using DeepSeek-R1, and supports fine-tuning of models like Qwen and Granite. Its aim is to address bottlenecks in hardware design automation, such as the scarcity of HDL training data and restrictions on commercial licensing.