Section 01
[Introduction] Doc2Table: Exploring Challenges and Solutions of Large Vision-Language Models in End-to-End Table Extraction
The Doc2Table project focuses on the application of Large Vision-Language Models (LVLM) in end-to-end document table extraction, covering core challenges of table extraction, advantages of LVLM, key components of the project (end-to-end framework, challenging benchmark tests, model comparison), as well as experimental findings and future directions.