Transformers Utilization in Chart Understanding: A Review of Recent Advances & Future Trends
View PDF
HTML (experimental)
Abstract:In recent years, interest in vision-language tasks has grown, especially those involving chart interactions. These tasks are inherently multimodal, requiring models to process chart images, accompanying text, underlying data tables, and often user queries. Traditionally, Chart Understanding (CU) relied on heuristics and rule-based systems. However, recent advancements that have integrated transformer architectures significantly improved performance. This pap...
Read more at arxiv.org