ICPR 2020 - Competition on Harvesting Raw Tables from Infographics

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

22 Scopus citations

Abstract

This work summarizes the results of the second Competition on Harvesting Raw Tables from Infographics (ICPR 2020 CHART-Infographics). Chart Recognition is difficult and multifaceted, so for this competition we divide the process into the following tasks: Chart Image Classification (Task 1), Text Detection and Recognition (Task 2), Text Role Classification (Task 3), Axis Analysis (Task 4), Legend Analysis (Task 5), Plot Element Detection and Classification (Task 6.a), Data Extraction (Task 6.b), and End-to-End Data Extraction (Task 7). We provided two sets of datasets for training and evaluation of the participant submissions. The first set is based on synthetic charts (Adobe Synth) generated from real data sources using matplotlib. The second is based on manually annotated charts extracted from the Open Access section of PubMed Central (UB PMC). More than 25 teams registered, of which 7 submitted results for different tasks of the competition. While results on synthetic data are near perfect at times, the same models still have room to improve when it comes to data extraction from real charts. The data, annotation tools, and evaluation scripts have been publicly released for academic use.

Original language: English
Title of host publication: Pattern Recognition. ICPR International Workshops and Challenges, 2021, Proceedings
Editors: Alberto Del Bimbo, Rita Cucchiara, Stan Sclaroff, Giovanni Maria Farinella, Tao Mei, Marco Bertini, Hugo Jair Escalante, Roberto Vezzani
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 361-380
Number of pages: 20
ISBN (Print): 9783030687922
State: Published - 2021
Event: 25th International Conference on Pattern Recognition Workshops, ICPR 2020 - Milan, Italy
Duration: Jan 10 2021 to Jan 11 2021

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12668 LNCS

Conference

Conference: 25th International Conference on Pattern Recognition Workshops, ICPR 2020
Country/Territory: Italy
City: Milan
Period: 01/10/21 to 01/11/21

Keywords

  • Chart dataset
  • Chart recognition
  • Competition
