Publications

ChemScraper: Graphics Extraction, Molecular Diagram Parsing, and Annotated Data Generation for PDF Images

Submitted to International Journal on Document Analysis and Recognition (IJDAR), 2024

Recommended citation: A. K. Shah, B. M. Amador, A. Dey, M. Creekmore, B. Ocampo, S. Denmark, and R. Zanibbi, “ChemScraper: Graphics Extraction, Molecular Diagram Parsing, and Annotated Data Generation for PDF Images,” International Journal on Document Analysis and Recognition (IJDAR), vol. 27, May 2024, submitted.

[url] [pdf] [code]

Line-of-Sight with Graph Attention Parser (LGAP) for Math Formulas

Published in International Conference on Document Analysis and Recognition (ICDAR), 2023

Recommended citation: A. K. Shah and R. Zanibbi, “Line-of-Sight with Graph Attention Parser (LGAP) for Math Formulas,” in Document Analysis and Recognition - ICDAR 2023, G. A. Fink, R. Jain, K. Kise, and R. Zanibbi, Eds., in Lecture Notes in Computer Science. Cham: Springer Nature Switzerland, 2023, pp. 401–419. doi: 10.1007/978-3-031-41734-4_25.

[url] [pdf] [poster] [video] [code]

Searching the ACL Anthology with Math Formulas and Text

Published in International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023

Recommended citation: B. Amador, M. Langsenkamp, A. Dey, A. K. Shah, and R. Zanibbi, “Searching the ACL Anthology with Math Formulas and Text,” in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, in SIGIR ’23. New York, NY, USA: Association for Computing Machinery, Jul. 2023, pp. 3110–3114. doi: 10.1145/3539618.3591803.

[url] [pdf] [poster] [code]