Research

Research Interests:

Pattern recognition, recognition of graphical structures, computer vision, speaker understanding, large language models, multi-modal deep learning, natural language processing.

Current work:

My work centers around designing fast, efficient, and interpretable parsers for recognizing mathematical formulas and chemical diagrams across multiple formats, including PDFs, typeset images, and handwritten strokes. Through graph attention-based techniques and the integration of Large Language Models (LLMs), I aim to enhance how contextual information is processed while preserving a natural and interpretable graph representation.

Past/ongoing research works: