DocInsights 2026 - Workshop on Document Intelligence and Understanding, Co-located with EMNLP 2026, October 24-29, Budapest, Hungary
Full-Day Workshop Hybrid: Online + In-Person Submission Deadline: August 2, 2026

Overview

Documents are central to how knowledge is created, communicated, and acted upon in domains such as science, healthcare, law, finance, and government. Yet real-world documents are rarely plain text. They combine natural language with structured and semi-structured content such as tables, forms, charts, figures, lists, and layout cues. Understanding these documents requires models that can reason not only over text, but also over structure, visual organization, and cross-element relationships.

Recent advances in document foundation models, multimodal language models, and structure-aware NLP have significantly improved document understanding. However, major challenges remain: reasoning across text and tables, grounding model outputs in document evidence, handling long and multi-page documents, supporting multilingual and domain-specific documents, and evaluating systems under OCR, layout, and extraction noise.

DocInsights 2026 aims to bring together researchers and practitioners working at the intersection of NLP, Document AI, multimodal learning, information retrieval, and knowledge representation to advance the next generation of document intelligence systems.

The workshop focuses on moving beyond plain text toward methods that deeply understand the structure, content, and purpose of complex documents. We especially encourage work that highlights real-world document challenges and proposes methods for trustworthy, scalable, and practical document understanding.

Important Dates

All deadlines are 11:59 PM UTC-12:00 (“Anywhere on Earth”).

EventDate
Direct Submission DeadlineAugust 2, 2026
ARR Commitment DeadlineAugust 30, 2026
Acceptance NotificationsSeptember 13, 2026
Camera-ready DeadlineSeptember 27, 2026
Workshop DateDuring EMNLP 2026, October 24–29

News

Contact