The 21st ACM Symposium on Document Engineering

August 24, 2021 to August 27, 2021
Limerick, Ireland

Draft Programme - subject to change!!

This year DocEng will be a virtual event using Zoom for the video conference and Slack for instant messaging/collaboration.

All times below are IST (Irish Standard Time or UTC+1)

Day 0

Time (IST) Session
3:00 pm Tutorials 1 & 2 parallel

Day 1

Time (IST) Paper ID Session Type
3:00 pm Welcome note
3:15 pm Keynote 1
4:15 pm Break
Document content analysis
4:35 pm 32 Efficient Clustering of Short Text Streams using Online-Offline Clustering full
4:50 pm 77 Efficient Sparse Spherical k-Means for Document Clustering short
5:00 pm 35 Small-step pipelines reduce the complexity of XSLT/XPath programs short
5:10 pm 51 MTLV: A Library for Building deep multi-task learning Architectures short
5:20 pm 57 ELSKE: Efficient Large-Scale Keyphrase Extraction short
5:30 pm Break
Generation, manipulation & presentation
5:50 pm 3 Ordering Sentences and Paragraphs with Pre-trained Encoder-Decoder Transformers and Pointer Ensembles full
6:05 pm 80 SlideGen: An Abstractive Section-Based Slide Generator for Scholarly Documents short
6:15 pm 65 Engineering of An Artificial Intelligence Safety Data Sheet Document Processing System for Environmental, Health, and Safety Compliance short
6:25 pm The DocEng Book Series - Steve Simske and Nicki Dennis
6:35 pm Birds of a feather
7:35 pm End of Day

Day 2

Time (IST) Paper ID Session Type
3:00 pm Keynote 2
4:00 pm Break
Security & Sensitive Documents
4:20 pm 26 A Novel Approach on the Joined De-Identification of Textual and Relational Data with a Modified Mondrian Algorithm full
4:35 pm 21 Pornographic content classification using deep-learning full
4:50 pm 41 Counterfeit detection with QR Codes short
5:00 pm 66 Trustworthiness of spam email addresses using machine learning short
5:10 pm Break
Applications & User Experiences
5:30 pm 7 Recognizing Creative Visual Design: Multiscale Design Characteristics in Free-Form Web Curation Documents full
5:45 pm 56 Rescuing Historical Climate Observations to Support Hydrological Research: A Case Study of Solar Radiation Data short
5:55 pm 54 ALiBERT - Improved Automated List Inspection (ALI) with BERT short
6:05 pm 82 A Large-Scale Exploration of Terms of Service Documents on the Web short
6:15 pm 74 Metadata-Driven Eye Tracking for Real-Time Applications short
6:25 pm ACM Town hall
6:35 pm Networking/Free Chat
7:35 pm End of Day

Day 3

Time (IST) Paper ID Session Type
Systems for Visual Document Analysis
3:00 pm 16 Table-structure Recognition Method Using Neural Networks for Implicit Ruled Line Estimation and Cell Estimation full
3:15 pm 78 Evaluating Deep Neural Networks for Image Document Enhancement short
3:25 pm 39 Towards Extraction of Theorems and Proofs in Scholarly Articles short
3:35 pm 76 A Comparative Study on Methods and Tools for Handwritten Mathematical Expression Recognition short
3:45 pm 83 Text line extraction using deep learning and minimal sub seams short
3:55 pm 63 Direct Binarization: A Quality-and-Time Efficient Binarization Strategy short
4:05 pm 60 Challenges in Chart Image Classification: A Comparative Study of Different Deep Learning Methods short
4:15 pm Break
Collections, systems & management
4:35 pm 14 On Minimizing Cost in Legal Document Review Workflows full
4:50 pm 15 Heuristic Stopping Rules For Technology-Assisted Review full
5:05 pm 47 Shock Wave: a Graph Layout Algorithm for Text Analyzing short
5:15 pm 52 COVID-19 Multidimensional Kaggle Literature Organization short
5:25 pm Break
5:45 pm Binarisation challenge summary
5:55 pm Extractive text summarisation challenge summary
6:05 pm Birds of a Feather presentations
6:15 pm DocEng 2022
6:25 pm Best paper awards
6:35 pm Closing remarks
6:45 pm Networking/Free Chat
7:30 pm End of Symposium

Birds of a Feather

Charles Nicholas will serve as chair of this year's Birds of a Feather (BoaF) session. We invite you to go to the bof-general Slack channel or message Charles Nicholas on Slack with ideas or suggestions for discussion. The only constraint is that the topic must have some relationship to Document Engineering. These suggestions will be boiled down to a few specific topics and shared with the participants a day or so before the conference. We'll use Zoom to set up meeting spaces for those who want to take part in one or more BoaF discussions. Each session will have the chance to give a brief (two-minute) presentation on the last day of the conference.

Presentation Guidelines for Authors

DocEng'21 will hold four live sessions on each day of the conference, starting with the Tutorials on the 24th of August.

In order to facilitate an on-time schedule, we are requiring authors to pre-record their presentations. Long papers will have 10 minutes for their presentation recording with 5 minutes for a live Q&A. Short papers will have 7 minutes for their presentation recording with 3 minutes for a live Q&A.

There are several video conferencing tools available to easily record a presentation. In this method, you can show your face via webcam and display your slides as you talk. You can use any meeting software as long as you get a good quality recording and your final file is in the MP4 format. Here are some links to instructions on recording a meeting on common platforms:

So that the technical program committee can verify the videos, please upload your video using EasyChair no later than Friday, 13th August 2021.

Video Specifications:

Please contact if you have any questions.