Uncovering Tibet’s Oldest Religion through AI-enhanced Handwritten Text Recognition

14 November 2024, Version 1
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

This poster presents our AI-enhanced workflow to digitise a vast corpus of manuscripts, hand-written in an extremely difficult form of Tibetan. These offer a unique glimpse into archaic non-Buddhist rituals and narratives, enabling us to reconstruct Tibetan Pagan religion for the first time. The palm-leaf page layout differs from any previously trained models and the wide range of abbreviations, a mixture of scripts and languages, and varying image quality make the task extremely challenging. To tackle these challenges, we developed a unique collaborative Handwritten Text Recognition (HTR) workflow making optimal use of new technologies while bringing together a diverse range of expertise from multiple stakeholders. Central to this workflow is the custom-developed PechaTools that streamlines the process of transcribing from scratch and correcting computer-generated output, with multiple proofreading stages to ensure accuracy. Within six months, the above-sketched workflow enabled us to successfully train Layout Analysis (LA) and HTR models using Transkribus. We furthermore show how we improve results by providing detailed feedback to the correction teams (including customised glyph-lookup dictionaries), optimising parameters as well as fine-tuning. We also discuss the trade-off of getting more-readable results by using dictionaries and language models that yield more normalised text, which can be useful for a more general audience, but is highly detrimental to retrieving essential philological details. We also discuss postprocessing leading to the next steps of creating both diplomatic as well as normalised transcriptions to make these unique text collections maximally useful for researchers in a wide range of disciplines.

Keywords

Handwritten Text Recognition (HTR)
Collaborative NLP Pipelines
Tibetan
AI-enhanced Workflows
Machine Learning
Philology
Corpus Creation

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.