UniText: A Unified Framework for Chinese Text Detection, Recognition, and Restoration in Ancient Document and Inscription Images

  • Lu Shen
  • , Zewei Wu
  • , Xiaoyuan Huang
  • , Boliang Zhang
  • , Su Kit Tang
  • , Jorge Henriques
  • , Silvia Mirri

Research output: Contribution to journalArticlepeer-review

Abstract

Processing ancient text images presents significant challenges due to severe visual degradation, missing glyph structures, and various types of noise caused by aging. These issues are particularly prominent in Chinese historical documents and stone inscriptions, where diverse writing styles, multi-angle capturing, uneven lighting, and low contrast further hinder the performance of traditional OCR techniques. In this paper, we propose a unified neural framework, UniText, for the detection, recognition, and glyph restoration of Chinese characters in images of historical documents and inscriptions. UniText operates at the character level and processes full-page inputs, making it robust to multi-scale, multi-oriented, and noise-corrupted text. The model adopts a multi-task architecture that integrates spatial localization, semantic recognition, and visual restoration through stroke-aware supervision and multi-scale feature aggregation. Experimental results on our curated dataset of ancient Chinese texts demonstrate that UniText achieves a competitive performance in detection and recognition while producing visually faithful restorations under challenging conditions. This work provides a technically scalable and generalizable framework for image-based document analysis, with potential applications in historical document processing, digital archiving, and broader tasks in text image understanding.

Original languageEnglish
Article number7662
JournalApplied Sciences (Switzerland)
Volume15
Issue number14
DOIs
Publication statusPublished - Jul 2025

Keywords

  • ancient Chinese characters
  • glyph restoration
  • text detection and recognition

Fingerprint

Dive into the research topics of 'UniText: A Unified Framework for Chinese Text Detection, Recognition, and Restoration in Ancient Document and Inscription Images'. Together they form a unique fingerprint.

Cite this