Enhancement Spatial Transformer Networks for Text Classification

Ka Hou Chan, Sio Kei Im, Vai Kei Ian, Ka Man Chan, Wei Ke

研究成果: Conference contribution同行評審

6 引文 斯高帕斯(Scopus)

摘要

This paper introduces a 2D transformation based framework for arbitrary-oriented text detection in natural scene images. We present the localization networks within Spatial Transformer Networks (STN), which are designed to generate proposals with text orientation affine information including translation, scaling and rotation. This information will then be adapted as learning parameters to make the proposals to be fitted into the text regular form in terms of the orientation more accurately. Localization network is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. Compared with any previous text detection systems, this work ensures the relationship between the learning parameters, which can lead to a better approximation for orientation. As a result, this new layer greatly enhances the training accuracy. Moreover, the design and implementation can be easily deployed in the current systems built upon the standard CNNs architecture.

原文English
主出版物標題ICGSP 2020 - Proceedings of the 4th International Conference on Graphics and Signal Processing
發行者Association for Computing Machinery
頁面5-10
頁數6
ISBN(電子)9781450377812
DOIs
出版狀態Published - 26 6月 2020
事件4th International Conference on Graphics and Signal Processing, ICGSP 2020 - Nagoya, Virtual, Japan
持續時間: 26 6月 202028 6月 2020

出版系列

名字ACM International Conference Proceeding Series

Conference

Conference4th International Conference on Graphics and Signal Processing, ICGSP 2020
國家/地區Japan
城市Nagoya, Virtual
期間26/06/2028/06/20

指紋

深入研究「Enhancement Spatial Transformer Networks for Text Classification」主題。共同形成了獨特的指紋。

引用此