Detecting and Recognizing Text in Natural Images
- Baoguang Shi | Huazhong University of Science and Technology (HUST)
Text in natural images possesses rich information for image understanding. Detecting and recognizing text facilitates many important applications. From a computer vision perspective, text is a structured object made of characters arranged in a line or curve. The unique characteristics of text makes its detection and recognition problems different than that of general objects. In the first part of this talk, I will introduce our recent work on text detection, where we decompose long text into smaller segments and the links between them. A fully-convolutional neural network model is proposed to detect both segments and links at different scales in a single forward pass. In the second part, I will introduce our work on text recognition, where we tackle the structural recognition problem with an end-to-end neural network that outputs character sequences from image pixels. We further incorporate a learnable spatial transformer into this network, in order to handle text of irregular shape with robustness.
-
-
Gang Hua
Principal Researcher/Research Manager
-
-
Watch Next
-
Magma: A foundation model for multimodal AI Agents
- Jianwei Yang
-
AI for Precision Health: Learning the language of nature and patients
- Hoifung Poon,
- Ava Amini,
- Lili Qiu
-
-
-
-
-
-
-
Research Forum: Closing Remarks and Announcements
- Ashley Llorens,
- Hanna Wallach,
- Daniela Massiceti
-