Computer Science > Computer Vision and Pattern Recognition

arXiv:1801.01315 (cs)

[Submitted on 4 Jan 2018]

Title: PixelLink: Detecting Scene Text via Instance Segmentation

Authors: Dan Deng, Haifeng Liu, Xuelong Li, Deng Cai

Abstract: Most state-of-the-art scene text detection algorithms are deep-learning-based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a kind of semantic segmentation that contains full location information in itself. However, text instances in scene images often lie very close to each other, making them very difficult to separate via semantic segmentation. Therefore, instance segmentation is needed to address this problem. In this paper, PixelLink, a novel scene text detection algorithm based on instance segmentation, is proposed. Text instances are first segmented out by linking pixels within the same instance together. Text bounding boxes are then extracted directly from the segmentation result without location regression. Experiments show that, compared with regression-based methods, PixelLink can achieve better or comparable performance on several benchmarks, while requiring many fewer training iterations and less training data.
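
The two steps named in the abstract (linking pixels of the same instance, then reading boxes off the segmentation) can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not specify the neighborhood, thresholds, or box type, so the sketch assumes already-thresholded binary maps, only "right" and "down" links treated as undirected, and axis-aligned boxes. The names `link_pixels`, `OFFSETS`, `text_mask`, and `links` are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumption: only two link directions, treated as undirected.
OFFSETS: Dict[str, Tuple[int, int]] = {"right": (0, 1), "down": (1, 0)}

Box = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive


def _find(parent: List[int], i: int) -> int:
    """Union-find root lookup with path halving."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def link_pixels(text_mask: List[List[int]],
                links: Dict[str, List[List[int]]]) -> List[Box]:
    """Group text pixels joined by positive links; return one box per group."""
    rows, cols = len(text_mask), len(text_mask[0])
    parent = list(range(rows * cols))

    # Step 1: merge neighboring text pixels whose link is positive.
    for r in range(rows):
        for c in range(cols):
            if not text_mask[r][c]:
                continue
            for name, (dr, dc) in OFFSETS.items():
                nr, nc = r + dr, c + dc
                if nr >= rows or nc >= cols:
                    continue
                if text_mask[nr][nc] and links[name][r][c]:
                    a = _find(parent, r * cols + c)
                    b = _find(parent, nr * cols + nc)
                    if a != b:
                        parent[b] = a

    # Step 2: read a bounding box directly off each linked group,
    # with no location regression involved.
    boxes: Dict[int, Box] = {}
    for r in range(rows):
        for c in range(cols):
            if not text_mask[r][c]:
                continue
            root = _find(parent, r * cols + c)
            if root in boxes:
                t, l, b, rt = boxes[root]
                boxes[root] = (min(t, r), min(l, c), max(b, r), max(rt, c))
            else:
                boxes[root] = (r, c, r, c)
    return sorted(boxes.values())


# Two touching instances: every pixel is text, but the links between
# columns 1 and 2 are negative, so the groups stay separate.
mask = [[1, 1, 1, 1],
        [1, 1, 1, 1]]
links = {"right": [[1, 0, 1, 0],
                   [1, 0, 1, 0]],
         "down": [[1, 1, 1, 1],
                  [0, 0, 0, 0]]}
result = link_pixels(mask, links)
```

In this toy input a plain text/non-text mask would merge everything into one region, whereas the link map keeps the two adjacent instances apart, which is the separation problem the abstract describes.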

Comments:	AAAI-2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1801.01315 [cs.CV]
	(or arXiv:1801.01315v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1801.01315

Submission history

From: Dan Deng
[v1] Thu, 4 Jan 2018 11:48:21 UTC (888 KB)
