Feature Enhancement Network: A Refined Scene Text Detector

Zhang, Sheng; Liu, Yuliang; Jin, Lianwen; Luo, Canjie

Computer Science > Computer Vision and Pattern Recognition

arXiv:1711.04249 (cs)

[Submitted on 12 Nov 2017]

Title:Feature Enhancement Network: A Refined Scene Text Detector

Authors:Sheng Zhang, Yuliang Liu, Lianwen Jin, Canjie Luo

View PDF

Abstract:In this paper, we propose a refined scene text detector with a \textit{novel} Feature Enhancement Network (FEN) for Region Proposal and Text Detection Refinement. Retrospectively, both region proposal with \textit{only} $3\times 3$ sliding-window feature and text detection refinement with \textit{single scale} high level feature are insufficient, especially for smaller scene text. Therefore, we design a new FEN network with \textit{task-specific}, \textit{low} and \textit{high} level semantic features fusion to improve the performance of text detection. Besides, since \textit{unitary} position-sensitive RoI pooling in general object detection is unreasonable for variable text regions, an \textit{adaptively weighted} position-sensitive RoI pooling layer is devised for further enhancing the detecting accuracy. To tackle the \textit{sample-imbalance} problem during the refinement stage, we also propose an effective \textit{positives mining} strategy for efficiently training our network. Experiments on ICDAR 2011 and 2013 robust text detection benchmarks demonstrate that our method can achieve state-of-the-art results, outperforming all reported methods in terms of F-measure.

Comments:	8 pages, 5 figures, 2 tables. This paper is accepted to appear in AAAI 2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1711.04249 [cs.CV]
	(or arXiv:1711.04249v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1711.04249

Submission history

From: Lianwen Jin [view email]
[v1] Sun, 12 Nov 2017 08:12:54 UTC (1,789 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Feature Enhancement Network: A Refined Scene Text Detector

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Feature Enhancement Network: A Refined Scene Text Detector

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators