标记: [位置感知], [文字识别]
[位置感知] 从场景图片中检测出文字所在的区域
[文字识别] 识别出文字区域中的文字内容
最近更新时间:2018-08-12
- [2016-TIP] Text Detection Tracking and Recognition in Video: A Comprehensive Survey
论文
- [2015-PAMI] Text Detection and Recognition in Imagery: A Survey
论文
- [2014-Front.Comput.Sci] Scene Text Detection and Recognition: Recent Advances and Future Trends
论文
- [2016-IJCV][位置感知][文字识别] Reading Text in the Wild with Convolutional Neural Networks
论文
样例
主页
- [2016-CVPR][位置感知] Synthetic Data for Text Localisation in Natural Images
论文
代码
数据
- [2015-ICLR][文字识别] Deep structured output learning for unconstrained text recognition
论文
- [2015-PhD Thesis][位置感知] Deep Learning for Text Spotting
论文
代码
- [2014-ECCV][位置感知] Deep Features for Text Spotting
论文
代码
模型
GitXiv
- [2014-NIPS][文字识别] Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition
论文
主页
模型
- [2016-ECCV][位置感知] CTPN: Detecting Text in Natural Image with Connectionist Text Proposal Network
论文
代码
- [2016-CVPR][位置感知] Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network
论文
- [2016-AAAI][位置感知][文字识别] Reading Scene Text in Deep Convolutional Sequences
论文
- [2016-TIP][位置感知] Text-Attentional Convolutional Neural Networks for Scene Text Detection
论文
- [2014-ECCV][位置感知] Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees
论文
- [2018-AAAI][位置感知] Feature Enhancement Network: A Refined Scene Text Detector
论文
- [2017-arXiv][位置感知] Detecting Curve Text in the Wild: New Dataset and New Solution
论文
- [2017-TPAMI][文字识别] Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition
论文
- [2017-CVPR][位置感知] Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection
论文
- [2016-arXiv][位置感知][文字识别] DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
论文
- [2018-CVPR][文字识别] Edit Probability for Scene Text Recognition
论文
- [2017-arXiv][位置感知] Arbitrary-Oriented Scene Text Detection via Rotation Proposals
论文
- [2018-ICIP][位置感知] Feature Fusion Network for Scene Text Detection
- [2018-CVPR][位置感知] Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
论文
- [2018-CVPR][位置感知] Rotation-sensitive Regression for Oriented Scene Text Detection
论文
- [2018-TIP][位置感知] TextBoxes++: A Single-Shot Oriented Scene Text Detector
论文
代码
- [2017-AAAI][位置感知] TextBoxes: A Fast TextDetector with a Single Deep Neural Network
论文
代码
- [2017-CVPR][位置感知] Detecting Oriented Text in Natural Images by Linking Segments
论文
- [2016-CVPR][文字识别] Robust scene text recognition with automatic rectification
论文
- [2016-arXiv][位置感知] Scene Text Detection via Holistic, Multi-Channel Prediction
论文
- [2016-CVPR][位置感知] Multi-oriented text detection with fully convolutional networks
论文
- [2015-TPAMI][文字识别] An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
论文
代码
代码
- [2015-CVPR][位置感知] Symmetry-Based Text Line Detector in Natural Scenes
论文
代码
- [2015-ICDAR][文字识别] Automatic Script Identification in the Wild
论文
- [2017-arXiv][位置感知] Improving Text Proposal for Scene Images with Fully Convolutional Networks
论文
- [2016-arXiv][位置感知] TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild
论文
代码
- [2015-ICDAR][位置感知] Object Proposals for Text Extraction in the Wild
论文
代码
- [2014-TPAMI][文字识别] Word Spotting and Recognition with Embedded Attributes
论文
主页
代码
- [2012-ICPR][文字识别] End-to-End Text Recognition with Convolutional Neural Networks
论文
代码
SVHN 数据集
- [2012-PhD thesis][文字识别] End-to-End Text Recognition with Convolutional Neural Networks
论文
- [2017-AAAI][位置感知][文字识别] Detection and Recognition of Text Embedding in Online Images via Neural Context Models
论文
- [2017-arXiv][位置感知] Deep Direct Regression for Multi-Oriented Scene Text Detection
论文
- [2016-CVPR][文字识别] Recursive Recurrent Nets with Attention Modeling for OCR in the Wild
论文
- [2017-arXiv][位置感知] Cascaded Segmentation-Detection Networks for Word-Level Text Spotting
论文
- [2016-arXiv][位置感知][文字识别] COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
论文
- [2016-PhD Thesis][位置感知] Context Modeling for Semantic Text Matching and Scene Text Detection
论文
- [2016-IJCAI][位置感知] Scene Text Detection in Video by Learning Locally and Globally
论文
- [2014-TPAMI][文字识别] Robust Text Detection in Natural Scene Images
论文
- [2016-CVPR][位置感知] CannyText Detector: Fast and Robust Scene Text Localization Algorithm
论文
- [2016-IJDAR][位置感知] TextCatcher: a method to detect curved and challenging text in natural scenes
论文
- [2017-ICCV][位置感知][文字识别] Deep TextSpotter: An End-to-End Trainable Scene Text Localization and
Recognition Framework
论文
代码
- [2015-TPAMI][位置感知][文字识别] Real-time Lexicon-free Scene Text Localization and Recognition
论文
- [2015-ICCV][位置感知] FASText: Efficient unconstrained scene text detector
论文
代码
- [2012-CVPR][位置感知][文字识别] Real-time scene text localization and recognition
论文
代码
- [2013-ICCV][位置感知][文字识别] Photo OCR: Reading Text in Uncontrolled Conditions
论文
- [2017-ICCV][位置感知] WordSup: Exploiting Word Annotations for Character based Text Detection
论文
- [2010-CVPR][位置感知] SWT: Detecting Text in Natural Scenes with Stroke Width Transform
论文
代码
- [2017-arXiv][位置感知] R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection
论文
- [2016-NIPS][文字识别] Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data
论文
- [2013-CVPR][文字识别] Scene Text Recognition using Part-based Tree-structured Character Detection
论文
- [2012-CVPR][文字识别] top-down and bottom-up cues for scene text recognition
论文
- [2017-ICCV][位置感知] WeText: Scene Text Detection under Weak Supervision
论文
- [2017-ICCV][位置感知] Self-organized Text Detection with Minimal Post-processing via Border Learning
论文
- [2017-ICCV][文字识别] Focusing Attention: Towards Accurate Text Recognition in Natural Images
论文
- [2017-ICCV][位置感知][文字识别] Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks
论文
- [2017-CVPR][位置感知] Unambiguous Text Localization and Retrieval for Cluttered Scenes
论文
- [2018-AAAI][文字识别] Char-Net: A Character-Aware Neural Network for Distorted Scene Text
论文
- [2018-AAAI][位置感知] PixelLink: Detecting Scene Text via Instance Segmentation
论文
- [2018-AAAI][文字识别] SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional
Encoder-decoder Network
论文
- [2018-CVPR][位置感知] Geometry-Aware Scene Text Detection with Instance Transformation Network
论文
- [2018-CVPR][位置感知] Learning Markov Clustering Networks for Scene Text Detection
论文
- [2018-IJCAI][位置感知] IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection
paper
- [2018-ICIP][位置感知] Dense Chained Attention Network for Scene Text Recognition
- [2018-ICIP][STL] Focal Text: An Accurate Text Detection With Focal Loss
21,384 图片, 21,384+ 文本标记
用途: 文本感知,识别
63,686 图片, 173,589 文本标记, 3 细粒度文本参数.
用途: 文本感知,识别
涵括9万英语单词的的9百万张图片
用途: 文本识别,分割
街景数字位置定位与识别数据库。73257张训练图,26032测试图,531131额外功用图
用途: 数字文本位置定为, 数字文字识别
IIIT 5K-Words
2012
5000张带文本的场景 (2000张用于训练,3000张用于测试)
文本图片都被裁剪出来并标记出相应的大小写敏感的文本
用途: 文本识别
包含62个字符的小尺寸图片 (0-9, a-z, A-Z) 每张图只包含少量字符
用途: 文本识别
500张自然场景图片 (图片大小从 1296x864 到 1920x1280不等)
中文英文及其混合的图片
用途: 文本感知
350高分辨率图片 (平均尺寸为 1260 × 860) (100 用于训练 and 250 用于测试)
提供文本区域坐标以及其文本相应的字符
用途: 文本感知
3000包含文本的室内室外场景图片
包含韩文,英文,数字及其三者混合
用途: 文本感知,识别,分割
Chars74k
2009
74000张从自然场景提取出来的包含字符(0-9, a-z, A-Z)的图片, 包含通过对称生成的字符图片,每张图只包含少量字符
包含62个字符的小尺寸图片 (0-9, a-z, A-Z)
用途: 文本识别
数据集 | 描述 | 相应论文 |
---|---|---|
ICDAR 2015 | 1000张训练图片和500测试图片 | 论文 |
ICDAR 2013 | 229张训练图片和233张测试图片 | 论文 |
ICDAR 2011 | 229张训练图片和255张测试图片 | 论文 |
ICDAR 2005 | 1001张训练图片和489张测试图片 | 论文 |
ICDAR 2003 | 181张训练图片和251张测试图片(包含词以及字符层级标记) | 论文 |
名称 | 描述 |
---|---|
Tesseract OCR | 有API,免费 |
Online OCR | 有API,免费 |
Free OCR | 有API,免费 |
New OCR | 有API,免费 |
ABBYY FineReader Online | 无API,收费 |
在线超级转换工具 | 无API,免费 |
在线中文识别 | 有API,免费 |
- Scene Text Detection with OpenCV 3
- Handwritten numbers detection and recognition
- Applying OCR Technology for Receipt Recognition
- Convolutional Neural Networks for Object(Car License) Detection
- Extracting text from an image using Ocropus
- Number plate recognition with Tensorflow
github
- Using deep learning to break a Captcha system
研究报告
github
- Breaking reddit captcha with 96% accuracy
github
- 文字检测与识别资源-1
- 文字的检测与识别资源-2