Text Detection from an Image
dc.contributor.advisor | Andriamanalimanana, Bruno R.; Thesis Advisor | |
dc.contributor.advisor | Novillo, Jorge; Thesis Committee | |
dc.contributor.advisor | Spetka, Scott; Thesis Committee | |
dc.contributor.author | Goda, Piyush Jain | |
dc.date.accessioned | 2021-08-05T19:13:16Z | |
dc.date.available | 2021-08-05T19:13:16Z | |
dc.date.issued | 2020-12 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12648/2036 | |
dc.description.abstract | Recently, a variety of real-world applications have triggered a huge demand for techniques that can extract textual information from images and videos. Therefore, image text detection and recognition have become active research topics in computer vision. The current trend in object detection and localization is to learn predictions with high capacity deep neural networks trained on a very large amount of annotated data and using a high amount of processing power. In this project, I have built an approach for text detection using the object detection technique. Our approach is to deal with the text as objects. We use an object detection method, YOLO (You Only Look Once), to detect the text in the images. We frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. YOLO, a single neural network, that predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance. The MobileNet pre-trained deep learning model architecture was used and modified in different ways to find the best performing model. The goal is to achieve high accuracy in text spotting. Experiments on standard datasets ICDAR 2015 demonstrate that the proposed algorithm significantly outperforms methods in terms of both accuracy and efficiency. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | image text detection and recognition | en_US |
dc.subject | computer vision | en_US |
dc.subject | Convolution Neural Network (CNN) | en_US |
dc.subject | Object Detection | en_US |
dc.subject | YOLO (You Only Look Once) | en_US |
dc.subject | Single Shot Detector (SSD) | en_US |
dc.subject | MobileNet | en_US |
dc.subject | Python | en_US |
dc.title | Text Detection from an Image | en_US |
dc.type | Thesis | en_US |
dc.description.version | NA | en_US |
refterms.dateFOA | 2021-08-05T19:13:17Z | |
dc.description.institution | SUNY Polytechnic Institute | en_US |
dc.description.department | College of Engineering | en_US |
dc.description.degreelevel | MS | en_US |
Files in this item
This item appears in the following Collection(s)
-
SUNY Polytechnic Institute College of Engineering
This collection contains master's theses, capstone projects, and other student and faculty work from programs within the Department of Engineering, including computer science and network security.