COMPARISON OF THRESHOLDING OTSU AND MORPHOLOGY OPENING PREPROCESSOR FOR IMAGE TEXT DETECTION USING TESSERACT

Oei Hizkia Renat Guntur Sugiarto, Yonathan Purbo Santosa

Abstract


As documents become increasingly digital, image processing has become essential, and one important branch is the processing of text in images. Optical character recognition (OCR) with Tesseract converts the text in an image into machine-readable text. Many online applications offer this service, but using them on personal documents can be risky, so an offline application is the safer choice. The program described here detects and recognizes the text in an image and saves the result in .txt format, which makes later data processing easier. Text detection relies on Tesseract, an open-source OCR engine used here from Python. Because Tesseract is very sensitive to noise in the image, the image must be preprocessed first. This study applies two preprocessing methods, binary thresholding with Otsu's method and dilation, and compares them to determine which gives the more accurate result when the image is passed to Tesseract. The final program detects the text in the image word by word, rather than sentence by sentence, and saves the detected text.
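The two preprocessing steps named in the title can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the function names, the 3x3 structuring element, the treatment of bright pixels as foreground, and the handling of image borders are all assumptions made for illustration. The title refers to morphological opening (erosion followed by dilation) while the abstract mentions dilation, so the sketch shows opening, which contains dilation as its second step.

```python
def otsu_threshold(pixels):
    """Return the 8-bit threshold that maximises between-class variance."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    w_bg = 0          # number of pixels with value <= t
    sum_bg = 0        # sum of those pixel values
    best_t, best_var = 0, -1.0
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        w_fg = total - w_bg
        if w_bg == 0:
            continue
        if w_fg == 0:
            break
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        var = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var > best_var:
            best_t, best_var = t, var
    return best_t


def binarize(img, t):
    """Map pixels above t to 1 (foreground, by assumption) and the rest to 0."""
    return [[1 if p > t else 0 for p in row] for row in img]


def _neighbourhood(img, r, c):
    """Values in the 3x3 window around (r, c); out-of-bounds cells count as 0."""
    h, w = len(img), len(img[0])
    return [
        img[rr][cc] if 0 <= rr < h and 0 <= cc < w else 0
        for rr in (r - 1, r, r + 1)
        for cc in (c - 1, c, c + 1)
    ]


def erode(img):
    """A pixel survives only if its whole 3x3 window is foreground."""
    return [[int(all(_neighbourhood(img, r, c))) for c in range(len(img[0]))]
            for r in range(len(img))]


def dilate(img):
    """A pixel becomes foreground if any pixel in its 3x3 window is foreground."""
    return [[int(any(_neighbourhood(img, r, c))) for c in range(len(img[0]))]
            for r in range(len(img))]


def opening(img):
    """Morphological opening: erosion followed by dilation."""
    return dilate(erode(img))
```

In a real pipeline the cleaned binary image would then be handed to Tesseract, for example through a Python wrapper such as pytesseract; that step is left out here so the sketch runs without external libraries.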


Keywords


Opening; Threshold Otsu; Preprocessing; Image Processing; Text Detection

DOI: https://doi.org/10.24167/proxies.v5i2.12449

Copyright (c) 2024 Proxies : Jurnal Informatika


