Description
One of the oldest and most common ways for people to communicate knowledge is through speech. On a long-term basis, efforts have been undertaken to encourage vocally receptive PCs to recognise voice/speech synthesis. It is obvious that such a point of interaction would result in exceptional benefits. The development of intelligent systems to enhance the quality of life is the focus of current technological developments in the fields of processing images and natural language processing. An efficient method for text recognition, extraction from images, and text-to-speech conversion is proposed in this paper. Efficient methods for implementing the three modules are image to voice, text to speech, and values to speech by utilising the programming language Python is suggested in this study. Three modules’ outputs are also saved on our system’s local drive. The suggested technique is tested on images, after which the text within each image is extracted, and the image is then translated to speech using the Pytesseract function. An OCR (optical character recognition) tool for the Python is called Python-tesseract. The suggested framework has shown promising results when compared the existing method of the system
Reviews
There are no reviews yet.