Text Mining Tool – Extract Text from PDF, RTF, DOC, HTML and chm Files
Written By Samrat| 13 October 2008| No Comment
Text Mining Tool is a freeware utility which can extract the text from popular file formats like PDF, RTF, DOC, HTML and chm. To use the tool you need not install Microsoft Word or Adobe Reader. The interface is really very simple. You can also use the hotkeys to speed up your work.
So when you are looking to just extract the text from a PDF file with full of images this tool will be very helpful. There is also a command line tool which will help you with batch conversion. I tried extracting text from a PDF file with lot of images and it worked well. To use this tool you need .NET framework which can be downloaded for free.
All you have to do is just unzip Text Mining Tool and use it.