14 responses to “Text Mining Tool”

  1. Jespard

    Where is the link to the download page? I can’t find this on the usual freeware sites (Snapfiles, Betanews, Download, etc.)

  2. Jespard

    Never mind. If you want to download this, search the page for “Go to the program page”. (I always forget this!!!) It is kinda hard to see. Regardless, I love this site. Jespard

  3. Joe

    It’s great that it has a CLI tool. It is one thing that truly makes it more useful.

  4. David

    I would like to add my experience using it as a review.

    Pros:
    - works very well as a pdf2text or doc2text utility
    - gets text from RTF, CHM, HTML files
    - special console utility included for use from the command line interface
    - no installation is needed, just extract the tool and use it
    - hotkeys are really comfortable
    - 100% free

    Cons:
    - text extracted from HTML files is a bit unformatted (though JavaScript is stripped from it)
    - .NET Framework 2.0 is required

  5. Jeff Winterburn

    The console utility minetext.exe really impresses! It performs many conversions from PDF to text with one click (or key press :) ).

  6. Asus47

    I just can’t convert a PDF into text. It tells me it can’t find the file. I’ve already checked that the PDF file exists. Any ideas? Here is my command line: E:\ltsaua0\Archivage\minetext\minetext.exe “E:\ltsaua0\Archivage\1.pdf” “E:\ltsaua0\Archivage\mine.txt”
    Thanks
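
For a "file not found" report like the one above, one thing worth ruling out is the path itself. The sketch below is a hypothetical Python wrapper, not part of the tool: it assumes minetext.exe takes an input file and an output file as its two arguments (as in the command line quoted above), checks that both the executable and the input exist, and strips quote characters that may have been pasted in as typographic quotes, which a Windows command prompt would treat as part of the file name. The helper names `clean_path`, `build_minetext_command` and `run_minetext` are made up for illustration.

```python
import subprocess
from pathlib import Path

# Straight and typographic quote characters that can end up around a pasted path.
QUOTE_CHARS = "\"'\u201c\u201d\u2018\u2019"


def clean_path(text):
    """Remove surrounding whitespace and quote characters from a pasted path."""
    return text.strip().strip(QUOTE_CHARS)


def build_minetext_command(exe, input_file, output_file):
    """Return the argument list for one conversion, or raise if a path is missing.

    Assumes the 'minetext.exe <input> <output>' form seen in the comment above.
    """
    exe_path = Path(clean_path(exe))
    in_path = Path(clean_path(input_file))
    out_path = Path(clean_path(output_file))
    if not exe_path.is_file():
        raise FileNotFoundError(f"executable not found: {exe_path}")
    if not in_path.is_file():
        raise FileNotFoundError(f"input file not found: {in_path}")
    return [str(exe_path), str(in_path), str(out_path)]


def run_minetext(exe, input_file, output_file):
    """Run the conversion and return the completed process for inspection."""
    cmd = build_minetext_command(exe, input_file, output_file)
    # Passing a list avoids shell quoting issues with spaces in paths.
    return subprocess.run(cmd, capture_output=True, text=True)
```

If `build_minetext_command` raises, the problem is in the path rather than in the tool; if it passes and the tool still reports a missing file, the cause lies elsewhere.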

  7. os

    Downloaded the program twice, but I get a runtime error when I click on the exe; it doesn’t initialise. How do you use it then? -thanks.

  8. Olaf

    Runs fine for me but really does not like my PDFs. Bombs with this error message:

    Unhandled Exception: System.Exception: Error during text extraction from Pdf-file: ..\rates.pdf
    Error getting pdf version: java.lang.NumberFormatException: For input string: "TYP"
    at TextMiningTool.Readers.PdfFileReader.GetText(String fileName)
    at TextMiningTool.FileMaster.GetText(String inputFile)
    at MineText.Program.Main(String[] args)
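
The trace above suggests the reader tried to parse something other than a number as the PDF version. One possible (unconfirmed) cause is a file whose header is not the usual `%PDF-1.x` marker. A minimal Python sketch for checking this independently of the tool follows; the function names are hypothetical, and scanning the first 1024 bytes instead of only the very start of the file is a lenient choice made here.

```python
import re

# A standard PDF starts with a header such as b"%PDF-1.4".
HEADER_RE = re.compile(rb"%PDF-(\d+)\.(\d+)")


def pdf_version(data):
    """Return (major, minor) from a header in the first 1024 bytes, else None."""
    match = HEADER_RE.search(data[:1024])
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def read_pdf_version(path):
    """Read the start of a file in binary mode and report its PDF version."""
    with open(path, "rb") as handle:
        return pdf_version(handle.read(1024))
```

A result of `None` for a file such as rates.pdf would mean the header is missing or malformed, which would be consistent with the version-parsing error shown, though it would not prove that is what the tool tripped over.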

  9. Amox

    That is great. I was told about another converter: Nemo PDF Converter 4.0 converts PDF to Word/RTF and Word/Excel to PDF for different situations, with speed and 100% accuracy. It keeps the original files intact and supports batch conversion. You can batch convert files either from the converter or from the button integrated in your documents. Moreover, its user-friendly interface will turn a new user into a veteran in minutes.
    http://www.nemopdf.com/index.html
