MiMa (6)
I have written a method that returns the text extracted from a PDF file.

public static String ExtractionText(File arrayDocument) throws IOException, TikaException {
    // The Tika facade detects the document type and returns the text as one String
    Tika tika = new Tika();
    String textExtract = tika.parseToString(arrayDocument);
    return textExtract;
}

In main, the following lines call the method to extract the text:

// Extract Text from Array Document
String textExtract = PDF.ExtractionText(arrayDocument);

It works, but I always get these messages:

log4j:WARN No appenders could be found for logger (org.apache.pdfbox.util.PDFStreamEngine).
log4j:WARN Please initialize the log4j system properly.

I don't know what these warnings are trying to tell me.

Thanks, Michael

MiMa (6)
Re: Text extraction with Warning
OK, I have read that string extraction accepts no more than 100,000 characters.
My document has more than that.

Could the warnings be about that?
Do I then have to use a reader to extract this text?


chris.mattmann (14)
Re: Text extraction with Warning
Hi MiMa,

Yep, please use a reader that will take care of it.
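
A rough sketch of what the reader approach could look like, in case it helps. The Tika facade has a parse(File) method that returns a java.io.Reader instead of a String, so the text can be read in chunks rather than through parseToString. The class name ReaderExtraction and the helper readAll are made up for this illustration, and a StringReader stands in for the Tika reader so the sketch runs without Tika on the classpath.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;

// Hypothetical helper class for illustration; not part of Tika.
class ReaderExtraction {

    // Drains any Reader into a String in fixed-size chunks and closes it.
    static String readAll(Reader reader) throws IOException {
        StringBuilder text = new StringBuilder();
        char[] buffer = new char[8192];
        int read;
        try (Reader in = reader) {
            while ((read = in.read(buffer)) != -1) {
                text.append(buffer, 0, read);
            }
        }
        return text.toString();
    }

    public static void main(String[] args) throws IOException {
        // With Tika available, the reader would come from the facade instead:
        //   Reader reader = new Tika().parse(arrayDocument);
        // Here a StringReader with 250,000 characters stands in for it.
        StringBuilder sample = new StringBuilder();
        for (int i = 0; i < 250_000; i++) {
            sample.append('x');
        }
        String text = readAll(new StringReader(sample.toString()));
        System.out.println(text.length()); // prints 250000
    }
}
```

For very large documents it is probably better to process each chunk as it is read rather than collecting everything into one String.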