MiMa (6) [Avatar] Offline
#1
Hi,
i have written a method to return extraction text from an PDF file.

public static String ExtractionText(File arrayDocument) throws IOException, TikaException
{
Tika tika = new Tika();
String textExtract = tika.parseToString(arrayDocument);
return textExtract;
}


in Main are follow lines to extract text for the method

...
// Extract Text from Array Document
String textExtract = PDF.ExtractionText(arrayDocument);
System.out.println(textExtract);
...


it will be work but ever i get Messges:

log4j:WARN No appenders could be found for logger (org.apache.pdfbox.util.PDFStreamEngine).
log4j:WARN Please initialize the log4j system properly.

In dont know what are the Warings will tell.

thanks Michael

Message was edited by:
MiMa
MiMa (6) [Avatar] Offline
#2
Re: Text extraction with Warning
Ok, i had read that String Exraction no more accept as 100.000 characters.
My document have more as it can.

Will be the error abut that.
Then i must use an reader for extraction these text?

Mi

Message was edited by:
MiMa
chris.mattmann (14) [Avatar] Offline
#3
Re: Text extraction with Warning
Hi MiMa,

Yep, please use a reader that will take care of it.

Cheers,
Chris