I used only your samples, together with the Split Tool in the iText Toolbox..
I think you've isolated the problem to the text-extraction tool you're using, as the text you're seeking to extract clearly appears in all of the split PDFs. Summary: It's not an iText problem; and therefore, does not belong on this list. Best regards, Bill Segraves ----- Original Message ---- From: rosette <rose...@arx.com> To: itext-questions@lists.sourceforge.net Sent: Tuesday, October 6, 2009 2:50:00 AM Subject: Re: [iText-questions] Splitting by Itext BTW Bill, in the attachement you can dind my split file (it is the regular sample), can you tell me if you use the same sample ? Thanks again, rosette rosette wrote: > > > Hi Bill, > > Thanks, but the problem exist also with the files you sent. > I can not extract text from test_split4.pdf > But I sucess to extract from test_split5.pdf > Rosette > > > wasegraves wrote: >> >> I had already split again with the iText Toolbox's Split tool. Here are >> the two files. >> >> I suspect your probelm is caused by the "split" function of whatever >> software you're using. If you can extract the text from these, it >> confirms my conclusion; and you should report the problem to the provider >> of the refective software, not to the iText list. >> >> ----- Original Message ---- >> From: rosette <rose...@arx.com> >> To: itext-questions@lists.sourceforge.net >> Sent: Monday, October 5, 2009 3:10:30 AM >> Subject: Re: [iText-questions] Splitting by Itext >> >> >> Hi Bill, >> >> Yes, I can extract the document that you attached (test_split3.pdf). >> However, if you split again, then the extraction fail in the first >> document >> and sucess in the second. >> >> This is exactly my problem! if you know about a tool that can extract the >> information in the second split (the first document and the second), I'll >> be >> happy to know and test it. >> >> Thanks, >> >> Rosette >> >> >> >> wasegraves wrote: >>> >>> I tried your experiment with the Split tool in the iText Toolbox, >>> splitting once at page 2, then splitting the resulting two-page second >>> part again at page 2. Visual examination of the resulting pages suggests >>> there would be no problem with text extraction, using my own primitive >>> text extraction tool. >>> >>> Please try using PDFBox to extract the text from the attached two-page >>> file and see what happens. If all of the text is extracted, I think >>> we've >>> isolated your problem to PDFBox, rather than it's being an iText >>> problem.. >>> >>> Best regards, >>> Bill Segraves >>> >>> >>> >>> ----- Original Message ---- >>> From: rosette <rose...@arx.com> >>> To: itext-questions@lists.sourceforge.net >>> Sent: Sunday, October 4, 2009 2:21:41 AM >>> Subject: Re: [iText-questions] Splitting by Itext >>> >>> >>> Hi all, >>> >>> In the attachment you can find the following documents: >>> 1. The full document (test_split1.pdf) - 3 pages >>> The test_split1.pdf was splited to two documents by iText :Doc_0.PDF (1 >>> page >>> - you can see it also in the attachment) and to other document which >>> contains the rest 2 pages. I can extract the text from Doc_0.PDF by >>> iText. >>> >>> 2. I'm taking now the file that contains the 2 last pages and I'm >>> spliting >>> it again by iText - I failed here. >>> You can see the file in (Doc_1.PDF). >>> >>> PDFBox says that extract sucessed but it returns empty string! >>> >>> Any help will be appreciated. >>> >>> Thanks, >>> >>> Rosette >>> >>> >>> >>> >>> rosette wrote: >>>> >>>> Hi, >>>> >>>> I have a PDF file that I'm spliting by Itext, it works well! >>>> My problem begins when I want to extract a text from the PDF that was >>>> cerated by iText. >>>> >>>> If I have a PDF document with 10 pages and I split this document to 2 >>>> documents.. >>>> >>>> The first from pages 1-3 and the second is from 4-10, I'll be able to >>>> extract the text from both documents. >>>> But if I take the first or the second document and I'll split it again >>>> and >>>> then I'll try to extract the text , it will fail. >>>> Since IText can't extract text, I investigated the problem with couple >>>> API >>>> and it seems that the core of the problem is when I'm spliting by Itext >>>> files that the origin were also splited by IText. >>>> >>>> Please let me know if you have any idea how to solve this problem. >>>> >>>> Rosette >>>> >>> http://www.nabble.com/file/p25734860/test_split1.pdf test_split1.pdf >>> http://www..nabble.com/file/p25734860/test_split1.pdf test_split1.pdf >>> http://www.nabble.com/file/p25734860/Doc_0.PDF Doc_0.PDF >>> http://www.nabble.com/file/p25734860/Doc_1.PDF Doc_1.PDF >>> -- >>> View this message in context: >>> http://www.nabble.com/Splitting-by-Itext-tp25695119p25734860.html >>> Sent from the iText - General mailing list archive at Nabble.com. >>> >>> >>> ------------------------------------------------------------------------------ >>> Come build with us! The BlackBerry® Developer Conference in SF, CA >>> is the only developer event you need to attend this year. Jumpstart your >>> developing skills, take BlackBerry mobile applications to market and >>> stay >>> ahead of the curve. Join us from November 9-12, 2009. Register now! >>> http://p.sf.net/sfu/devconf >>> _______________________________________________ >>> iText-questions mailing list >>> iText-questions@lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/itext-questions >>> >>> Buy the iText book: http://www.1t3xt.com/docs/book..php >>> Check the site with examples before you ask questions: >>> http://www.1t3xt.info/examples/ >>> You can also search the keywords list: >>> http://1t3xt.info/tutorials/keywords/ >>> >>> >>> ------------------------------------------------------------------------------ >>> Come build with us! The BlackBerry® Developer Conference in SF, CA >>> is the only developer event you need to attend this year. Jumpstart your >>> developing skills, take BlackBerry mobile applications to market and >>> stay >>> ahead of the curve. Join us from November 9-12, 2009. Register >>> now! >>> http://p.sf.net/sfu/devconf >>> _______________________________________________ >>> iText-questions mailing list >>> iText-questions@lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/itext-questions >>> >>> Buy the iText book: http://www.1t3xt.com/docs/book.php >>> Check the site with examples before you ask questions: >>> http://www.1t3xt.info/examples/ >>> You can also search the keywords list: >>> http://1t3xt.info/tutorials/keywords/ >>> >> >> -- >> View this message in context: >> http://www.nabble.com/Splitting-by-Itext-tp25695119p25746068.html >> Sent from the iText - General mailing list archive at Nabble.com. >> >> >> ------------------------------------------------------------------------------ >> Come build with us! The BlackBerry® Developer Conference in SF, CA >> is the only developer event you need to attend this year. Jumpstart your >> developing skills, take BlackBerry mobile applications to market and stay >> ahead of the curve. Join us from November 9-12, 2009. Register now! >> http://p.sf.net/sfu/devconf >> _______________________________________________ >> iText-questions mailing list >> iText-questions@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/itext-questions >> >> Buy the iText book: http://www.1t3xt.com/docs/book.php >> Check the site with examples before you ask questions: >> http://www.1t3xt.info/examples/ >> You can also search the keywords list: >> http://1t3xt.info/tutorials/keywords/ >> >> >> ------------------------------------------------------------------------------ >> Come build with us! The BlackBerry® Developer Conference in SF, CA >> is the only developer event you need to attend this year. Jumpstart your >> developing skills, take BlackBerry mobile applications to market and stay >> ahead of the curve. Join us from November 9-12, 2009. Register >> now! >> http://p.sf.net/sfu/devconf >> _______________________________________________ >> iText-questions mailing list >> iText-questions@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/itext-questions >> >> Buy the iText book: http://www.1t3xt.com/docs/book.php >> Check the site with examples before you ask questions: >> http://www.1t3xt.info/examples/ >> You can also search the keywords list: >> http://1t3xt.info/tutorials/keywords/ >> > > http://www.nabble.com/file/p25763725/Split.cs Split.cs -- View this message in context: http://www.nabble.com/Splitting-by-Itext-tp25695119p25763725.html Sent from the iText - General mailing list archive at Nabble.com. ------------------------------------------------------------------------------ Come build with us! The BlackBerry® Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9-12, 2009. Register now! http://p.sf.net/sfu/devconf _______________________________________________ iText-questions mailing list iText-questions@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/itext-questions Buy the iText book: http://www.1t3xt.com/docs/book..php Check the site with examples before you ask questions: http://www.1t3xt.info/examples/ You can also search the keywords list: http://1t3xt.info/tutorials/keywords/ ------------------------------------------------------------------------------ Come build with us! The BlackBerry® Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9-12, 2009. Register now! http://p.sf.net/sfu/devconf _______________________________________________ iText-questions mailing list iText-questions@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/itext-questions Buy the iText book: http://www.1t3xt.com/docs/book.php Check the site with examples before you ask questions: http://www.1t3xt.info/examples/ You can also search the keywords list: http://1t3xt.info/tutorials/keywords/