Thanks pal! I'll look into it! :)
On Tue, Dec 1, 2009 at 12:36 AM, trumpetinc2 <forum_...@trumpetinc.com> wrote:
>
> The parser can be used to get text from the PDF. See
> com.lowagie.text.pdf.parser.PdfContentReaderTool
>
> - K
>
>
> CGP-4 wrote:
>>
>> Cool! Thanks!
>> For my task, I wonder whether I should read the PDF Reference and
>> write my info-extraction program from scratch.......... -___-
>>
>> On Mon, Nov 30, 2009 at 11:48 PM, Iliadis Yannis <ilyan...@gmail.com>
>> wrote:
>>> For RTF you need the iText-rtf-2.1.7.jar.
>>>
>>> As far as text extraction from a PDF is concerned, there are quite a lot of
>>> threads in the mailing list that describe the complexity of this task.
>>> On the other hand, if all the information about the papers is stored in
>>> the PDF's metadata, then you can extract it.
>>>
>>> 2009/11/30 CGP <chenguang1...@gmail.com>
>>>>
>>>> Thanks Matthew!
>>>> I've downloaded iText 2.1.7
>>>> It is JAR'd.. but it is not complete... some classes like RtfWriter2
>>>> do not exist in the JAR.. but they are implemented in the source code..
>>>> That's why I want to recompile the source code...
>>>>
>>>> For me, the most important task is to extract information from PDF
>>>> files..
>>>> Have you guys done similar things?
>>>> I would love to hear your advice
>>>>
>>>> Thanks!
>>>>
>>>> On Mon, Nov 30, 2009 at 11:31 PM, Wain, Matthew
>>>> <matthew.w...@landregistry.gsi.gov.uk> wrote:
>>>> > If you downloaded iText 2.1.7 you will see that it is already JAR'd.
>>>> >
>>>> > -----Original Message-----
>>>> > From: CGP [mailto:chenguang1...@gmail.com]
>>>> > Sent: 30 November 2009 15:21
>>>> > To: itext-questions@lists.sourceforge.net
>>>> > Subject: [iText-questions] Can iText be used to extract information
>>>> > from PDFfiles?
>>>> >
>>>> >
>>>> > Hello guys!
>>>> > I am interested in extracting information from academic papers in PDF
>>>> > format.
>>>> > The information I wish to extract includes: paper title, author,
>>>> > publication, year, etc.
>>>> > I've started reading the book iText in Action, it is a great book, but
>>>> > I have no idea whether the book's content will help me in achieving my
>>>> > goals.
>>>> > Soooo, I need advice from you guys.
>>>> >
>>>> > Oh, BTW, how can I recompile the iText 2.1.7 source code to get a JAR
>>>> > file within Windows XP? I'm using eclipse.....
>>>> > Thanks a lot!
>>>> >
>>>> > --
>>>> > Chenguang(Rance) Pan
>>>> > School of Electronics Engineering & Computer Science
>>>> > Peking University
>>>> > Tel:(86)1358-169-4723
>>>> >
>>>> > ------------------------------------------------------------------------------
>>>> > Let Crystal Reports handle the reporting - Free Crystal Reports 2008
>>>> > 30-Day trial. Simplify your report design, integration and deployment - and
>>>> > focus on what you do best, core application coding. Discover what's new with
>>>> > Crystal Reports now. http://p.sf.net/sfu/bobj-july
>>>> > _______________________________________________
>>>> > iText-questions mailing list
>>>> > iText-questions@lists.sourceforge.net
>>>> > https://lists.sourceforge.net/lists/listinfo/itext-questions
>>>> >
>>>> > Buy the iText book: http://www.1t3xt.com/docs/book.php
>>>> > Check the site with examples before you ask questions:
>>>> > http://www.1t3xt.info/examples/
>>>> > You can also search the keywords list:
>>>> > http://1t3xt.info/tutorials/keywords/
--
Chenguang(Rance) Pan
School of Electronics Engineering & Computer Science
Peking University
Tel:(86)1358-169-4723