Thanks pal!
I'll look into it! :)

On Tue, Dec 1, 2009 at 12:36 AM, trumpetinc2 <forum_...@trumpetinc.com> wrote:
>
> The parser can be used to get text from the PDF.  See
> com.lowagie.text.pdf.parser.PdfContentReaderTool
>
> - K
>
>
> CGP-4 wrote:
>>
>> Cool! Thanks!
>> For my task, I doubt that whether should I read the PDF Reference and
>> write my info-extraction program from scratch.......... -___-
>>
>> On Mon, Nov 30, 2009 at 11:48 PM, Iliadis Yannis <ilyan...@gmail.com>
>> wrote:
>>> For Rtf you need the iText-rtf-2.1.7.jar.
>>>
>>> As far as concerning text extraction from a PDF, there are quite a lot of
>>> threads in the mailing list that state the complexity of this task.
>>> On the other hand if all the information about the papers is stored in
>>> the
>>> pdf's metadata then you can extract them.
>>>
>>> 2009/11/30 CGP <chenguang1...@gmail.com>
>>>>
>>>> Thanks Matthew!
>>>> I've downloaded iText 2.1.7
>>>> It is JAR'd.. but it is not complete... some classes like RtfWriter2
>>>> does not exist in the JAR.. but it is implemented in the source code..
>>>> That's why I want to recompile the source code...
>>>>
>>>> For me, the most important task is to extract information from PDF
>>>> files..
>>>> Have you guys done similar things?
>>>> I would love to hear your advice
>>>>
>>>> Thanks!
>>>>
>>>> On Mon, Nov 30, 2009 at 11:31 PM, Wain, Matthew
>>>> <matthew.w...@landregistry.gsi.gov.uk> wrote:
>>>> > If you downloaded iText 2.1.7 you will see that's it is already JAR'd.
>>>> >
>>>> > -----Original Message-----
>>>> > From: CGP [mailto:chenguang1...@gmail.com]
>>>> > Sent: 30 November 2009 15:21
>>>> > To: itext-questions@lists.sourceforge.net
>>>> > Subject: [iText-questions] Can iText be used to extract information
>>>> from
>>>> > PDFfiles?
>>>> >
>>>> >
>>>> > Hello guys!
>>>> > I am interested in extract information from academic papers in PDF
>>>> > format.
>>>> > The information I wish to extract include: paper title, author,
>>>> > publication, year, etc.
>>>> > I've started reading the book iText in Action, it is a great book, but
>>>> > I have no idea whether the books content will help me in achieving my
>>>> > goals.
>>>> > Soooo, I need advice from you guys.
>>>> >
>>>> > Oh, BTW, how can I recompile the iText 2.1.7 source code to get a JAR
>>>> > file within Windows XP?  I'm using eclipse.....
>>>> > Thanks a lot!
>>>> >
>>>> > --
>>>> > Chenguang(Rance) Pan
>>>> > School of Electronics Engineering & Computer Science
>>>> > Peking University
>>>> > Tel:(86)1358-169-4723
>>>> >
>>>> >
>>>> >
>>>> ------------------------------------------------------------------------------
>>>> > Let Crystal Reports handle the reporting - Free Crystal Reports 2008
>>>> > 30-Day
>>>> > trial. Simplify your report design, integration and deployment - and
>>>> > focus on
>>>> > what you do best, core application coding. Discover what's new with
>>>> > Crystal Reports now.  http://p.sf.net/sfu/bobj-july
>>>> > _______________________________________________
>>>> > iText-questions mailing list
>>>> > iText-questions@lists.sourceforge.net
>>>> > https://lists.sourceforge.net/lists/listinfo/itext-questions
>>>> >
>>>> > Buy the iText book: http://www.1t3xt.com/docs/book.php
>>>> > Check the site with examples before you ask questions:
>>>> > http://www.1t3xt.info/examples/
>>>> > You can also search the keywords list:
>>>> > http://1t3xt.info/tutorials/keywords/
>>>> >
>>>> > This email was received from the INTERNET and scanned by the
>>>> Government
>>>> > Secure Intranet anti-virus service supplied by Cable&Wireless in
>>>> partnership
>>>> > with MessageLabs. (CCTM Certificate Number 2009/09/0052.) In case of
>>>> > problems, please call your organisation's IT Helpdesk.
>>>> > Communications via the GSi may be automatically logged, monitored
>>>> and/or
>>>> > recorded for legal purposes.
>>>> >
>>>> > Land Registry's House Price Index is now live. www.landregistry.gov.uk
>>>> >
>>>> > If you have received this e-mail and it was not intended for you,
>>>> please
>>>> > let us know, and then delete it. Please treat our communications in
>>>> > confidence, as you would expect us to treat yours. Land Registry
>>>> checks all
>>>> > mail and attachments for known viruses, however, you are advised that
>>>> you
>>>> > open any attachments at your own risk.
>>>> >
>>>> >
>>>> >
>>>> > The original of this email was scanned for viruses by the Government
>>>> > Secure Intranet virus scanning service supplied by Cable&Wireless in
>>>> > partnership with MessageLabs. (CCTM Certificate Number 2009/09/0052.)
>>>> On
>>>> > leaving the GSi this email was certified virus free.
>>>> > Communications via the GSi may be automatically logged, monitored
>>>> and/or
>>>> > recorded for legal purposes.
>>>> >
>>>>
>>>>
>>>>
>>>> --
>>>> Chenguang(Rance) Pan
>>>> School of Electronics Engineering & Computer Science
>>>> Peking University
>>>> Tel:(86)1358-169-4723
>>>>
>>>>
>>>> ------------------------------------------------------------------------------
>>>> Let Crystal Reports handle the reporting - Free Crystal Reports 2008
>>>> 30-Day
>>>> trial. Simplify your report design, integration and deployment - and
>>>> focus
>>>> on
>>>> what you do best, core application coding. Discover what's new with
>>>> Crystal Reports now.  http://p.sf.net/sfu/bobj-july
>>>> _______________________________________________
>>>> iText-questions mailing list
>>>> iText-questions@lists.sourceforge.net
>>>> https://lists.sourceforge.net/lists/listinfo/itext-questions
>>>>
>>>> Buy the iText book: http://www.1t3xt.com/docs/book.php
>>>> Check the site with examples before you ask questions:
>>>> http://www.1t3xt.info/examples/
>>>> You can also search the keywords list:
>>>> http://1t3xt.info/tutorials/keywords/
>>>
>>>
>>
>>
>>
>> --
>> Chenguang(Rance) Pan
>> School of Electronics Engineering & Computer Science
>> Peking University
>> Tel:(86)1358-169-4723
>>
>> ------------------------------------------------------------------------------
>> Let Crystal Reports handle the reporting - Free Crystal Reports 2008
>> 30-Day
>> trial. Simplify your report design, integration and deployment - and focus
>> on
>> what you do best, core application coding. Discover what's new with
>> Crystal Reports now.  http://p.sf.net/sfu/bobj-july
>> _______________________________________________
>> iText-questions mailing list
>> iText-questions@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/itext-questions
>>
>> Buy the iText book: http://www.1t3xt.com/docs/book.php
>> Check the site with examples before you ask questions:
>> http://www.1t3xt.info/examples/
>> You can also search the keywords list:
>> http://1t3xt.info/tutorials/keywords/
>>
>>
>
> --
> View this message in context: 
> http://old.nabble.com/Re%3A-Can-iText-be-used-to-extract-information-from-PDFfiles--tp26576644p26577072.html
> Sent from the iText - General mailing list archive at Nabble.com.
>
>
> ------------------------------------------------------------------------------
> Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day
> trial. Simplify your report design, integration and deployment - and focus on
> what you do best, core application coding. Discover what's new with
> Crystal Reports now.  http://p.sf.net/sfu/bobj-july
> _______________________________________________
> iText-questions mailing list
> iText-questions@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/itext-questions
>
> Buy the iText book: http://www.1t3xt.com/docs/book.php
> Check the site with examples before you ask questions: 
> http://www.1t3xt.info/examples/
> You can also search the keywords list: http://1t3xt.info/tutorials/keywords/
>



-- 
Chenguang(Rance) Pan
School of Electronics Engineering & Computer Science
Peking University
Tel:(86)1358-169-4723

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.1t3xt.com/docs/book.php
Check the site with examples before you ask questions: 
http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/

Reply via email to