Hello team,
I have a PDF that contains text in paragraphs (some of it underlined,
some of it bold) and tables. I want to read the PDF in sequential order.
How can I tell whether the section I am about to read is a table, a
paragraph, or plain text?
At the moment I am using the following code, but it seems to treat
everything as plain text, so I cannot identify the tables, the underlined
text, and so on.
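// h is assumed to be the text extractor created for reader;
// hg accumulates the extracted text of all pages.
for (int i = 1; i <= reader.getNumberOfPages(); i++)
{
    hg = hg + h.getTextFromPage(i);
    if (-1 != hg.indexOf("XYZ"))
    {
        hg = hg.replaceAll("XYZ", "ABC");
    }
}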
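
For reference, the replacement part of the loop above can be sketched without any PDF library. This is only a minimal illustration, not iText code: the class name PageTextReplacer, the method replaceInPages, and the sample page strings are hypothetical, and the strings stand in for whatever the extractor returns for each page. It replaces the text one page at a time with String.replace, which treats "XYZ" as a literal rather than a regular expression, so no indexOf check is needed.

```java
import java.util.Arrays;
import java.util.List;

class PageTextReplacer {

    // Joins the text of all pages, replacing every literal occurrence of
    // target with replacement. The page strings are hypothetical stand-ins
    // for the text an extractor would return for each page.
    static String replaceInPages(List<String> pageTexts, String target, String replacement) {
        StringBuilder result = new StringBuilder();
        for (String pageText : pageTexts) {
            // String.replace matches target as plain text (not a regex) and
            // returns the input unchanged when there is no match.
            result.append(pageText.replace(target, replacement));
        }
        return result.toString();
    }

    public static void main(String[] args) {
        List<String> pages = Arrays.asList("Intro XYZ text. ", "Table: XYZ | 42");
        System.out.println(replaceInPages(pages, "XYZ", "ABC"));
    }
}
```

One difference from the original loop: because each page is handled separately, an occurrence of "XYZ" that is split across a page boundary would not be replaced. This sketch also does nothing to distinguish tables or underlined text; it works on plain strings only.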
Can someone please help me with this?
Regards,
Swapna Matanam
Tata Consultancy Services
Mailto: [email protected]
Website: http://www.tcs.com
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions
Buy the iText book: http://www.1t3xt.com/docs/book.php
Check the site with examples before you ask questions:
http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/