Re: Html to Pdf

2014-09-05 Thread Alin Mazilu
Since we are suggesting alternatives, I use iText for converting HTML into PDF. Here is an example: http://www.rgagnon.com/javadetails/java-html-to-pdf-using-itext.html Hope that helps, Alin On Fri, Sep 5, 2014 at 1:50 PM, John Hewson wrote: > Rendering HTML is very complex, you basically nee

Re: Problem with processTextPosition

2014-05-17 Thread Alin Mazilu
inates? > > > On Sat, May 17, 2014 at 12:18 PM, Alin Mazilu wrote: > > What are the x and y coordinates of H and W? > > > > Alin Mazilu > > SKE GlobalTech, LLC > > 3250 West Market St. Suite 307D > > Fairlawn, OH 44333 > > > > Sent from my G

Re: Problem with processTextPosition

2014-05-17 Thread Alin Mazilu
What are the x and y coordinates of H and W? Alin Mazilu SKE GlobalTech, LLC 3250 West Market St. Suite 307D Fairlawn, OH 44333 Sent from my Galaxy S3 On May 17, 2014 2:42 AM, "DImuthu Upeksha" wrote: > Hi all, > > I was tying to manually feed text position objects to &g

Re: PDF file characters x and y coordinates

2014-05-16 Thread Alin Mazilu
I process about 2000 PDF files daily and I never had had an issue with the coordinates. One piece of advise though: write your own TextPositionComparator. ~Alin On Fri, May 16, 2014 at 8:39 AM, Simer P wrote: > I just needed to confirm this with you guys. > > Can the X and Y coordinates return

Re: PDFTextPositions

2014-04-02 Thread Alin Mazilu
t; To Search for Text, I used a method prescribed in > > http://www.programming-free.com/2012/11/simple-word-search-in-pdf-files-using.html > . > > Is there a easier way to Search for Text as well? > > Are there no direct APIs? > > Thank you, > Sireesha > > > On

Re: PDFTextPositions

2014-04-02 Thread Alin Mazilu
You have to extend the PDFTextStripper class and override the processTextPosition(...) method. From there the logic depends on you. You can also override the writePage() method to grab the charactersByArticle Vector and then you would look for your words in there by iterating over it. Basically in

Re: Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
In the case of PDFTextStripper, there is a strong use case for using a > protected field, > because overriding it is the primary mechanism for custom text extraction. > > Cheers > > -- John > > On 13 Mar 2014, at 10:40, Alin Mazilu wrote: > > > Ok, I will try. In m

Re: Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
rovide null as buffer parameter). > > > Best, > Timo > > > Am 13.03.2014 16:46, schrieb Alin Mazilu: > > Where? Here's the code that causes that: >> >> PDFMergeUtility util = new PDFMergeUtility(); >> >> for (File file : set) { >>

Re: Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
ar 13, 2014 at 11:27 AM, Maruan Sahyoun wrote: > Hi, > > not a direct answer to your question but could you try > PDDocument.loadNonSeq instead? > > BR > Maruan Sahyoun > > > Am 13.03.2014 um 16:16 schrieb Alin Mazilu : > > > > Hello guys, > > >

Problem With MergeUtility

2014-03-13 Thread Alin Mazilu
Hello guys, Has anyone had any problem with this? Any idea why it happens? What would be a good value for pushBackSize so this does not happen? Thanks! Partial stack trace: org.apache.pdfbox.exceptions.WrappedIOException: Could not push back 72940 bytes in order to reparse stream. Try increas

Re: Need JBIG2 test image

2014-03-12 Thread Alin Mazilu
I have a scanned accident police reports that have people names, addresses and phone numbers in them. I had a problem printing these files with pdfbox and I had to improvise by using a command prompt print utility as a Process. I could maybe give you one if you agree not to release it to the public

Re: Regarding pdf data extraction

2014-03-03 Thread Alin Mazilu
I don't think that class can help you... All you need is the PDFTextStripper class... On Mon, Mar 3, 2014 at 7:15 PM, Divya Muttineni wrote: > I am trying to convert the tabular data from pdf file to text(.txt) file. > In one of the article I came across > org.apache.pdfbox.pdfviewer.PDFPageDraw

Re: Error printing...

2014-01-22 Thread Alin Mazilu
1:48 PM, John Hewson wrote: > Yes, there is. Simply Google "JBIG2 plugin” and follow the first link, it > will be called "jbig2-imageio". > > -- John > > On 22 Jan 2014, at 09:16, Alin Mazilu wrote: > > > Hello all, > > > > I am printing so

Error printing...

2014-01-22 Thread Alin Mazilu
Hello all, I am printing some PDFs and I am getting this: Jan 22, 2014 12:07:47 PM org.apache.pdfbox.filter.JBIG2Filter decode SEVERE: Can't find an ImageIO plugin to decode the JBIG2 encoded datastream. Jan 22, 2014 12:07:47 PM org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap getRGBImage SE

Re: PDFTextStripper's writeLine() must be protected!

2013-11-15 Thread Alin Mazilu
Hello, I would venture to guess that if you need to override that method you probably need to do something more complicated than just finding out where a line starts and where it ends. Because if you just need to get the beginning and end of each line, you can override setLineSeparator() and all t

Re: PDF Text Highlight

2013-07-27 Thread Alin Mazilu
standard -- the upper left corner. But that's not a problem as it's fixable with simple arithmetic. Thank you so much for your help. It would have taken me a long time to figure it out on my own, if ever. Alin Mazilu On Fri, Jul 26, 2013 at 6:19 PM, Fred Hansen wrote: > Caveat: I

PDF Text Highlight

2013-07-26 Thread Alin Mazilu
as? I appreciate it. Alin Mazilu

Re: [DISCUSS] Switch to java 1.6

2013-04-30 Thread Alin Mazilu
th normally now... :)) On Tue, Apr 30, 2013 at 1:03 PM, Thomas Chojecki wrote: > > Zitat von Alin Mazilu : > > Hello, >> > Hi, > > > I got one: JavaFX. I use PDFBox in projects that use JavaFX 1.7/1.8. >> > I try to find this JavaFX version to see what J

Re: [DISCUSS] Switch to java 1.6

2013-04-28 Thread Alin Mazilu
Hello, I got one: JavaFX. I use PDFBox in projects that use JavaFX 1.7/1.8. Alin On Sun, Apr 28, 2013 at 1:35 PM, Andreas Lehmkuehler wrote: > Hi, > > there was already a discussion about switching to java 1.6. As this is a > very > important topic I'd like to move the discussion to a separate