Mostafa Shaeri wrote:
> hello.
> i faced problem in my image extractor code. any suggestion appreciated.
>
> i first get stream from pdf and check the object in that , if object is
> a image i will save it.
> the code is working properly and i can get all image in the pdf. but
> some images is unusual because some texts on the image are hidden
> (erased).!!!!!!!!

The objects you're extracting are raster images. Text that is baked into a raster image usually renders with poor glyph quality, so it's better to add the text to the PDF as vector data that is drawn ON TOP OF the raster image. That's probably what happened in the PDFs you have.

You are extracting the images from the PDF exactly the way they were added: without the text. The text probably isn't part of the image at all. I'm pretty sure you'll find it if you parse the content stream of the page.

--
This answer is provided by 1T3XT BVBA
http://www.1t3xt.com/ - http://www.1t3xt.info

iText-questions mailing list
https://lists.sourceforge.net/lists/listinfo/itext-questions
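
To illustrate why the text is missing from the extracted images, here is a minimal sketch in plain Java (no iText calls) that scans an already decoded page content stream. In PDF, an image XObject is painted with the Do operator, while text is shown separately with operators such as Tj inside a BT ... ET block. The class name ContentStreamSketch, its two methods, and the sample stream are hypothetical and only for illustration. The regular expressions are a simplification: they ignore TJ arrays, hex strings, nested parentheses and font encodings, so a real tool should use a proper content stream parser instead.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of iText: a rough scan of a decoded content stream.
class ContentStreamSketch {
    // A literal string operand "( ... )" followed by the Tj (show text) operator.
    // Backslash escapes such as \( and \) are allowed inside the string.
    private static final Pattern SHOW_TEXT =
            Pattern.compile("\\(((?:\\\\.|[^\\\\()])*)\\)\\s*Tj\\b");

    // A name operand such as "/Im1" followed by the Do (paint XObject) operator.
    private static final Pattern PAINT_XOBJECT =
            Pattern.compile("/([^\\s/\\[\\]()<>]+)\\s+Do\\b");

    // Returns the raw (still escaped) strings passed to Tj, in stream order.
    static List<String> shownText(String content) {
        return firstGroups(SHOW_TEXT, content);
    }

    // Returns the resource names of the XObjects painted with Do, in stream order.
    static List<String> paintedXObjects(String content) {
        return firstGroups(PAINT_XOBJECT, content);
    }

    private static List<String> firstGroups(Pattern pattern, String content) {
        List<String> found = new ArrayList<>();
        Matcher m = pattern.matcher(content);
        while (m.find()) {
            found.add(m.group(1));
        }
        return found;
    }

    public static void main(String[] args) {
        // Made-up sample: an image is painted first, then text is drawn over it.
        String content = "q 200 0 0 100 50 600 cm /Im1 Do Q\n"
                + "BT /F1 12 Tf 60 650 Td (Caption on top) Tj ET";
        System.out.println("XObjects painted: " + paintedXObjects(content));
        System.out.println("Text shown: " + shownText(content));
    }
}
```

In the sample, the image resource and the caption are two independent instructions in the same stream, which is why saving the image object alone gives you the picture without the caption.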
