On Sun, Dec 30, 2012 at 12:50 AM, Albert Astals Cid <aa...@kde.org> wrote:
> On Sunday, 30 December 2012, at 00:30:41, Jaydeep Solanki wrote:
> > I'm aware of the Textpage algorithms that, regardless of the generator,
> > make text selection work properly in column layouts.
> > I'm not asking this for Okular; I'm asking for personal use. While trying
> > out Poppler, I made a small app that can currently open PDF files. While
> > I was working on text selection in column layouts, I thought that if all
> > the libs generate text in the correct order, then there is no need to
> > code an algorithm to arrange the text. In future I'm planning to add
> > support for other formats, so I need to know whether the algorithm is
> > needed.
>
> The thing is, why write the column sorting algorithm in each and every one
> of the libraries if you can have it in just one place?

Yes, you are absolutely correct, but I was thinking of omitting the algorithm completely. If all the libs generate text in the proper order, then that order can be used to drive the selection instead of an algorithm.

I have read that OCRopus is used in Okular. I haven't confirmed this; I only found it written somewhere on the internet (correct me if I'm wrong). OCRopus uses image processing, which takes a large amount of computing resources. So the core reason to omit the algorithm is to save that computation.

> Cheers,
>   Albert
>
> > Jaydeep
> >
> > On Sat, Dec 29, 2012 at 11:22 PM, Albert Astals Cid <aa...@kde.org> wrote:
> > > On Saturday, 29 December 2012, at 22:56:12, Jaydeep Solanki wrote:
> > > > As you might know, Poppler::Page::textList() generates text in
> > > > the correct order (i.e. left to right). Poppler not only generates
> > > > it in the correct order but also considers the layout; for example,
> > > > in a two-column document it follows the column layout while
> > > > indexing the text.
> > > > [image: Inline image 2]
> > > >
> > > > Just as an example, consider the image above and look at the
> > > > selection: Poppler doesn't generate textList() in that order; it
> > > > generates textList() as shown in the image below.
> > > >
> > > > [image: Inline image 3]
> > > >
> > > > So my question is: do all the libraries that the Okular generators
> > > > use generate text in the proper order, considering the layout?
> > >
> > > That doesn't matter, Textpage algorithms "should" correctly arrange
> > > text in columns (as correctly as the algorithm in there works).
> > >
> > > Have you found any particular problem?
> > >
> > > Albert
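
For illustration, the kind of column sorting discussed in this thread can be sketched with plain geometry on the text boxes. This is only a minimal sketch under stated assumptions, not Okular's Textpage algorithm and not anything Poppler provides: `Box`, `find_columns`, `reading_order` and `min_gap` are hypothetical names, each text box is assumed to be an axis-aligned rectangle with y growing downward, columns are assumed to be separated by a clear vertical gutter, and no box is assumed to span several columns (a full-width heading would merge everything into one column).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hypothetical text box: a word or line with its bounding rectangle."""
    text: str
    left: float
    top: float
    right: float
    bottom: float


def find_columns(boxes, min_gap=20.0):
    """Return (left, right) x-ranges, one per detected column.

    Horizontal extents that overlap or lie closer than min_gap are merged;
    a horizontal gap of at least min_gap starts a new column.
    """
    columns = []
    for left, right in sorted((b.left, b.right) for b in boxes):
        if columns and left - columns[-1][1] < min_gap:
            # Still inside (or close to) the current column: widen it.
            columns[-1][1] = max(columns[-1][1], right)
        else:
            columns.append([left, right])
    return [tuple(c) for c in columns]


def reading_order(boxes, min_gap=20.0):
    """Sort boxes column by column, then top to bottom, then left to right."""
    columns = find_columns(boxes, min_gap)

    def key(box):
        center = (box.left + box.right) / 2
        # Every box was merged into exactly one column range above.
        col = next(i for i, (lo, hi) in enumerate(columns) if lo <= center <= hi)
        return (col, box.top, box.left)

    return sorted(boxes, key=key)


# Two-column page whose boxes arrive row by row across the whole page.
page = [
    Box("Left-1", 50, 100, 250, 112),
    Box("Right-1", 300, 100, 500, 112),
    Box("Left-2", 50, 120, 250, 132),
    Box("Right-2", 300, 120, 500, 132),
]
print([b.text for b in reading_order(page)])
# prints ['Left-1', 'Left-2', 'Right-1', 'Right-2']
```

A sketch like this works only on coordinates, so its cost is that of two sorts over the boxes and it involves no image processing.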
_______________________________________________ Okular-devel mailing list Okular-devel@kde.org https://mail.kde.org/mailman/listinfo/okular-devel