Thanks. I'll certainly look into those - I've used pdftotext on the odd occasion to get access to text in a pdf file - useful for abstracting program fragments out of pdf files when learning new languages - but I likewise haven't ever investigated it beyond that.

Wesley Parish

On 7/01/2013, at 9:44 PM, Nick Rout wrote:



On Mon, Jan 7, 2013 at 9:18 PM, Wesley Parish <[email protected]> wrote: Does anyone know if there is any command-line utility to parse PDF files for file info? Things like wordcount, no of pages, etc? (I know there's something like this for DOC files: but I haven't come across it for PDF - yet.)



pdftk has a lot of options, I mainly use it to join pdfs together or choose pages out of a pdf to make a new one. (eg cutting out the advert pages from the electronic version of linux journal before printing.)

I haven't investigated all it's functions but it's a place to start.

pdftotxt and pdf2txt may help too.
_______________________________________________
Linux-users mailing list
[email protected]
http://lists.canterbury.ac.nz/mailman/listinfo/linux-users

_______________________________________________
Linux-users mailing list
[email protected]
http://lists.canterbury.ac.nz/mailman/listinfo/linux-users

Reply via email to