Thanks. I'll certainly look into those - I've used pdftotext on the
odd occasion to get access to text in a pdf file - useful for
abstracting program fragments out of pdf files when learning new
languages - but I likewise haven't ever investigated it beyond that.
Wesley Parish
On 7/01/2013, at 9:44 PM, Nick Rout wrote:
On Mon, Jan 7, 2013 at 9:18 PM, Wesley Parish
<[email protected]> wrote:
Does anyone know if there is any command-line utility to parse PDF
files for file info? Things like word count, number of pages, etc.? (I
know there's something like this for DOC files, but I haven't come
across it for PDF - yet.)
pdftk has a lot of options. I mainly use it to join PDFs together
or to pick pages out of a PDF to make a new one (e.g. cutting out the
advert pages from the electronic version of Linux Journal before
printing).
I haven't investigated all its functions, but it's a place to start.
pdftotext and pdf2txt may help too.
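
For what it's worth, here is a rough sketch of how the page count and word
count could be pulled out with the tools mentioned above. It assumes pdftk
and pdftotext are both installed and on the PATH, that "pdftk FILE dump_data"
prints a "NumberOfPages:" line, and that "pdftotext FILE -" writes the
extracted text to stdout. The function names (parse_page_count, count_words,
pdf_stats) are made up for illustration, and the word count is only as good
as the text extraction.

```python
import subprocess


def parse_page_count(dump_data_output):
    """Return the page count from `pdftk FILE dump_data` output, or None."""
    for line in dump_data_output.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "NumberOfPages":
            return int(value.strip())
    return None


def count_words(text):
    """Count whitespace-separated tokens, roughly what `wc -w` would report."""
    return len(text.split())


def pdf_stats(path):
    """Run pdftk and pdftotext on `path` (assumes both are installed)."""
    dump = subprocess.run(["pdftk", path, "dump_data"],
                          capture_output=True, text=True, check=True).stdout
    text = subprocess.run(["pdftotext", path, "-"],
                          capture_output=True, text=True, check=True).stdout
    return parse_page_count(dump), count_words(text)
```

Something like pages, words = pdf_stats("article.pdf") should then give both
numbers in one go; "article.pdf" is just a placeholder file name.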
_______________________________________________
Linux-users mailing list
[email protected]
http://lists.canterbury.ac.nz/mailman/listinfo/linux-users