Jose, Do you have a choice between using .DOC and .PDF or is it that you must be able to extract from both?
If a choice, can you use another format instead? Brett. ----- Original Message ----- From: "jose" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Friday, September 27, 2002 3:59 AM Subject: [REBOL] Re: Converting Word .DOC and .PDF to text files Thanks. This is the advice I need. I'll probably be better off using wvware library I hope word is easier than PDF ! --- Gabriele Santilli <[EMAIL PROTECTED]> escribió: > Hi jose, > > On Thursday, September 26, 2002, 6:21:10 PM, you > wrote: > > j> I want to get the text of any arbitrary PDF file. > Is > j> there a spec I can look at ? > > On the Adobe web site you'll find the full > specifications for the > PDF format. I can send it to you, if you don't want > to search for > it. However, as I said, parsing a PDF file is harder > than creating > one, because you'll have to deal with all > possibilities > (compression, encryption, linearized format...); > of course, this > does not mean it is impossible. > > Regards, > Gabriele. > -- > Gabriele Santilli <[EMAIL PROTECTED]> -- > REBOL Programmer > Amigan -- AGI L'Aquila -- REB: > http://web.tiscali.it/rebol/index.r > > -- > To unsubscribe from this list, please send an email > to > [EMAIL PROTECTED] with "unsubscribe" in the > subject, without the quotes. > _______________________________________________________________ Yahoo! Messenger Nueva versión: Webcam, voz, y mucho más ¡Gratis! Descárgalo ya desde http://messenger.yahoo.es -- To unsubscribe from this list, please send an email to [EMAIL PROTECTED] with "unsubscribe" in the subject, without the quotes. -- To unsubscribe from this list, please send an email to [EMAIL PROTECTED] with "unsubscribe" in the subject, without the quotes.