Re: Plucker server on Project Gutenberg

2005-11-05 Thread Marcello Perathoner
The first experimental PG plucker server is up. Find the no. of the ebook you want and then call this url: http://www.gutenberg.org/cache/plucker/17000.plucker replace 17000 with your ebook no. This will build the file, if not existent. That may take some time when the servers are busy. Th

RE: Plucker server on Project Gutenberg

2005-11-03 Thread Lambert, Mark
> From: Marcello Perathoner > Lambert, Mark wrote: > > > But it is low-hanging fruit that would make it simpler for > those that > > have HTML. > > If they have HTML, of course I use HTML. But more than half > of them don't. > No worries. I wasn't sure if you were or not so I thought I'd

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
Lambert, Mark wrote: But it is low-hanging fruit that would make it simpler for those that have HTML. If they have HTML, of course I use HTML. But more than half of them don't. -- Marcello Perathoner [EMAIL PROTECTED] ___ plucker-dev mailing list

RE: Plucker server on Project Gutenberg

2005-11-02 Thread Lambert, Mark
>On Behalf Of Marcello Perathoner >Sent: Wednesday, November 02, 2005 1:37 PM >To: plucker-dev@rubberchicken.org >Subject: Re: Plucker server on Project Gutenberg > >Lambert, Mark wrote: > >> I don't know if this would help or not, but I always go off the HTML >

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
Lambert, Mark wrote: I don't know if this would help or not, but I always go off the HTML version and break on any H1 or H2. That isn't perfect either, but is easier to do. Not all PG ebooks have an HTML version. -- Marcello Perathoner [EMAIL PROTECTED] ___

RE: Plucker server on Project Gutenberg

2005-11-02 Thread Lambert, Mark
ovember 02, 2005 10:10 AM To: plucker-dev@rubberchicken.org Subject: Re: Plucker server on Project Gutenberg David A. Desrosiers wrote: >> I'm going to replace the text/plain parser with a custom one that >> will (try to) parse chapter heads, italics etc. out of the plain text. >

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
David A. Desrosiers wrote: I'm going to replace the text/plain parser with a custom one that will (try to) parse chapter heads, italics etc. out of the plain text. I'd be interested to see how you solve the context issue that has been brought up on the pg lists over the last year or so. I

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
Alexander R. Pruss wrote: That's a wonderful idea. Are you going to be caching the pdbs, or will it be fast enough to generate on demand? I'll have to cache them. Are you going to be making the docs split into 32K pages, or will you use the continuation flag to make each doc look like a si

Re: Plucker server on Project Gutenberg

2005-11-02 Thread David A. Desrosiers
I'm the webmaster of Project Gutenberg and I'm about to install the plucker distiller on the PG website. The idea is to have people download a ready-made plucker pdb instead of requiring them to run the distiller on the appropriate ebook file. There's a LOT of tools out there that do this,

Re: Plucker server on Project Gutenberg

2005-11-02 Thread Alexander R. Pruss
That's a wonderful idea. Are you going to be caching the pdbs, or will it be fast enough to generate on demand? Sorry, don't know about sorting of bookmarks. I myself added sorting of all records by URL to the parser, though, to keep chapters and the like in the right order. Maybe the bookm

Plucker server on Project Gutenberg

2005-11-02 Thread Marcello Perathoner
I'm the webmaster of Project Gutenberg and I'm about to install the plucker distiller on the PG website. The idea is to have people download a ready-made plucker pdb instead of requiring them to run the distiller on the appropriate ebook file. I'm going to replace the text/plain parser with a