Robin,
If you find you're grabbing too much, limit the selection.
a) See if there are natural divisions in the site and pluck each one
individually. For example, instead of grabbing all of www.mylocalnewspaper.com,
take www.mylocalnewspaper.com/august2004; or instead of all the reviews on a
site, take all the sci-fi ones first (see if they all start with the same
address), and so on, doing each month or category as a smaller chunk. If the
only thing the pages have in common is www.whatever.com and the individual
pages are randomly named, you can't do this. But if, for example, all the
sci-fi reviews start with www.whatever.com/sci-fi and all the romance ones with
www.whatever.com/romance, you can pluck these groups individually for smaller
files. You can also cut the size by turning off pictures, or by making them
grayscale instead of full color or thousands of colors.
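To check whether a site has natural divisions like this, you can group a list of its page addresses by the first part of the path. This is only a rough sketch in Python: the page names below are made up for illustration, and `group_by_section` is a hypothetical helper, not anything that ships with Plucker.

```python
from collections import defaultdict
from urllib.parse import urlparse


def group_by_section(urls):
    """Group URLs by host plus the first path segment (the 'section')."""
    groups = defaultdict(list)
    for url in urls:
        parts = urlparse(url)
        segments = [s for s in parts.path.split("/") if s]
        # Only treat the first segment as a section if a page name follows it.
        section = segments[0] if len(segments) > 1 else ""
        groups[f"{parts.netloc}/{section}"].append(url)
    return dict(groups)


# Hypothetical addresses, invented for this example.
urls = [
    "http://www.whatever.com/sci-fi/review1",
    "http://www.whatever.com/sci-fi/review2",
    "http://www.whatever.com/romance/review1",
    "http://www.whatever.com/about",
]
groups = group_by_section(urls)
for prefix, pages in sorted(groups.items()):
    print(prefix, len(pages))
```

If most pages land under a handful of prefixes, each prefix is a candidate for its own smaller file. If nearly everything ends up under the bare host, the pages are "randomly named" in the sense above and this approach won't help.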
b) Specify your limits. For example, I recently plucked a review website all
about cool tech gadgets, www.the-gadgeteer.com. I first tried to pluck the one
page that links to every single one of their reviews. Not only did it keep
freezing, but the finished file (had it worked) would have been over 60 megs,
and while I had room on my card, the Palm doesn't always want to open files
that large. So I broke it into chunks. I noticed that another page had
subdivisions for the reviews (palmos, toys, etc.) and that all the reviews in a
category began with the same address (as in the example above). So I set up 8
different Plucker files, one for each category. My new problem was that at the
bottom, after all the reviews, were dozens and dozens of third-party links to
sites off the gadgeteer site. I didn't want those, but I did want to be able to go
as deep into the gadgeteer site as I could. I set my depth limit to 3, then
went to the configuration screen (you can either bring it up manually or tell
the Plucker wizard to open it right after you create a new file), went to the
"limits" field, and in the area for limiting to a specific pattern, pasted in
the address of any page in the category I was working on, e.g.
www.the-gadgeteer.com/toys/micromachines
Obviously I didn't want to tell it to give me only pages that start with this
exact address, because only one would come up (the page about micromachines).
So I deleted everything backwards until the / after toys. Now it knew to get me
all links, up to 3 deep, ONLY if they started with www.the-gadgeteer.com/toys/,
so it wouldn't waste time and space grabbing all that third-party stuff or
anything else. The last thing to do in this step is to put .* at the beginning
and end of that address (don't ask me why, that's what it said in the help file
I used to teach me this). So in this field of the limits page, I end up with:
.*www.the-gadgeteer.com/toys/.*
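For what it's worth, the .* looks like regular-expression syntax, where it means "any run of characters". Below is a minimal Python sketch of why it would be needed, assuming the field is treated as a regular expression that has to match the whole address (that is my assumption, not something the help file confirmed). The sample addresses and the `is_wanted` helper are made up for illustration.

```python
import re

# The pattern exactly as typed into the limits field.
# Note: an unescaped "." matches any character, so it also matches a literal dot.
PATTERN = re.compile(r".*www.the-gadgeteer.com/toys/.*")


def is_wanted(url: str) -> bool:
    """Return True if the whole URL matches the pattern."""
    return PATTERN.fullmatch(url) is not None


# Hypothetical addresses, invented for this example.
urls = [
    "http://www.the-gadgeteer.com/toys/micromachines",
    "http://www.the-gadgeteer.com/palmos/some-review",
    "http://www.example.com/some-ad",
]
kept = [u for u in urls if is_wanted(u)]

# Without the leading and trailing .*, a whole-address match fails,
# because "http://" and "micromachines" are not covered by the pattern.
bare = re.fullmatch(r"www.the-gadgeteer.com/toys/", urls[0])

print(kept)
print(bare)
```

Under that assumption, the leading .* soaks up the "http://" and the trailing .* soaks up the page name, which is why the bare address on its own would match nothing.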
I hope this helps, and that I explained it clearly.
Jennifer.

