ike so:
start_url: `/home/htdig/conf/start_url_file`
limit_urls_to: `/home/htdig/conf/limit_url_file`
The contents of both files are just links.
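As a rough sketch, assuming the backquoted paths above, the pieces might fit together like this. The URLs are made-up placeholders, and the lines starting with # are labels for this example, not file contents.

```
# --- in htdig.conf (paths taken from the message above) ---
start_url:      `/home/htdig/conf/start_url_file`
limit_urls_to:  `/home/htdig/conf/limit_url_file`

# --- /home/htdig/conf/start_url_file (hypothetical contents) ---
http://www.example.com/
http://docs.example.com/

# --- /home/htdig/conf/limit_url_file (hypothetical contents) ---
http://www.example.com/
http://docs.example.com/
```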
Good Luck,
Bill Carlson
--
Systems Programmer[EMAIL PROTECTED]| Opinions are mine,
Virtual Hospital http://www.vh.org/|
The last thing is to examine the files in lost+found. Run less on them,
then file if the contents don't make any sense.
Finally, reformatting and reinstalling is a bad habit; break it if you
can. You'll learn much more by trying to fix things than by reinstalling.
Contrary to Windows, with Li
really have fun, compile several kernels at once or alter the -j
parameter for make in the Makefile). I had a machine that could not
compile a kernel but otherwise ran fine. Turned out the CPU was
overheating, but only when it was really pushed.
So, compile a kernel or two and then start lookin
Hat and others. ht://Dig certainly provides all the
needed configuration options "out of the source".
Personally, I disagree with the move of /home/httpd -> /var/www in RedHat
7.0, but then I can move it right back anytime I want. Such is the power
of Knowing How It Works.
$.
rsion), but
Webalizer does a good job of getting the stats out. And the source is
fairly good too, easy to modify.
$.02
Bill Carlson
clean dig. You need to set up url_part_aliases for htdig to handle the
different hostname, but it works very well. This approach also handles the
problem of dynamic footers on the pages (more of a link checker issue).
If you don't control the server, Gilles is right: there is not much you can do.
Bill C
On Wed, 24 May 2000, Bruce Fancher wrote:
> Is everything else I did, like the url_part_alias: config entries, correct?
>
> Thanks
Hey Bruce,
The parameter is url_part_aliases. Add the 'es' in each config file and
you should be set.
Bill Carlson
, I don't know what kind of hardware they are using.
When indexing, it is possible to merge separate digs into one large
database. It's all a matter of planning and reading the fine print in the
documentation (which is excellent).
Building a scaling solution is always very iffy and
ould I modify to change the directory of star.gif, such as
> "temp/htdig/star.gif"
>
You need to either provide templates other than the "builtin" ones in your
conf file or recompile htdig after modifying CONFIG (look for IMAGE).
I'd use the builtin templates if you
Hey all,
I'm just starting to work on this and wanted to check with you all before
I got too involved.
Are the databases transportable across platforms? I.e., if I dig on a Sun
box, should I be able to move the resulting dbs to an Intel box and expect
htdig to work?
Thanks!
Bill Ca
m Cobalt's site got me up and running:
glibc-2.0.7-29C2
ldconfig-1.9.5-2
I believe the fix in ldconfig is what stops the core dump from htdig.
HTH,
Bill Carlson
arches, look at the restrict field for htsearch. A tag
like this will do what you need:
Bill Carlson
lders (I'm an email packrat). I got around the authentication by using
the local_urls and local_user_urls attributes in htdig.conf. This only works if
htdig is on the same machine as your archives...but it completely bypasses
the web server that way.
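A minimal sketch of what that could look like, assuming a hypothetical hostname and hypothetical filesystem paths (note that the ht://Dig documentation spells the second attribute local_user_urls):

```
# htdig.conf fragment; hostname and paths are hypothetical
# Read http://www.example.com/... straight from disk instead of over HTTP
local_urls:       http://www.example.com/=/var/www/html/
# Map http://www.example.com/~user/... to /home/user/public_html/...
local_user_urls:  http://www.example.com/=/home/,/public_html/
```

Since htdig reads the files directly, whatever authentication the web server enforces never comes into play, which is why this only helps when htdig runs on the machine that holds the files.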
Bill Carlson
IP and copy the Apache setup of www.site.org to
htdig.site.org and add FancyIndexing off. You will need to use the
url_part_aliases directive in htdig to get this to properly return the
correct search urls, but it works just fine. I use this at my site to
crawl a local copy of the site rather th
ok at the htsearch documentation, specifically the restrict tag.
Include something like:
<input type="hidden" name="restrict" value="http://some.host/Search/Here/Only/">
in your search form.
Bill Carlson
url_part_aliases: http://12.34.56.78/ *1
Think of the *1 as a placeholder which is set when digging and replaced
when searching.
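To make the placeholder idea concrete, here is a sketch of the two config files, assuming the dig runs against the address above and searches should return a public hostname. The name www.example.com and both file names are hypothetical.

```
# dig.conf: used when running htdig (the crawl-side address)
# URLs are stored in the database with http://12.34.56.78/ replaced by *1
url_part_aliases:  http://12.34.56.78/ *1

# search.conf: used by htsearch (the hostname users should see)
# When results are read back, *1 is expanded to http://www.example.com/
url_part_aliases:  http://www.example.com/ *1
```

Both files must use the same placeholder (*1 here) for the same part of the URL, otherwise the stored URLs will not translate back correctly.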
HTH,
Bill Carlson
zy, etc.) for that config with the -m option. The only
real savings is that you don't have to run htdig against the same site
twice, the target database will contain information from all the merged
databases.
HTH,
Bill Carlson
reat the 'index' page as a
regular page and follow the links to all files and subdirectories.
For Apache, this is done with the Options Indexes configuration directive.
Bill Carlson
reference as to where the data came from
other than the value of the links. As far as htsearch is concerned, it is
using only one database.
Were you thinking of displaying the source of the hit with each hit on the
results page?
Bill Carlson
eager to merge :> )
Thanks,
Bill Carlson
Systems Programmer[EMAIL PROTECTED]| Opinions are mine,
Virtual Hospital http://www.vh.org/| not my employer's.
University of Iowa Hospitals and Clinics|
On Mon, 20 Sep 1999, Geoff Hutchison wrote:
> At 10:14 AM -0500 9/20/99, Bill Carlson wrote:
> >rundig -c main.conf
> >rundig -m main.conf -c crawl.conf -skipdig
> >
> >where I modified rundig to take -skipdig and skip the htdig and htmerge
> >portion.
>
ntially what I am doing:
rundig -c main.conf
rundig -m main.conf -c crawl.conf -skipdig
where I modified rundig to take -skipdig and skip the htdig and htmerge
portion.
How can I verify that the merge happens other than performing a search?
What can I look for in the verbose output? In one of th
(stock rundig)
htdig crawl
htmerge crawl with '-m main.conf'
rundig skipping htdig and htmerge
The merge doesn't seem to happen. Search on main works fine, search on
crawl only returns crawl hits. What am I doing wrong?
Setup:
HtDig 3.1.2
Solaris 2.6
Any help appreciated
le, but no
change.
Any pointers?
Thanks,
Bill Carlson
ou hit this before I did, I was about to try htdig on a qube.
At any rate, to compile htdig you need a C++ compiler (such as g++), not
just libstdc++. I would guess there is an RPM on Cobalt's site for g++, if
not for htdig itself.
Good Luck!
Bill Carlson