htdig@htdig.org

2000-07-19 Thread Gilles Detillieux

According to [EMAIL PROTECTED]:
> I am modifying the Perl script that parses PDFs and have come across a
> problem. I am indexing sites that use CGI; htdig (I think) seems to cut
> off my URL just before the & and unfortunately the information
> following the & is the most important. The URL should look like
> /cgi-bin/../../..?..=..&..=&.. but $ARGV[2] returns /cgi-bin/../../?..=..
> Could this be coded into htdig? And if so, where can I find the code
> to modify it? I have looked at URL.cc and don't see it
> coded into there but maybe I am overlooking something.

If you're running version 3.1.3, upgrade to 3.1.5.
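
The thread does not say what changed between 3.1.3 and 3.1.5. One general way an argument can lose everything after an `&` is when a URL is placed unquoted on a shell command line, where `&` ends the command. The sketch below only illustrates that shell behavior; that this is what happens inside htdig 3.1.3 is an assumption, and the URL is made up for the example.

```shell
#!/bin/sh
# Hypothetical URL, not taken from the thread.
url='/cgi-bin/search?id=1&page=2'

# Unquoted: the inner shell parses `&` as "run in the background", so printf
# only receives the text before it; `page=2` becomes a variable assignment.
unquoted=$(sh -c "printf '%s\n' $url")

# Quoted: the whole URL reaches printf as a single argument.
quoted=$(sh -c "printf '%s\n' '$url'")

echo "unquoted: $unquoted"
echo "quoted:   $quoted"
```

If an external parser script sees a truncated `$ARGV[2]`, comparing it against a quoted invocation like the second one can help tell a quoting problem from a URL-parsing problem.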

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.




htdig@htdig.org

2000-07-19 Thread campbel

Hello,

I am modifying the Perl script that parses PDFs and have come across a
problem. I am indexing sites that use CGI; htdig (I think) seems to cut
off my URL just before the & and unfortunately the information
following the & is the most important. The URL should look like
/cgi-bin/../../..?..=..&..=&.. but $ARGV[2] returns /cgi-bin/../../?..=..
Could this be coded into htdig? And if so, where can I find the code
to modify it? I have looked at URL.cc and don't see it
coded into there but maybe I am overlooking something.

Thank you in advance,
Sheri 






RE: [htdig] Solaris Compile Problems

2000-07-19 Thread Chad Phillips

That did the trick.  I went out to sunfreeware.com and got the 2.95.2 gcc package,
installed it, and everything went smoothly after that.

thanks
chad

>>> "Budd, S" <[EMAIL PROTECTED]> 07/19/00 03:40AM >>>

I have always had to use   2.95.2  gcc



-----Original Message-----
From: Chad Phillips [mailto:[EMAIL PROTECTED]] 
Sent: Monday, July 17, 2000 5:30 PM
To: [EMAIL PROTECTED] 
Subject: [htdig] Solaris Compile Problems


I just upgraded my OS from Solaris 2.6 to 8, and now I can't compile HtDig.

When I run configure it starts fine but then gives me this error:
checking for fstream.h... no
configure: error: To compile ht://Dig, you will need a C++ library. Try
installing libstdc++.

I installed packages for gcc and libstdc.
libstdc++-2.8.1.1-sol7-sparc-local
GCC281

Any ideas would be greatly appreciated.

thanks
chad







Re: [htdig] start/stop/restart

2000-07-19 Thread Gilles Detillieux

According to Franck Horlaville:
> I'd like to set htdig to crawl only at night.
> 
> Right now I know how to start it at specified times with cron and 
> tell it to save context before quitting so it restarts where it left 
> off.
> 
> Current operation chain :
> 
> general "do it forever and log" script
>   slightly modified rundig.sh
>     htdig
> 
> I would like to include cron somewhere in there to turn crawling on and off.
> 
> Any suggestions ?

You'd need to run htdig with the -l option, and save its process ID somewhere.
Then, your cron job to stop crawling would have to kill that process ID.
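
A minimal sketch of that start/stop arrangement follows. The `-l` option comes from the advice above; the PID file, log file, config path, script name, and cron times are all placeholders chosen for illustration, and the plain `kill` sends the default TERM signal.

```shell
#!/bin/sh
# Hypothetical locations; adjust for your site.
PIDFILE=${PIDFILE:-/tmp/htdig-crawl.pid}
LOGFILE=${LOGFILE:-/tmp/htdig-crawl.log}
# -l lets htdig pick up where an interrupted dig left off.
CRAWL_CMD=${CRAWL_CMD:-"htdig -l -c /path/to/your/config/file.conf"}

start_crawl() {
    # Word splitting of $CRAWL_CMD is intentional (command plus options).
    $CRAWL_CMD >> "$LOGFILE" 2>&1 &
    echo $! > "$PIDFILE"          # remember the process ID for the stop job
}

stop_crawl() {
    [ -f "$PIDFILE" ] || return 0 # nothing recorded, nothing to stop
    kill "$(cat "$PIDFILE")" 2>/dev/null
    rm -f "$PIDFILE"
}

# Hypothetical crontab entries (script name and times are placeholders):
#   0 22 * * *  /usr/local/bin/crawlctl.sh start
#   0 6  * * *  /usr/local/bin/crawlctl.sh stop
case "${1:-}" in
    start) start_crawl ;;
    stop)  stop_crawl ;;
esac
```

Keeping the PID in a file means the stop job does not have to guess which htdig process to kill if more than one is running.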

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






Re: [htdig] Solaris Compile Problems

2000-07-19 Thread Gilles Detillieux

According to Chad Phillips:
> I just upgraded my OS from Solaris 2.6 to 8, and now I can't compile HtDig.
> When I run configure it starts fine but then gives me this error:
> checking for fstream.h... no
> configure: error: To compile ht://Dig, you will need a C++ library. Try
> installing libstdc++.
> 
> I installed packages for gcc and libstdc.
> libstdc++-2.8.1.1-sol7-sparc-local
> GCC281
> 
> Any ideas would be greatly appreciated.

Does the libstdc++ package include all the header files?  That seems to
be what configure is failing to find.  If the header files are in a non-
standard location, perhaps you need to set your CFLAGS environment variable
before running ./configure, to point it in the right direction.
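
As a rough sketch of that suggestion, the helper below looks for a header under some candidate directories and prints the directory that holds it, which can then be passed to configure as an include path. The function name and the search roots in the usage comment are hypothetical; whether configure also needs the path in CPPFLAGS depends on the configure script and is not confirmed by this thread.

```shell
#!/bin/sh
# find_header_dir NAME DIR...
# Print the directory of the first file called NAME found under the given
# directories; return non-zero if none is found.
find_header_dir() {
    name=$1
    shift
    hdr=$(find "$@" -type f -name "$name" 2>/dev/null | head -n 1)
    [ -n "$hdr" ] && dirname "$hdr"
}

# Hypothetical usage (search roots are guesses, adjust for your system):
#   dir=$(find_header_dir fstream.h /usr/local /opt) &&
#       CFLAGS="-I$dir" ./configure
```

If the function prints nothing, the package probably did not install the headers at all, which answers the first question above.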

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






Re: [htdig] Error in ./configure

2000-07-19 Thread Gilles Detillieux

According to Sigfus Oddsson:
> My Linux distribution is Red Hat 6.2, very minimal though - I only installed
> what I considered to be the bare minimum of packages.
> 
> This is my sed version: sed-3.02-6
> 
> Included is my config.status file
> 
> My $SHELL is /bin/bash.
...
> > > > > When doing a ./configure on the 3.1.5 version of Ht://Dig I constantly get
> > > > > this error:
> > > > >
> > > > > sed: file conftest.s1 line 3: Unterminated `s' command
> > > > >
> > > > > Which results in no Makefile and CONFIG being generated so I can't 'make'
> > > > > the binary.

In the attached config.status file...
[snip]
> sed 's/%@/@@/; s/@%/@@/; s/%g$/@g/; /@g$/s/[\\&%]/\\&/g;
>  s/@@/%@/; s/@@/@%/; s/@g$/%g/' > conftest.subs <<\CEOF
> /^[   ]*VPATH[]*=[^:]*$/d
> 
> s%@SHELL@%/bin/sh%g
> s%@CFLAGS@%-O3 -march=pentium -mcpu=pentium -ffast-math -funroll-loops 
>-fomit-frame-pointer -fforce-mem -fforce-addr
> -malign-double -fno-exceptions%g
[snip]

This seems to be the problem right here.  The CFLAGS environment variable
contains an embedded newline character, which is throwing off the sed
script.  You should either rerun ./configure with the CFLAGS variable
corrected, if that's how configure got its CFLAGS setting in the first
place, or you can edit config.status to rejoin the two lines above into
a single line and then execute config.status to build all the necessary
Makefiles.
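
One way to correct the variable before rerunning ./configure is to collapse any embedded newlines or tabs into single spaces. This is a minimal sketch; the function name `flatten_flags` is made up for the example.

```shell
#!/bin/sh
# flatten_flags STRING
# Replace newlines and tabs with spaces, then squeeze runs of spaces, so the
# result fits on one line of a sed substitution command.
flatten_flags() {
    printf '%s' "$1" | tr '\n\t' '  ' | tr -s ' '
}

# Hypothetical usage before rerunning configure:
#   CFLAGS=$(flatten_flags "$CFLAGS")
#   export CFLAGS
#   ./configure
```

This only helps if the newline really came from the environment; if the flags are set in a shell startup file, fixing them there avoids the problem on the next login as well.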

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






[htdig] htdig - searching

2000-07-19 Thread alan

Whenever I search for some word (e.g. "frog" or any other word) it doesn't show me any
results, even though I know "frog" has to be found in at least one file... any ideas why
my searches are not finding anything? I tried to add META tags but to no avail...
htdig and htmerge seem to have worked fine, but my searches are still not finding anything.
Please help.

thx


-alan




Gilles Detillieux wrote:

> According to alan:
> > thx... now i run htdig -vvv but nothing appears on the screen yet the program exits
> > normally... any idea why i'm not seeing ANYTHING at all?
>
> Strange.  Try "htdig -v -c /path/to/your/config/file.conf", and make
> sure your config file makes sense.  You should get some sort of output,
> unless htdig is dying prematurely.




Re: [htdig] htdig verbose mode

2000-07-19 Thread alan

OK, my config file had a minor error which I didn't realize :)

thx anyway


-alan




Gilles Detillieux wrote:

> According to alan:
> > thx... now i run htdig -vvv but nothing appears on the screen yet the program exits
> > normally... any idea why i'm not seeing ANYTHING at all?
>
> Strange.  Try "htdig -v -c /path/to/your/config/file.conf", and make
> sure your config file makes sense.  You should get some sort of output,
> unless htdig is dying prematurely.




Re: [htdig] Error in ./configure

2000-07-19 Thread Gilles Detillieux

According to [EMAIL PROTECTED]:
> Oh, I'm sorry that I didn't state that. I figured it would appear in the log
> file.
> 
> I'm running Linux 2.2.16 on an i386

I've taken a look at the ./configure script, and as near as I can tell,
the only thing that would cause that error is an error in the substitution
command for the $SHELL variable.  What is your setting of $SHELL when you
run ./configure?  Which distribution of Linux are you running, and what
is your sed version?  (For an rpm-based distribution, you can find out
the sed version with "rpm -qf `which sed`".)  Could you send or post your
config.status file?  It might provide a clue as to what the problem is.

> > According to Sigfus Oddsson:
> > > When doing a ./configure on the 3.1.5 version of Ht://Dig I constantly get
> > > this error:
> > > 
> > > sed: file conftest.s1 line 3: Unterminated `s' command
> > > 
> > > Which results in no Makefile and CONFIG being generated so I can't 'make'
> > > the binary.
> > 
> > What is your operating system (& which version)?

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






Re: [htdig] htdig verbose mode

2000-07-19 Thread Gilles Detillieux

According to alan:
> thx... now i run htdig -vvv but nothing appears on the screen yet the program exits
> normally... any idea why i'm not seeing ANYTHING at all?

Strange.  Try "htdig -v -c /path/to/your/config/file.conf", and make
sure your config file makes sense.  You should get some sort of output,
unless htdig is dying prematurely.
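
A small wrapper along these lines can rule out the simplest failure, a config file that is missing or unreadable, before htdig is run at all. The `-v` and `-c` options are the ones named above; the function name, the `HTDIG` variable, and the return code 2 are assumptions made for the sketch.

```shell
#!/bin/sh
# Command to run; override HTDIG if htdig is not on your PATH.
HTDIG=${HTDIG:-htdig}

# verbose_dig CONFIG
# Refuse to start if CONFIG cannot be read; otherwise run htdig verbosely
# against that config file.
verbose_dig() {
    conf=$1
    if [ ! -r "$conf" ]; then
        echo "config file not readable: $conf" >&2
        return 2
    fi
    "$HTDIG" -v -c "$conf"
}

# Hypothetical usage:
#   verbose_dig /path/to/your/config/file.conf
```

This does not validate the contents of the config file; it only catches the case where htdig would be pointed at nothing.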

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






Re: [htdig] Htmerge: "Deleted, invalid"

2000-07-19 Thread Gilles Detillieux

According to David Adams:
> I use the standard MIPSpro compiler.  The script I use (thanks to my former 
> colleague James Hammick) to setup the Makefile is:
> 
> #!/bin/sh
> CFLAGS="-woff all -O2 -mips4 -n32 -DHAVE_ALLOCA_H" ; export CFLAGS
> CPPFLAGS="-woff all -O2 -mips4 -n32 -DHAVE_ALLOCA_H" ; export CPPFLAGS
> LDFLAGS="-mips4 -L/usr/lib32 -rpath /opt/local/htdig-3.1.5/lib";
> export LDFLAGS
> ./configure --prefix=/opt/local/htdig-3.1.5 \
>   --with-cgi-bin-dir=/opt/local/htdig-3.1.5/cgi-bin \
>   --with-image-dir=/opt/local/htdig-3.1.5/graphics \
>   --with-search-dir=/opt/local/htdig-3.1.5/htdocs/sample
> 
> A lot of that is site-specific, and the "-rpath " option is only
> needed because the compression library is not in a standard place on the 
> machine on which htdig is run.
> 
> The "-woff all" option suppresses most warning messages.  I will remove it,
> recompile htdig and send the result directly to Gilles, it might contain a clue.

As Sinclair mentioned, 'you need to have the 2.95.2 gcc and the latest
gnu "make".'  I don't know that anyone has ever gotten ht://Dig to work
with SGI's own compiler.  In fact, we got a lot of reports from folks
who couldn't even get it to compile.

If you're really determined to get to the bottom of this and make it work
with the SGI compiler, I wish you well, but I doubt I can help much.
I looked at the output you sent me, and didn't really see any red
flags pointing to an obvious problem.  I know that the Serialize and
Deserialize functions for the db.docdb records can be a tad finicky, so
that would probably be a place to look.  There could also be problems
with incorrect assumptions about word sizes, e.g. if the SGI compiler
has 64-bit long ints.  I'd also look at the db.wordlist records (they're
ASCII text) before and after htmerge, to see if htdig is actually telling
htmerge to remove some of these documents, or if htmerge is deciding to
do so on its own.
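
Since db.wordlist is ASCII text, the before-and-after comparison can be done with standard tools. The sketch below lists lines that were present in a saved copy but are gone afterwards; it assumes nothing about the record format beyond one record per line, and the function name and file paths are placeholders.

```shell
#!/bin/sh
# removed_lines BEFORE AFTER
# Print lines that appear in BEFORE but not in AFTER, ignoring line order.
removed_lines() {
    tmp_b=$(mktemp) || return 1
    tmp_a=$(mktemp) || { rm -f "$tmp_b"; return 1; }
    # comm needs both inputs sorted in the same collation order.
    LC_ALL=C sort "$1" > "$tmp_b"
    LC_ALL=C sort "$2" > "$tmp_a"
    LC_ALL=C comm -23 "$tmp_b" "$tmp_a"
    rm -f "$tmp_b" "$tmp_a"
}

# Hypothetical usage (paths are placeholders):
#   cp /path/to/db.wordlist /tmp/db.wordlist.before
#   htmerge -c /path/to/your/config/file.conf
#   removed_lines /tmp/db.wordlist.before /path/to/db.wordlist
```

If the saved copy already lacks the documents in question, the removal was decided on the htdig side; if they only vanish after htmerge, the merge step is the place to look.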

For the time being, the ht://Dig code hasn't had much of a workout on
non-GNU compilers, so it doesn't seem to do well on them.  If you can
help remedy that, great.  If you want to get the package working as
quickly and easily as possible, I'd suggest trying the GNU C and C++
compilers.

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






Re: [htdig] htdig verbose mode

2000-07-19 Thread alan

thx... now i run htdig -vvv but nothing appears on the screen yet the program exits
normally... any idea why i'm not seeing ANYTHING at all?

thx

-alan

Gilles Detillieux wrote:

> According to alan:
> > how do i run htdig in verbose mode in unix? and how do i show all the htdig options
> > (i.e. htdig -v -c ...)
>
> You can use one or more -v options to htdig to select the level of verbosity
> you want.  A -\? option will show all available options for a given program
> in the package.  Also have a look at all the on-line documentation at
> http://www.htdig.org/ , which will explain the options for each of the
> programs in greater detail.




Re: [htdig] htdig verbose mode

2000-07-19 Thread Gilles Detillieux

According to alan:
> how do i run htdig in verbose mode in unix? and how do i show all the htdig options
> (i.e. htdig -v -c ...)

You can use one or more -v options to htdig to select the level of verbosity
you want.  A -\? option will show all available options for a given program
in the package.  Also have a look at all the on-line documentation at
http://www.htdig.org/ , which will explain the options for each of the
programs in greater detail.

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






[htdig] htdig verbose mode

2000-07-19 Thread alan

how do i run htdig in verbose mode in unix? and how do i show all the htdig options
(i.e. htdig -v -c ...)

thx


-alan







Re: [htdig] Error in ./configure

2000-07-19 Thread Gilles Detillieux

According to Sigfus Oddsson:
> When doing a ./configure on the 3.1.5 version of Ht://Dig I constantly get
> this error:
> 
> sed: file conftest.s1 line 3: Unterminated `s' command
> 
> Which results in no Makefile and CONFIG being generated so I can't 'make'
> the binary.

What is your operating system (& which version)?

-- 
Gilles R. Detillieux  E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930






[htdig] htdig on irix

2000-07-19 Thread Budd, S

we have built and are running htdig on irix 6.5

you need to have the 2.95.2  gcc and the latest gnu "make"


Regards
Sinclair 

