UdmSearch: Webboard: #1044: Access denied for user: '@localhost' to database 'udmsearch'

2000-11-09 Thread gluke

Author: gluke
Email: [EMAIL PROTECTED]
Message:
This is covered at the FAQ section which can be found on our site.

Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: UdmSearch PHP Frontend - is it possible to search by the URL, not by keyword listed in "dict"?

2000-11-09 Thread Sergey Kartashoff

Hi!

Friday, November 10, 2000, 8:58:08 AM, you wrote:

AS>  is it possible to search by the URL, not by keyword listed in "dict"? It's
AS> a file database, and the "doct" table doesn't contain over half of the file
AS> names, so although I know a link to a file does exist the search engine
AS> doesn't pick it up. Any ideas how to fix this?
AS> Thanks
AS> Ari

Please check your Allow/Disallow section.

-- 
Regards, Sergey aka gluke.


__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: What proxy software on a freebsd machine?

2000-11-09 Thread gluke

Author: gluke
Email: [EMAIL PROTECTED]
Message:
It does not matter. Use Squid for example.

Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re[2]: UdmSearch: PgSQL: DELETE INDEX url_url;

2000-11-09 Thread Sergey Kartashoff

Hi!

Thursday, November 09, 2000, 9:55:11 PM, you wrote:

THH> On Thu, 9 Nov 2000, Alexander Barkov wrote:

>> Don't forget to recreate this index before starting indexer.
>> As far as url_url index is UNIQUE this does not allow indexer to add
>> the same link several time. If you remove index, the same documents
>> might be added several times.

THH> can this unique index not be based on the crc32 value instead?  that might
THH> explain why I'm up to 140K docs when I was only expecting 91k :)

no, index on crc32 cannot be unique, because of it will block adding
site mirrors into url table.

-- 
Regards, Sergey aka gluke.


__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: What proxy software on a freebsd machine?

2000-11-09 Thread Jim Smyser

Author: Jim Smyser
Email: [EMAIL PROTECTED]
Message:
I used Squid.



Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: problem

2000-11-09 Thread Jim Smyser

Author: Jim Smyser
Email: [EMAIL PROTECTED]
Message:
Sounds more like a Mysql configuration issue then a mngosearch one. I would look at 
the port number MySql is configured to use and make sure maybe indexer.conf is 
directed to the same port. Not a whole lot of info you provided to make any other more 
educated guess.


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: syntax to remove stopword table Mysql?

2000-11-09 Thread Jim Smyser

Author: Jim Smyser
Email: [EMAIL PROTECTED]
Message:
What I thought and tried, but h say doesn't exists.
---snip-

Greetings,

Doesn't sound like you are loged into Mysql. Follow these steps if you like to remove 
the stopword table:

sh$> mysql -h localhost -p
password:
mysql> use udmsearch #database name table is in
mysql> DROP TABLE stopword
mysql> exit

Jim

Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: UdmSearch PHP Frontend - is it possible to search by the URL, not by keyword listed in "dict"?

2000-11-09 Thread Ari Shomair

 is it possible to search by the URL, not by keyword listed in "dict"? It's
a file database, and the "doct" table doesn't contain over half of the file
names, so although I know a link to a file does exist the search engine
doesn't pick it up. Any ideas how to fix this?
Thanks
Ari

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




RE: UdmSearch: Webboard: Performance: cache db

2000-11-09 Thread Paul Stewart

I'm just following up.. I did an SQL command and it took quite a while.. can
anyone tell me if the following response time is normal?  I'm trying to
narrow down the problem (yes, I'm doing to do debug output shortly as well -
just short on time).

mysql> UPDATE url SET next_index_time=next_index_time+60*60*24*7;
Query OK, 339890 rows affected (1 min 27.09 sec)

Doesn't this seem slow?  This is MySQL 3.23.27beta

Thanks,

Paul Stewart
Nexicom Inc.
http://www.nexicom.net

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
Behalf Of Paul Stewart
Sent: Wednesday, November 08, 2000 3:07 PM
To: gluke; [EMAIL PROTECTED]
Subject: RE: UdmSearch: Webboard: Performance: cache db


Actually... it seems to have settled down..

You can check it and let me know what you think: www.canadian-links.com :)

I accidentally blew part of the OS away yesterday and had to reinstall a
good part of FreeBSD by hand.  Wondering if something was dragging my server
that i was unaware of been off on a project lately so haven't checked
the server much but seems better...

Thanks,

Paul Stewart


-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
Behalf Of gluke
Sent: Wednesday, November 08, 2000 3:47 AM
To: [EMAIL PROTECTED]
Subject: UdmSearch: Webboard: Performance: cache db


Author: gluke
Email: [EMAIL PROTECTED]
Message:
This is very strange.
Please uncomment
#define DEBUG_SQL and
#define DEBUG_SEARCH
in the beginning of sql.c, recompile mnogosearch and execute search.cgi from
command line. Then send us its debug output.
Also please send us indexer -S output.

Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: What proxy software on a freebsd machine?

2000-11-09 Thread Ari

Author: Ari
Email: [EMAIL PROTECTED]
Message:
I'm just wondering what proxy software I should use in conjunction with udmSearch (for 
ftpsearch.conf). Which is known to work best? 
Thanks
Ari


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: BUG? 502 error when indexing using ftpsearch.conf file

2000-11-09 Thread Adrift

Author: Adrift
Email: [EMAIL PROTECTED]
Message:
I get a "Command not implemented" (502) code whenever I try to index a ftp site - the 
indexer doesn't give me the error, but when I check the mysql database in the URL 
table I find that only the address of the site has been indexed, no files, with that 
error attached to it. Is this a bug?


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: PgSQL: DELETE INDEX url_url;

2000-11-09 Thread The Hermit Hacker

On Thu, 9 Nov 2000, Alexander Barkov wrote:

> Don't forget to recreate this index before starting indexer.
> As far as url_url index is UNIQUE this does not allow indexer to add
> the same link several time. If you remove index, the same documents
> might be added several times.

can this unique index not be based on the crc32 value instead?  that might
explain why I'm up to 140K docs when I was only expecting 91k :)


> 
> 
> 
> The Hermit Hacker wrote:
> > 
> > Morning all ...
> > 
> > Just a hint for anyone using udmsearch with PostgreSQL ... if you
> > find performance is atrociuos, there is a relatively simple fix:
> > 
> > DELETE INDEX url_url;
> > 
> > I run the PostgreSQL.org site, and indexing it, with its mailing
> > lists, has so far generated a database that currently looks like, and is
> > growing:
> > 
> > sbin/indexer -S etc/udmsearch.conf
> > 
> >   UdmSearch statistics
> > 
> > StatusExpired  Total
> >-
> >  0  77516  77722 Not indexed yet
> >200503  60508 OK
> >301  0104 Moved Permanently
> >302  0 27 Moved Temporarily
> >304  0   4179 Not Modified
> >401  0  2 Unauthorized
> >403  0 17 Forbidden
> >404  0666 Not found
> >500  0 26 Internal Server Error
> >503 10 50 Service Unavailable
> >-
> >  Total  78029 143301
> > 
> > Due to a limitation in current PgSQL with its LIKE query
> > optimizer, it "mis-optimizes" the query so that an EXPLAIN looks good, but
> > the query itself will take several minutes to run on a Dual-PIII 450Mhz
> > with fast drives ...
> > 
> > Getting rid of the INDEX helps keep the numbers lower, but am
> > looking forward to us getting the optimizer fixed :)
> 
> 

Marc G. Fournier   ICQ#7615664   IRC Nick: Scrappy
Systems Administrator @ hub.org 
primary: [EMAIL PROTECTED]   secondary: scrappy@{freebsd|postgresql}.org 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: syntax to remove stopword table Mysql?

2000-11-09 Thread Rick Hatfield

Author: Rick Hatfield
Email: 
Message:
What I thought and tried, but h say doesn't exists. And I know it exists because 
searches are complaining about certain terms. Oh well.


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: problem

2000-11-09 Thread nobo

Author: nobo
Email: 
Message:
#2002: Can't connect to local MySQL server through socket '/var/lib/mysql/mysql.sock' 
(111) 
???


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: syntax to remove stopword table Mysql?

2000-11-09 Thread gluke

Author: gluke
Email: [EMAIL PROTECTED]
Message:
SQL syntax for that is:
DROP TABLE table_name;


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: syntax to remove stopword table Mysql?

2000-11-09 Thread Rick Hatfield

Author: Rick Hatfield
Email: 
Message:
Could someone give me the command line syntax to remove the stopword table from the 
database using mysql on unix?

Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: #1044: Access denied for user: '@localhost' to database 'udmsearch'

2000-11-09 Thread Ari

Author: Ari
Email: [EMAIL PROTECTED]
Message:
This error messaage means your SQL database backend is not set up right. Make sure to 
set up a new mysql user which has full access to the database 'udmsearch', and the the 
config files change the url of your sever to: 
mysql://user:password@localhost/dbname/
('udmsearch' is the dbname in the error you gave)
Ari


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: Cache Mode Indexer Limitation since 3.1.5

2000-11-09 Thread Alexander Barkov

Fredy Kuenzler wrote:
> I'm currently using v.3.1.8 in cache mode, and I wonder whether
> the limitation of only one simultanious indexer is still the
> case. My bandwidth would allow more.

We are working on this task. Right now you may just install several
instances of search engine into different catalogs. Then put the same
indexer.conf into /etc directory of all instances. After running several
indexers just run splitter in main installation difrectory. Then remove
logs. Then take logs from the second directory, run splitter again and
remove logs. And so on.


> I also wonder whether it makes sense to run indexer and splitter
> simultanious.

It is not implemented yet, but surely will be done soon.
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




RE: UdmSearch: custom markup file

2000-11-09 Thread Hargraves, Christian

That will work, except then I can't search for files that have certain tags.


Just a thought. You may try to run external parser which can
convert XML into HTML.


"Hargraves, Christian" wrote:
> 
> Hi all!!
> 
> Remember me? I was wondering how hard it would be to have the
> indexer read in a file describing the document type it is indexing.
> For example I want to be able to index XML. How hard would it be to
> 1) create a plugin for it to parse XML based on a DTD
> 2) Add a feature to the indexer that reads in a Document Type Definition
> file and then indexes based on that.
> 
> I know there are obviously some DB integration issues, but I just want
this
> to be in the current cache-mode. Once all documents are indexed, I want to
> be able to search on files that have certian tags. For example, I want to
to
> search for files that have the tag  and where inside that tag
the
> words 'free', 'gpl', 'xml' exist.
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: custom markup file

2000-11-09 Thread Alexander Barkov

Just a thought. You may try to run external parser which can
convert XML into HTML.


"Hargraves, Christian" wrote:
> 
> Hi all!!
> 
> Remember me? I was wondering how hard it would be to have the
> indexer read in a file describing the document type it is indexing.
> For example I want to be able to index XML. How hard would it be to
> 1) create a plugin for it to parse XML based on a DTD
> 2) Add a feature to the indexer that reads in a Document Type Definition
> file and then indexes based on that.
> 
> I know there are obviously some DB integration issues, but I just want this
> to be in the current cache-mode. Once all documents are indexed, I want to
> be able to search on files that have certian tags. For example, I want to to
> search for files that have the tag  and where inside that tag the
> words 'free', 'gpl', 'xml' exist.
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: PgSQL: DELETE INDEX url_url;

2000-11-09 Thread Alexander Barkov

Don't forget to recreate this index before starting indexer.
As far as url_url index is UNIQUE this does not allow indexer to add
the same link several time. If you remove index, the same documents
might be added several times.



The Hermit Hacker wrote:
> 
> Morning all ...
> 
> Just a hint for anyone using udmsearch with PostgreSQL ... if you
> find performance is atrociuos, there is a relatively simple fix:
> 
> DELETE INDEX url_url;
> 
> I run the PostgreSQL.org site, and indexing it, with its mailing
> lists, has so far generated a database that currently looks like, and is
> growing:
> 
> sbin/indexer -S etc/udmsearch.conf
> 
>   UdmSearch statistics
> 
> StatusExpired  Total
>-
>  0  77516  77722 Not indexed yet
>200503  60508 OK
>301  0104 Moved Permanently
>302  0 27 Moved Temporarily
>304  0   4179 Not Modified
>401  0  2 Unauthorized
>403  0 17 Forbidden
>404  0666 Not found
>500  0 26 Internal Server Error
>503 10 50 Service Unavailable
>-
>  Total  78029 143301
> 
> Due to a limitation in current PgSQL with its LIKE query
> optimizer, it "mis-optimizes" the query so that an EXPLAIN looks good, but
> the query itself will take several minutes to run on a Dual-PIII 450Mhz
> with fast drives ...
> 
> Getting rid of the INDEX helps keep the numbers lower, but am
> looking forward to us getting the optimizer fixed :)
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: symlinks on ftp:// urls

2000-11-09 Thread Alexander Barkov

Kir and Igor, can you explain it, guys?


Mario Lang wrote:
> 
> Hi.
> 
> I have reported this already at 3.1.6.
> I am using now 3.1.8 and the indexer doesn't follow
> symbolic links on ftp servers.
> 
> Did anyone experience the same thing. When will this be
> fixed. Our mirror ftp server heavily relays on symlinking
> and now I am kinda stuck.
>
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: upgrading from 3.0x to 3.1x

2000-11-09 Thread Alexander Barkov

Paul Stewart wrote:
> 
> HI there...
> 
> I read somewhere that if I'm upgrading an install from 3.0x to 3.1x versions
> that I have to reindex my entire system?  Please tell me this isn't true...
> I have a 3.0x system with crc-multi db and want to upgrade to crc-multi as
> well on 3.1x version.  I do not plan to use db cache mode at this time
> (already tried it and found it slower than crc-multi method).

You have either to reindex or change "url" table structure using ALTER
TABLE
  SQL command.
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: #1044: Access denied for user: '@localhost' to database 'udmsearch'

2000-11-09 Thread Insider

Author: Insider
Email: 
Message:
#1044: Access denied for user: '@localhost' to database 'udmsearch' 

Where's the Template File for the search.cgi ?


Reply: 

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re[2]: UdmSearch: Webboard: Performance: cache db

2000-11-09 Thread Sergey Kartashoff

Hi!

Thursday, November 09, 2000, 12:46:02 AM, you wrote:

PS> Yup...i have some  more things to try...:)

ok, we need debug info. In my previus mail i told how you can get it.
Thank you.

One more question: can you look at CPU and HDD activity while
search.cgi doing its long 23-seconds search ?

-- 
Regards, Sergey aka gluke.


__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




Re: UdmSearch: Bug report

2000-11-09 Thread Sergey Kartashoff

Hi!

Thursday, November 09, 2000, 8:53:09 AM, you wrote:

VG> UdmSearch version: 3.1.7
VG> Platform:  Pentium III
VG> OS:freebsd
VG> Database:  mysql-se~3.22.32
VG> Statistics:


VG>  ./indexer -a
VG> Indexer[85464]: indexer from UdmSearch v.3.1.7/MySQL started with 
'/usr/local/etc/udmsearch/indexer.conf'
VG> Indexer[85464]: [1] Error: '#1054: Unknown column 'crc32' in 'field list''
VG> I'm not sure that is a bug, but i have no idea to resolve this..
VG> Thanks for help

it seems to me that you are using old 3.0 database with 3.1 indexer.
You need to recreate at least url table from create.txt script.

-- 
Regards, Sergey aka gluke.


__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]