UdmSearch: Webboard: How to Use the Server Table with mnoGoSearch 3.1.10

2001-02-13 Thread Rich Wall

Author: Rich Wall
Email: [EMAIL PROTECTED]
Message:
Hi,
I'm trying to set up the indexer so that it reads the servers from the database.
My table and it's contents are appended below.
The indexer runs but it...
a) Doesn't store *any* words in the dictionary tables
b) Doesn't index all the documents on the server.
See database stats appended below.

When I add the server into indexer.conf everything works fine (~5500 urls stored)

Rich Wall.

---

# phpMyAdmin MySQL-Dump
# http://phpwizard.net/phpMyAdmin/
#
# Host: elkie.cwn.org.uk Database : mnogosearch
# 

#
# Table structure for table 'server'
#

CREATE TABLE server (
   rec_id int(11) NOT NULL auto_increment,
   active int(11) DEFAULT '0' NOT NULL,
   url varchar(128) NOT NULL,
   period int(11) DEFAULT '604800' NOT NULL,
   tag varchar(11) NOT NULL,
   category varchar(11) NOT NULL,
   charset varchar(16) NOT NULL,
   lang char(2) NOT NULL,
   basic_auth varchar(64) NOT NULL,
   proxy varchar(64) NOT NULL,
   proxy_auth varchar(64) NOT NULL,
   maxhops int(11) DEFAULT '' NOT NULL,
   gindex int(11) DEFAULT '1' NOT NULL,
   follow int(11) DEFAULT '1' NOT NULL,
   deletebad int(11) DEFAULT '0' NOT NULL,
   use_robots int(11) DEFAULT '1' NOT NULL,
   delete_no_srv int(11) DEFAULT '1' NOT NULL,
   use_clones int(11) DEFAULT '1' NOT NULL,
   descweight int(11) DEFAULT '2' NOT NULL,
   keywordweight int(11) DEFAULT '2' NOT NULL,
   titleweight int(11) DEFAULT '2' NOT NULL,
   bodyweight int(11) DEFAULT '1' NOT NULL,
   urlweight int(11) DEFAULT '0' NOT NULL,
   urlhostweight int(11) DEFAULT '0' NOT NULL,
   urlpathweight int(11) DEFAULT '0' NOT NULL,
   urlfileweight int(11) DEFAULT '0' NOT NULL,
   correct_factor int(11) DEFAULT '1' NOT NULL,
   incorrect_factor int(11) DEFAULT '1' NOT NULL,
   number_factor int(11) DEFAULT '1' NOT NULL,
   alnum_factor int(11) DEFAULT '1' NOT NULL,
   min_word_length int(11) DEFAULT '0' NOT NULL,
   max_word_length int(11) DEFAULT '32' NOT NULL,
   max_net_errors int(11) DEFAULT '16' NOT NULL,
   net_delay_time int(11) DEFAULT '86400' NOT NULL,
   read_timeout int(11) DEFAULT '90' NOT NULL,
   PRIMARY KEY (rec_id),
   UNIQUE srv_url (url)
);

#
# Dumping data for table 'server'
#

INSERT INTO server VALUES ( '1', '1', 'http://www.cwn.org.uk/', '6', '', '', '', 
'', '', '', '', '1000', '1', '1', '0', '1', '1', '1', '2', '2', '2', '1', '0', '0', 
'0', '0', '1', '1', '1', '1', '0', '32', '100', '3600', '60');



--
Database statistics

StatusExpired  Total
   -
   200  0380 OK
   301  0  4 Moved Permanently
   403  0  1 Forbidden
   404  0  7 Not found
   -
 Total  0392
--

Reply: <http://search.mnogo.ru/board/message.php?id=1363>

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: Server table in 3.1.10 (MySQL)

2001-02-14 Thread Rich Wall

Author: Rich Wall
Email: [EMAIL PROTECTED]
Message:
I had the same problem.
Another problem I have, is that when using the server table, indexer walks through the 
site OK, but doesn't store any word data.
I can't even get it working with the two default server entries.(http://web.izhcom.ru/ 
, and http://www.udmnet.ru/ )
As I said in a previous posting, every thing works fine when I enter the servers 
directly into indexer.conf.
I can't find any sign of anyone else having problems, so I wonder if I have just 
missed something obvious.

I have appended my server table and indexer.conf

Rich Wall.

--
Database mnogosearch - table server
# phpMyAdmin MySQL-Dump
# http://phpwizard.net/phpMyAdmin/
#
# Host: elkie.cwn.org.uk Database : mnogosearch
# 

#
# Table structure for table 'server'
#

CREATE TABLE server (
   rec_id int(11) NOT NULL auto_increment,
   active int(11) DEFAULT '0' NOT NULL,
   url varchar(128) NOT NULL,
   period int(11) DEFAULT '604800' NOT NULL,
   tag varchar(11) NOT NULL,
   category varchar(11) NOT NULL,
   charset varchar(16) NOT NULL,
   lang char(2) NOT NULL,
   basic_auth varchar(64) NOT NULL,
   proxy varchar(64) NOT NULL,
   proxy_auth varchar(64) NOT NULL,
   maxhops int(11) DEFAULT '' NOT NULL,
   gindex int(11) DEFAULT '1' NOT NULL,
   follow int(11) DEFAULT '1' NOT NULL,
   deletebad int(11) DEFAULT '0' NOT NULL,
   use_robots int(11) DEFAULT '1' NOT NULL,
   delete_no_srv int(11) DEFAULT '1' NOT NULL,
   use_clones int(11) DEFAULT '1' NOT NULL,
   descweight int(11) DEFAULT '2' NOT NULL,
   keywordweight int(11) DEFAULT '2' NOT NULL,
   titleweight int(11) DEFAULT '2' NOT NULL,
   bodyweight int(11) DEFAULT '1' NOT NULL,
   urlweight int(11) DEFAULT '0' NOT NULL,
   urlhostweight int(11) DEFAULT '0' NOT NULL,
   urlpathweight int(11) DEFAULT '0' NOT NULL,
   urlfileweight int(11) DEFAULT '0' NOT NULL,
   correct_factor int(11) DEFAULT '1' NOT NULL,
   incorrect_factor int(11) DEFAULT '1' NOT NULL,
   number_factor int(11) DEFAULT '1' NOT NULL,
   alnum_factor int(11) DEFAULT '1' NOT NULL,
   min_word_length int(11) DEFAULT '0' NOT NULL,
   max_word_length int(11) DEFAULT '32' NOT NULL,
   max_net_errors int(11) DEFAULT '16' NOT NULL,
   net_delay_time int(11) DEFAULT '86400' NOT NULL,
   read_timeout int(11) DEFAULT '90' NOT NULL,
   PRIMARY KEY (rec_id),
   UNIQUE srv_url (url)
);

#
# Dumping data for table 'server'
#

INSERT INTO server VALUES ( '1', '1', 'http://web.izhcom.ru/', '6', 'TT', 'AA', 
'koi8-r', 'ru', '', '', '', '1000', '1', '1', '0', '1', '1', '1', '2', '2', '2', '1', 
'0', '0', '0', '0', '1', '1', '1', '1', '0', '32', '100', '3600', '60');
INSERT INTO server VALUES ( '2', '1', 'http://www.udmnet.ru/', '6', 'GG', 'BB', 
'koi8-r', 'ru', '', '', '', '1000', '1', '1', '0', '1', '1', '1', '2', '2', '2', '1', 
'0', '0', '0', '0', '1', '1', '1', '1', '0', '32', '100', '3600', '60');

--

# This is a minimal sample indexer config file

DBAddr mysql://foo:bar@localhost/udmsearch/

Server  http://localhost/

# Allow some known extensions and directory index
Allow *.html *.htm *.shtml *.txt */ 

# Disallow everything else
Disallow *

--

Reply: <http://search.mnogo.ru/board/message.php?id=1393>

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: Maximum execution time of 30 seconds exceeded

2001-02-14 Thread Rich Wall

Author: Rich Wall
Email: [EMAIL PROTECTED]
Message:
> Fatal error: Maximum execution time of 30 seconds exceeded in 
>/home/httpd/html/search/ispell.inc on line 135

There is a setting in 'php.ini' which specifies how long the script can run before it 
dies. (see below)

;;;
; Resource Limits ;
;;;

max_execution_time = 30 ; Maximum execution time of each script, in seconds


Just set this to something higher. Mind you, if it's running that long, it sounds like 
you may have a problem with your database or something.
Also bear in mind that the max execution time is a security feature designed to stop a 
malicious script from hogging server resources.
Finally, if you don't have access to 'php.ini' then you may be able to put this...
'php_value max_execution_time 60'
in an .htaccess file in the root folder of the php script.

Rich Wall.

Reply: <http://search.mnogo.ru/board/message.php?id=1402>

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]




UdmSearch: Webboard: Maximum execution time of 30 seconds exceeded

2001-02-15 Thread Rich Wall

Author: Rich Wall
Email: [EMAIL PROTECTED]
Message:
>I do not know if my mysql has a problem. If I use mnogo without >Affix and Spell and 
>there are no problem, but time for look for is 1 >second / 1 minute (there are 25 
>urls into de DB) (Is this >usual?). I want to make more quick this program, I 
>thinked that >Affix and Spell was the solition.

I've just been trying out the ispell stuff and have found that it works well if you 
search for the shortest word variation, but doesn't work at all if you search for the 
longer word variations.
eg searching for 'fast' returns lots of results, including those containing 'faster'...
but searching for 'faster' doesn't return any results at all.

Did you or any one else have this problem?

i'm using the 'british.xlg' dictionary and the 'english.aff' affix file, both stored 
in the database.

Rich Wall.

Reply: <http://search.mnogo.ru/board/message.php?id=1428>

__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]