Webboard: Can't find search.cgi

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Hello,
 
 I'm installing on SUSE Linux.  I ran the install script, it went fine.  Did the 
make/make install.  However when I looked for the search.cgi it was not to be found.  
Is this a separate download/install  /  am I missing something?
 
 Any help much appreciated.
 


It is in /bin directory of mnogosearch installation.
Usually /usr/local/mnogosearch/bin/


Reply: http://www.mnogosearch.org/board/message.php?id=4943

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Search results contains deleted pages...

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Hi,
 
 The search results on our web site contain links to pages that were deleted. If I 
click on one of those results, I obtain a 404.
 
 I tried to use DeleteBad=yes in the indexer.conf and reindexing the whole site 
(indexer -a), but these broken links still appear.
 
 What could I do to remove these links from the results?
 

You can delete them using indexer with these arguments:

./indexer -Cw -s 404


However, this looks like a bug. Could you try the following please:
let's say http://site/bad.html is a page which doesn't longer
exits on your server but still can be found using search.
Run the following:

./indexer -amv6 -u http://site.bad.html

and post it's output.



Reply: http://www.mnogosearch.org/board/message.php?id=4944

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: index.php disappeared from url

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Indexer stores urls in DB without index.php. I mean that real url is 
http://www.xxx.ru/index.php?act=todoamp;smth=1 but indexer cuts index.php off. And 
in DB we see http://www.xxx.ru/?act=todoamp;smth=1
 
 What can I do with it ?

This is either the result of incorrect links on your pages
or the result of bug. The idea is to find on which page
this wrong link appeared first time. 

Use:

SELECT referrer FROM url WHERE url='wrong url';
SELECT url FROM url WHERE rec_id=RRR;

  (where RRR is a result from the first query).

Then take a look into the source of this page
and check whether all links on it are correct.


Reply: http://www.mnogosearch.org/board/message.php?id=4946

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Indexing hostnames and domainnames

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Yes but I want to stop it splitting the string quot;your-site.comquot; up into 
different words. I'd like to configure the indexer to treat it as a literal string. 
Is there a way to do that?
 
  

Unfortunately no.


Reply: http://www.mnogosearch.org/board/message.php?id=4947

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: Indexer caught in a loop?

2002-06-12 Thread Cal Zemelman

I had Alias http://www.myproductionsite.com
file:/Library/Webserver/Documents  (Mac OS X setup filesystem)
Then Server http://www.myproductionsite.com/;

Actually, I noticed it tended to get hung-up inside one specific folder.
 I moved that one out and now it seems to  get farther before getting an
Alias error.  I can't think of any sort of thing that could this...maybe
there is a symlink that creates a circular reference in the folder?  The
folder I moved out is only one of a 20 or so on the site and it has quite
a few folders inside it and a couple hundred HTML pages, so it isn't easy
to debug.

Now it simply stops indexing with Error in Aliased URL: xxx but I
haven't really looked into this new problem now...it is probably something
with that file.  I had a few realy large HTML documents (like 1.5MB
converted Excel spreadsheet) and maxsize was being ignored and the indexer
would seg fault on these...

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Server Table

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 mnoGoSearch v3.1.19
 
 I am just looking at the structure of the server table and cannot find any 
documentation on some of the fields, could someone explain the alnum_factor field
 
 

Take a look into indexer.conf-dist for AlnumFactor explanation.


Reply: http://www.mnogosearch.org/board/message.php?id=4948

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: site and path

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 If I enter the following site into my server table:
 http://members.tripod.com/mysportster/
 how can I stop the indexer following links to
 http://members.tripod.com/


Write this in your indexer.conf:

Server path http://members.tripod.com/

 and any other directories on this domain without physically excluding each 
directory, should I use 'site' or 'path'?

The same with above.


Reply: http://www.mnogosearch.org/board/message.php?id=4950

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: site and path

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Mine is set up like your taking and I use path.  no clue about signal 2.  I am still 
stuck with my front end


Sorry I didn't understand you.


Reply: http://www.mnogosearch.org/board/message.php?id=4951

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: How to exclude URLs?

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Hi there.  On our website, we have a few sections of content we would like people to 
be able to search, including:
 
 * Mailing Lists  /mailinglists/archives/
 * Presentation   /presentations/
 * Meeting Minutes/meeting/past/
 * All of the above   /
 * Everything EXCEPT
   Mailing Lists
 
 
 Right now, the first four are trivial.  I simply provide a pulldown menu for the 
quot;ulquot; form variable which looks something like:
 
 lt;select name=quot;ulquot;gt;
 lt;option value=quot;/quot;gt;Entire Site
 lt;option value=quot;/mailinglists/archives/quot;gt;Mailing Lists
 lt;option value=quot;/presentations/quot;gt;Presentations
 lt;optionv alue=quot;/meeting/past/quot;gt;Meeting Minutes
 lt;/selectgt;
 
 It's the last one which is giving me difficulty.  I tried a number
 of things so far, including:
 
Tags - Couldn't seem to get anything tag-related to work!
   
 I thought Tags or Categories would be the solution...  
 Doing something like this in the quot;indexer.confquot; file:
 
   Server http://lugod.org/
 
   Tag 1
   Server http://www.lugod.org/mailinglists/archives/
 
 etc. Or SOMETHING. :(
 


Try something like this inf your indexer.conf, them
use tag limit in search.htm:


Tag 1
Server http://www.lugod.org/mailinglists/archives/

Tag 2
Server http://www.lugod.org/mailinglists/presentations/

Tag 3
Server http://www.lugod.org/mailinglists/meeting/past/

# All of the above   /
# Everything EXCEPT
# Mailing Lists

Tag 4
Server http://www.lugod.org/



Reply: http://www.mnogosearch.org/board/message.php?id=4952

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: search full URL

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 
When i trying to search full URL like (http://someting.com) it always gave me no 
result, like this
 Search results: http : 275, www : 0, something : 19, com : 0
 does it have something for correct that..? is it in my indexer.conf of in my 
parameter passed in my search.cgi..?
 

First of all, you have to enable URL indexing using 
either URLWeight in 3.1.19 or Section url.* in 3.2.x

Another thing that you probably don't use the same set of stopwords
in both indexer.conf and search.htm. Probably indexer ignores
www and com as stopwords, but search doesn't know that these two
words are stopwords.


Reply: http://www.mnogosearch.org/board/message.php?id=4953

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: How to ignore some HTML tags in body?

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Whenever I run indexer it works fine, but when I search for anything I get results 
like this:
 
 /quot;gt;HOME gt; /articlesquot;gt;ARTICLES gt; 
/articles/southweststates/index.htmlquot;gt;SOUTHWESTSTATES gt; 
/articles/southweststates/Santa_Fe/index.htmlquot;gt;SANTA_FE Carlsbad, New Mexico: 
A Year-Round Treat Santa Fe, New Mexico showPics(0,100,5) ?gt; CARLSBAD, NEW MEXICO 
A...
 
 obviously I dont want the php or those html tags in there, how can I only display 
the body of the document?

If it is your site, there are two ways.
1. Check HTTP_USER_AGENT in your pages and don't display
menus when robot is downloading the page.
2. Add !--UdmComment ... !--/UdmComment-- around menus.
  indexer will skip the text between these tags.



Reply: http://www.mnogosearch.org/board/message.php?id=4954

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Partial Keyword searching (Fuzzy)

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
The simpliest way is to check what kind of SQL queries are sent
to server during search. Can you extract them from queries log?


 Hi Alexander,
 
 I followed the instructions of setting DBMode to multi and substring setting. But, 
the partial searching still does not work. Can you please help.
 
 indexer.conf settings:
 DBHostlocalhost
 DBNameudmsearch
 DBUserfoo
 DBPassbar
 DBMode  multi
 ..
 
 Search.htm settings:
 # Choose storage mode (see explanation in indexer.conf-dist):
 DBMode  multi
 Match:
 lt;SELECT NAME=quot;mquot;gt;
 lt;OPTION VALUE=quot;allquot; SELECTED=quot;$mquot;gt;All
 lt;OPTION VALUE=quot;anyquot; SELECTED=quot;$mquot;gt;Any
 lt;OPTION VALUE=quot;substringquot; SELECTED=quot;$mquot;gt;Substring
 lt;/SELECTgt;
 ..
 
 
 Thanks.
 
 
 Perry Luo
 

Reply: http://www.mnogosearch.org/board/message.php?id=4955

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Problems indexing word doc files

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Still no luck unfortunately, 
 After adding in the type change as suggested. 
 I've also come to run the package today to get all the new files 
 that will be in there now, and all that happened was: 
  
 Indexer[19177]: indexer from mnogosearch-3.1.19/MySQL started with 
 '../etc/indexer.conf' 
 Indexer[19177]: [1] http://myserver.mydomain.co.uk/cv/cvindex 
 Indexer[19177]: [1] No Content-type in 
 'http://myserver.mydomain.co.uk/cv/cvindex'! 
 Indexer[19177]: [1] Done (8 seconds) 
  
 And no indexing of other documents? Anyone got any suggestions, 
 could it be because I'm trying to index files that are physically 
 located in a directory, rather than fetch them via a web url?? 

Does this page work fine when you open it in browser?

Do you have any *.doc files in your database?
Check it using SELECT url FROM url.

Also, seeing in your previous letter, I noticed a mistake:
CheckOnly *.doc 

You should use Allow *.doc ( not CheckOnly *.doc ) if you want
parser to be executed and then it's output to be indexed.






Reply: http://www.mnogosearch.org/board/message.php?id=4956

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Search results cache

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 search results cache has stopped working on my site, no files are created in the 
cache directory. I am using v3.1.19. 
 
 Originally I had set quot;Cache yesquot; in search.htm and it seemed to work fine. 
I then had to move the var directory to a partition with more disk space and it seems 
to have stopped working around that time. 
 Everything else seems to be normal.
 Any ideas why this has stopped?

Just specify new location of your var directory using 
VarDir command.


Reply: http://www.mnogosearch.org/board/message.php?id=4957

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




ISO9000

2002-06-12 Thread

 =?GB2312?B?ytY=?=
To: [EMAIL PROTECTED]
Content-Type: text/html;charset=GB2312
Date: Wed, 12 Jun 2002 22:10:57 +0800
X-Priority: 3
X-Mailer: FoxMail 3.11 Release [cn]

general:ÄúºÃ!


¸ù¾Ý2000°æISO9000×åϵÁбê×¼£¬ÓÉÖйú±ê×¼»¯Ð­»áÓë±±¾©Ìì´óÌì²ÆÖÇÄÜ¿¨ÏµÍ³ÓÐÏÞ¹«Ë¾£¬¹²Í¬ÑÐÖÆ¿ª·¢µÄ
¡¶2000°æISO9000×åÖÊÁ¿¹ÜÀíÌåϵ
±ê×¼Îļþ´¦Àíϵͳ¡·ÊÇISO9000£º2000°æÍÆÐеÄÅäÌ×Èí¼þ£¬Ê¹ÒѾ­Í¨¹ýÈÏÖ¤µÄÆóÒµºÍ¼´½«¼ÓÈëÈÏÖ¤µÄÆóҵʹÓÃÎÒ
ÃǵÄÈí¼þºó£¬Äܹ»¹æ·¶±ê×¼µØ°´ÕÕ
бê×¼ÒªÇó£¬ÍêÕûµØÖÆ×÷ÖÊÁ¿ÌåϵÎļþ¡£

¡¶2000°æISO 
9000×åÖÊÁ¿¹ÜÀíÌåϵ±ê×¼Îļþ´¦Àíϵͳ¡·º­¸ÇÁËISO9000£º2000°æµÄÖÊÁ¿¹ÜÀíÌåϵµÄ»ù´¡Êõ
Óï¡¢Àí½âÒªÇóºÍʵʩҪµãÒÔ¼°ÐÐÒµÖÊ
Á¿ÊÖ²áʾÀý£¨ÖÆÔìÒµ¡¢±ö¹ÝÒµ¡¢ÎïÒµ¹ÜÀí£©£¬²¢Éè¼ÆÁËÆóÒµÖÆ×÷ÖÊÁ¿¹ÜÀíÌåϵÎļþµÄ±ê×¼²Î¿¼Ä£°å£¬·½±ãʵ
Óã¬Ò»Ä¿ÁËÈ»¡£Ëü½«ÊÇÆóÒµÄÚÉóÈË
Ô±¡¢×ÉѯÈËÔ±¡¢ÈÏÖ¤ÈËÔ±µÄºÃ°ïÊÖ¡£

×Ü·¢ÐУº±±¾©Ìì´óÌì²ÆÖÇÄÜ¿¨ÏµÍ³ÓÐÏÞ¹«Ë¾
¶¨¼Û£º680Ôª/Ì×
Óʾ֡¢ÒøÐлã¿î½Ô¿É£¬¿îµ½·¢»õ£»Ç뽫»ã¿îƾ֤´«ÕæÖÁ010-6843 4198¡£
Óʾֻã¿îÖÁ£º±±¾©Êк£µíÇø³µ¹«×¯Î÷·29ºÅ
±±¾©Ìì´óÌì²ÆÖÇÄÜ¿¨ÏµÍ³ÓÐÏÞ¹«Ë¾
Óʱࣺ100044
ÒøÐлã¿îÖÁ£º¿ª»§Ãû³Æ£º±±¾©Ìì´óÌì²ÆÖÇÄÜ¿¨ÏµÍ³ÓÐÏÞ¹«Ë¾
¿ª»§ÒøÐУºÕÐÉÌÒøÐб±¾©·ÖÐÐÓªÒµ²¿
Õʺţº0186437510001
»¶Ó­À´µç¡¢À´º¯´¹Ñ¯²¢³ÏÕ÷ÇøÓò´úÀí£¡
ÁªÏµÈË£ºÖ£ÏÈÉú£¬°¬Ð¡½ã
µç»°£º010-68732115£¬68732117£¬68435484
´«Õ棺010-68434198
E-mail£º[EMAIL PROTECTED]

Ïà¹Ø²úÆ·£º
¡¶ÖÊÁ¿¹ÜÀíÌåϵʵսÈí¼þ£¨±ö¹ÝÒµ£©¡·   1200Ôª/Ì×
¡¶Ö°Òµ°²È«½¡¿µ¹ÜÀíÌåϵÎļþ±àдϵͳ¡· 880Ôª/Ì×
¡¶ISO 14000»·¾³¹ÜÀíÌåϵ±ê×¼Îļþ´¦Àíϵͳ¡·880Ôª/Ì×

ÖÂ
Àñ!
   ±±¾©Ìì´óÌì²ÆÖÇÄÜ¿¨ÏµÍ³ÓÐÏÞ¹«Ë¾
   [EMAIL PROTECTED]
   2002-06-12

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: URL table structure / Performance

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 I have just created the index and I noticed an immediate increase in the speed of 
the indexer. 
 
 Will this have any adverse side effects? 

Should not.

 If not why is this not in the standard table structure?

We've added this index in 3.2.x tables structure.
Thanks for suggestion!



Reply: http://www.mnogosearch.org/board/message.php?id=4958

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: Various errors and problems

2002-06-12 Thread Alexander Barkov

Mark Roebuck wrote:
 Hi,
 
 I have posted these questions to the boards, but with no results.
 
 I am using v3.1.19 with mysql and cache mode
 
 1, the ~ character does not exclude words from the search results.
 

You have to choose boolean search type in select box which
has all/any/bool values. Take a look into default template.

 2, cached results are not being saved in the cache directory, cache is set
 to yes.

Search results cache files can be found in /var/cache/ subdirectory
of mnogosearch installation, i.e. /usr/local/mnogosearch/var/cache/
by default. Do you have any files in this directory?

 3, is it safe to create an index on the url table field next_index_time

Yes.

 
 4, how can I stop indexer indexing certain parts of a site, for instance if
 I have http://www.domain.com/ in the server table with follow set to path or
 site how can I stop it indexing http://www.domain.com/directory/
 http://www.domain.com/directory/  but still index all the other
 directories.

Use either robots.txt file or this command in indexer.conf:

Disallow http://www.domain.com/directory/*



___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: conditional output from search.cgi

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Hi, I'm hoping to get some advice about how to tackle a specific situation.
 
 I am indexing a network of 5 domains.  One of these domains requires a subscription. 
 When you search on any of the 5 sites you get results for documents on all 5 
domains.  Tags are used to allow people to only search a single domain if they want.
 
 What I want to do provide html like quot;subscription required-- click here to 
learn morequot; in the search results when the page found is on the subscription 
site.
 
 My initial solution to this problem was to use the php frontend and create a custom 
template variable that would provide me with this functionality.  But I am changing 
servers (from redhat to freeBSD) and using the default freebsd ports has me 
increasing my php version to 4.2.1 and decreasing mnogosearch to 3.1.19-2.  The php 
frontend doesn't work with this combination so I have been looking to switch to using 
search.cgi.


If the problem is only in software versions, you can install
both php and mnogosearch from sources, not from ports.

 So far the best solution I have come up with to provide this 
 functionality on search.cgi is to use javascript to do a check on 
 $DU for the subscription url and write out the required html when I 
 get a match.  Can anyone think of a better option?  I have complete 
 control over the subscription site and can dynamically produce any 
 meta tags I want...

You can use external includes in search.htm, please take a look into
documentation.




Reply: http://www.mnogosearch.org/board/message.php?id=4959

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: search a blob

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 
 can mnogosearch search a datatype of blob in the dbase? i tried running indexer thru 
a table which has a blob datatype and it doesn't go thru it. just wanna make it clear 
if it's possible or not. thanks!

If you mean indexing blobs using HTDB, yes it is possible
in MySQL. Not sure about other databases, as far as most
of them have a special API to access blobs.


Reply: http://www.mnogosearch.org/board/message.php?id=4960

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Disallow Indexing of particular files ?

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 hi all,
 wht i want is just the opposit of this.i am running a site for a news agancy.i want 
to index only the story pages and not the others (ie story.php?someid=123).i tried 
all the changes ni my indexer.conf but all in vain.
 
 thx in advabce if anyone can help me.


The simplies sollution might look like this:

Server http://server/
Allow   *story.php?someid=*
HrefOnly *


Reply: http://www.mnogosearch.org/board/message.php?id=4961

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: 3.2.5 searchd problem

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 When using the searchd daemon with 3.2.5 I get the following error message
 
 #1044: Access denied for user: '@localhost' to database 'mnogosearch'
 
 The entry in search.htm is
 
 SearchdAddr localhost
 
 searchd daemon is started at the machine
 
 startin search.cgi from the command line results in this error:
 
 An error occured!
 #1049: Unknown database 'mnogosearch'
 
 The Database definition in searchd.conf is:
 
 DBAddr  mysql://xxx:xxx@localhost/udm5/?dbmode=cache
 Limit t:tag:lim_tag
 Limit c:cat:lim_cat
 

It means searchd.conf has incorrect user or password
in DBAddr string.


Reply: http://www.mnogosearch.org/board/message.php?id=4962

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: Can not find server?

2002-06-12 Thread Alexander Barkov

alexson wrote:
 Hi
  
 Why do the web browser show  Can not find server after click the 
 search button?
  

Open search.cgi in your browser, then take a look into
View page source. Check what is written in
FORM ACTION=??? tag.


___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: How to ignore a specific directory?

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 I also have this problem, some sites will have hundreds of thousands of pages within 
forums. This make search results a complete mess. 
 
 If you work it out could you let me know.
 


There is no a kind of automatic detection which
can disallow indexing of various forums.
But you can use Disallow command for a particular site.


Reply: http://www.mnogosearch.org/board/message.php?id=4964

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: Webboard: How to ignore a specific directory?

2002-06-12 Thread Alexander Barkov

Penny Rand wrote:
 I have a similar problem. My web directory has more than my main site in
 it but I only want to index the main site. I can do this with:
 
 #Server [subsection] URL [alias]
 Server  http://www.asix.com/ file:/web/html/
 
 But the URL on the results page is incorrect as it does not include the
 html directory in the path.
 For example the directory structure is;
 /web/html/asix/index.html
 But the results URL shows;
 http://www.asix.com/asix/index.html
 

If I understood you correctly, try something like this:

Server http://www.asix.com/ file:/web/html/asix/



___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: path indexing

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
Run for example
  indexer -am -u http://www.fortunecity.com/

indexer will explain why it accepts this page.

 I have the following urls in the server table:
 
 http://members.fortunecity.com/freddie10/FEARLESS/indexx.html  
 http://silverstone.fortunecity.com/austin/679/  
 http://www.fortunecity.com/marina/dockers/270/  
 http://www.fortunecity.com/olympia/mays/139/
 http://www.fortunecity.com/silverstone/tyre/167/smcc/index.htm  
 http://www.fortunecity.com/skyscraper/quantum/1504/  
 
 they are all set to index path, but I find the following urls have been indexed:
 
 http://www.fortunecity.com/
 http://www.fortunecity.com/corporate/about/q3current.shtml
 
 and others.
 
 I thought that using 'path' meant that only pages below the given url would be 
indexed.
 
 I am using v3.1.19 in cache mode
 
 Could anyone explain what is happening.
 
 Thanks

Reply: http://www.mnogosearch.org/board/message.php?id=4965

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: Allow/disallow (no)match ip ???

2002-06-12 Thread Alexander Barkov

Hi!

We already implemented this feature in 3.2.0,
but didn't put it into the documentation yet.

Use this style command:

Realm subnet 129.27.0.*


Mario Lang wrote:
 Hello.
 
 I have a quite complicated allow /disaalow statment structure
 in my indexer.conf.  I recently had the idea, that the
 whole configuration would be alot more simple if I could limit
 the allowed hosts according to a some subnets.
 
 We have a class B subnet here, so if I could do something like
 
 Allow ip 129.27.0.0/16
 
 That would simply alot of things.
 
 Is there any plan to implement this in 3.2.x, or could you
 advice how I should go about writing a patch for that.
 It should be to hard, is it?
 
 



___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: How to set the mnogosearch to support Big5

2002-06-12 Thread Alexander Barkov

Hi!

alexson wrote:
 Hi
  
 How to set the mnogosearch to support big5, how many parameters will be 
 invovled?
 Help me, please.
  


Just use this command in both indexer.conf and search.htm:

LocalCharset big5


As well as this command in search.htm:

BrowserCharset big5

Note that big5 works only in 3.2.x branch of mnogosearch.


___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: Webboard: string search

2002-06-12 Thread Alexander Barkov

To Ramil: any comments?

Does phrase search work?


Chiara Biancheri wrote:
 I mean phase search. I saw it is in the todo list.
 Do you have any idea about the time it would be implemented? 
 (on a large scale: 1 or 6 month, 1 year...
 
 thanks a lot 
 chiara
 
 
 On Tue, 2002-06-11 at 13:20, Aleksey Trubin wrote:
 
[EMAIL PROTECTED] wrote:


Author: chiara
Email: 
Message:

How you search for a complete string?
If it is not implemented, is it in the todo list?

thanks
chiara

Reply: http://www.mnogosearch.org/board/message.php?id=4941


___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: indexer -Cw takes too much time

2002-06-12 Thread Alexander Barkov

What does mysqladmin processlist display?

Are there any active queries being executed by
mnogosearch?


Kreso wrote:
 RedHat 7.2, 2x1Ghz, 512MB, hardware RAID dedicated search maschine,
 no other tasks running. MySQL 3.23.49, default configuration. Maybe
 some mysql tuning would help. Unfortunately, indexer has terminated
 after 3 hours, without doing its job. It seems like a bug to me. Are
 there any known limitations on database size, i.e. on the number
 of URLs that can be indexed?
 
 thanks again,
 Kresimir
 
 
 On Tue, Jun 11, 2002 at 06:21:22PM +0500, Aleksey Trubin wrote:
 
Kreso wrote:


Hallo,

I have a problem with indexer -Cw -t xxx. It takes too much time.
With a database 1.5 million entries it works longer than 2 hours already.
mysqld doesn't take much CPU. Any ideas? mnogosearch v3.1.19 is in the
question. I have created an index on tags before.

thanks,
Kresimir
___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]



 


So... It can depends on many things.
What is your OS, what tasks are executing on the server, mysql version?

 
 ___
 If you want to unsubscribe send unsubscribe general
 to [EMAIL PROTECTED]
 
 
 



___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Re: Indexer caught in a loop?

2002-06-12 Thread Alexander Barkov

   Hi!

Please take a look here:

http://www.mnogosearch.org/board/message.php?id=4946

They have the same problem: urls are stored incorrectly
in the database. Can you please find the source of
error using the same recomendation as in the message
above?

Thanks.

Cal Zemelman wrote:
 I had Alias http://www.myproductionsite.com
 file:/Library/Webserver/Documents  (Mac OS X setup filesystem)
 Then Server http://www.myproductionsite.com/;
 
 Actually, I noticed it tended to get hung-up inside one specific folder.
  I moved that one out and now it seems to  get farther before getting an
 Alias error.  I can't think of any sort of thing that could this...maybe
 there is a symlink that creates a circular reference in the folder?  The
 folder I moved out is only one of a 20 or so on the site and it has quite
 a few folders inside it and a couple hundred HTML pages, so it isn't easy
 to debug.
 
 Now it simply stops indexing with Error in Aliased URL: xxx but I
 haven't really looked into this new problem now...it is probably something
 with that file.  I had a few realy large HTML documents (like 1.5MB
 converted Excel spreadsheet) and maxsize was being ignored and the indexer
 would seg fault on these...
 
 ___
 If you want to unsubscribe send unsubscribe general
 to [EMAIL PROTECTED]
 
 
 



___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: path indexing

2002-06-12 Thread nobody

Author: Alex Barkov
Email: [EMAIL PROTECTED]
Message:
 Run for example
   indexer -am -u http://www.fortunecity.com/
 
 indexer will explain why it accepts this page.
 

Sorry, the proper command is:

indexer -amv6 -u http://www.fortunecity.com/



Reply: http://www.mnogosearch.org/board/message.php?id=4967

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: I'm a beginner and need help

2002-06-12 Thread nobody

Author: Cori
Email: [EMAIL PROTECTED]
Message:
I wrote to your support email this morning and still haven't received a response so I 
decided to try this.  

I am just beginning to use your software and am very confused at how to use it.  I 
have downloaded many websites and saved them on a server.  I would like to search 
through all of the sites and documents for different strings and have the search 
return the list documents where the strings were found.  The sites are not stored in a 
database, but from what I'm reading in your help section and on the Webboard I think I 
need to store them in a database.  I don't know.  

As you can see I'm very confused on how to use your software.  If possible please 
explain how to use it beginning with the Wizard that pops up and what information I 
need to enter to configure your software to do what I want it to do.

Thanks!

Reply: http://www.mnogosearch.org/board/message.php?id=4969

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Webboard: Help with PHP front end MacOS X

2002-06-12 Thread nobody

Author: Clive
Email: [EMAIL PROTECTED]
Message:
Please excuse this ignorant Mac user.

This is where I'm up to:

 I have MySQL running ok

 The mnoGoSearch indexer appears to work

 I can see data in the mnoGoSearch MySQL database
 via phpMyAdmin interface

 I've set the required variables in the search.htm and
 search.php files

 I've built and installed PHP 4.2.1, which seems to be
 working and reports mnoGoSearch Support enabled and 
 mnoGoSearch library version 30119 (via phpMyAdmin)

However, I can't get any results using the supplied PHP front end (3.1.3.5). The $Q 
variable doesn't seem to be set, and no warning/error messages show in the delivered 
search.php page, ie:

 !--notfound--
 Search Time: $SearchTimebr
 Search results:
 small$W/smallHR
 CENTER
 Sorry, but search returned no results.P
 ITry to produce less restrictive search query./I
 /CENTER
 !--/notfound--

...

Can anyone give me some ideas how to track this problem down and fix it?

I'm running mnoGoSearch 3.1.19 on MacOS X 10.1.4 with WebStar 5.1.2.

TIA


-- Clive

Reply: http://www.mnogosearch.org/board/message.php?id=4970

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Problems with 3.2.5 Searching

2002-06-12 Thread Trevor Phillips

I'm having some problems with searching in version 3.2.5, and am unsure 
if they're known bugs, or bits of the config I've messed up. I'm also 
after some clarification on some things not clearly documented.

Although I have toyed with mNoGo in the past, it was missing a few 
features important to us. I've recently installed 3.2.5, which 
supposedly has some of these features now supported, but I'm running 
into some problems.

Firstly, indexing an ordinary site, and using the supplied search.cgi, 
with a minimally modified search.htm, I cannot get the ul search 
filter/restriction to work. I've followed the docs, but none of my 
searches are sub-searches with this pattern match (they all search the 
entire content each time). What's more, after submitting it, the 
SELECTED option is not retained - although this seems to work for other 
select-style fields.

Secondly, I've had a stab at configuring it to support searching custom 
META tags, with mixed results. There seems to be little to no docs on 
how to go about this, but what I've tried is to add some Sections for 
each tag, preceeding the tag name with meta.. eg;

   Section  meta.modified   11128
   Section  meta.modified.position  12128
   Section  meta.modified.email 13128

This seems to've worked to a degree; Searches will now search these 
fields. However, these seem to be global, and I can't figure out how to 
enable or disable them, or how to change the priority weights for them.

The docs talk about using Section in conjunction with wf, but it's 
rather confusing. Hmmm, actually, the main thing that threw me was that 
the example Section declarations are all on one line in the docs, but 
viewing the HTML source, they're on separate lines. And if that's the 
case, then I always need to set a wf field, with one character per 
Section? What if I have 50 Sections?

(It would be nice to be able to assign multiple META tags to being the 
same section, where they are related info, or synonyms for the same 
field...)

I think I'll have to experiment with this a bit more...

-- 
. Trevor Phillips -   http://jurai.murdoch.edu.au/ . 
: Web Technical Administrator -  [EMAIL PROTECTED] : 
| IT Services-  Murdoch University | 
 
| On nights such as this, evil deeds are done. And good deeds, of /
| course. But mostly evil, on the whole. /
 \  -- (Terry Pratchett, Wyrd Sisters)  /


___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]