Webboard: How do I exclude subdirs in indexer.conf?

2001-05-23 Thread Markus Geiger

Author: Markus Geiger
Email: [EMAIL PROTECTED]
Message:
Hi,

I'd like to index http://www.mydomain.com/ with all subdirs
and pages except every subdirectory like:
http://www.mydomain.com/0/ or http://www.mydomain.com/123/ etc.
(only numbers).

In my indexer.conf I said:

Server http://www.mydomain.com/
Realm Regex NoMatch ^http://www\.mydomain\.com/[0-9]*/

but it doesn't work at all :(. With the first line only
everything is fine, but when I add the NoMatch line,
pages that do not start with www.mydomain.com
are indexed as well.

How do I set up my indexer.conf correctly?

tnx!

cu Markus


Reply: http://search.mnogo.ru/board/message.php?id=2255

___
If you want to unsubscribe send unsubscribe general
to [EMAIL PROTECTED]




Russian letter 'io' ('ё')

2001-05-23 Thread Danil Lavrentyuk

Hello!

How does mnoGoSearch treat the Russian letter 'io' ('ё')?

Does it treat this letter as equal to the Russian 'ie' ('е')?
Or not?

Should I use this letter in ispell dictionaries or not?


Danil Lavrentyuk
Communiware.net
Programmer





Re: Russian letter 'io' ('ё')

2001-05-23 Thread Alexander Barkov

Danil Lavrentyuk wrote:
 
 Hello!
 
 How does mnoGoSearch treat the Russian letter 'io' ('ё')?
 
 Does it treat this letter as equal to the Russian 'ie' ('е')?
 Or not?
 
 Should I use this letter in ispell dictionaries or not?


It's not equal to ie, it's considered as a separate letter.

In ispell it's considered as a separate letter too.




Webboard: How do I exclude subdirs in indexer.conf?

2001-05-23 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
To index your site, just use only the first command.
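
The symptom described in the quoted question (pages from other hosts being indexed once the NoMatch line is added) would be consistent with a NoMatch realm accepting every URL that does not match the pattern. That reading is an assumption inferred from the report, not something checked against the indexer source. A minimal Python sketch of it, with the hypothetical helper `in_nomatch_realm` and made-up test URLs:

```python
import re

# Pattern taken from the question. Note that [0-9]* also matches zero digits.
PATTERN = re.compile(r"^http://www\.mydomain\.com/[0-9]*/")


def in_nomatch_realm(url):
    """Assumed NoMatch semantics: the realm covers every URL that does NOT match."""
    return PATTERN.search(url) is None


urls = [
    "http://www.mydomain.com/123/page.html",   # numeric subdirectory
    "http://www.mydomain.com/docs/page.html",  # ordinary subdirectory
    "http://www.elsewhere.example/",           # foreign host (hypothetical)
]

# Under the assumed semantics the numeric subdirectory is rejected,
# but the foreign host is accepted along with the ordinary subdirectory.
accepted = [u for u in urls if in_nomatch_realm(u)]
```

Under this reading the pattern does keep the numeric subdirectories out, but it also lets in anything outside www.mydomain.com, which matches what was reported.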

 Hi,
 
 I'd like to index http://www.mydomain.com/ with all subdirs
 and pages except every subdirectory like:
 http://www.mydomain.com/0/ or http://www.mydomain.com/123/ etc.
 (only numbers).
 
 In my indexer.conf I said:
 
 Server http://www.mydomain.com/
 Realm Regex NoMatch ^http://www\.mydomain\.com/[0-9]*/
 
 but it doesn't work at all :(. With the first line only
 everything is fine, but when I add the NoMatch line,
 pages that do not start with www.mydomain.com
 are indexed as well.
 
 How do I set up my indexer.conf correctly?
 
 tnx!
 
 cu Markus
 

Reply: http://search.mnogo.ru/board/message.php?id=2256





Webboard: compiling 3.1.13 failed on spelld.c

2001-05-23 Thread Dmitry Kostyuchenko

Author: Dmitry Kostyuchenko
Email: [EMAIL PROTECTED]
Message:
FreeBSD 3.3
GNU make 3.79.1
mnoGoSearch 3.1.13

./configure --with-mysql
gmake
...
...
spelld.c: In function `main':  
spelld.c:235: `socklen_t' undeclared (first use this function) 
spelld.c:235: (Each undeclared identifier is reported only once
spelld.c:235: for each function it appears in.)
spelld.c:235: parse error before `addrlen' 
spelld.c:241: `addrlen' undeclared (first use this function)   
gmake[1]: *** [spelld.o] Error 1   
gmake[1]: Leaving directory `/usr/local/src/mnogosearch-3.1.13/src'
gmake: *** [all-recursive] Error 1 

thanks in advance.

Reply: http://search.mnogo.ru/board/message.php?id=2257





Re: Russian letter 'io' ('ё')

2001-05-23 Thread Danil Lavrentyuk

[ On Wed, 23 May 2001, Alexander Barkov wrote: ]

AB  How does mnoGoSearch treat the Russian letter 'io' ('ё')?
AB 
AB  Does it treat this letter as equal to the Russian 'ie' ('е')?
AB  Or not?
AB 
AB  Should I use this letter in ispell dictionaries or not?
AB
AB It's not equal to ie, it's considered as a separate letter.
AB
AB In ispell it's considered as a separate letter too.

I think it's not good to have no choice here, even at the configure stage. :-(
Some pages may be written using 'io', while others may be written
with 'ie' in place of every 'io'. There is a tradition of not using 'io' in
texts.

BTW, it would be good to mention this in the mnoGoSearch documentation, because
ispell dictionaries can be built with or without 'io' (and the default is,
AFAIK, without 'io').
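
To make the kind of choice asked for here concrete: one option would be to fold 'io' into 'ie' on both the indexing and the query side, so that both spellings meet. The Python sketch below only illustrates that idea. It is not a mnoGoSearch feature, and the function name `fold_io` is hypothetical.

```python
# Map Cyrillic 'io' (U+0451, U+0401) to 'ie' (U+0435, U+0415).
# Escapes are used so the letters cannot be confused with Latin look-alikes.
IO_TO_IE = str.maketrans({"\u0451": "\u0435", "\u0401": "\u0415"})


def fold_io(text):
    """Return text with every 'io' replaced by the corresponding 'ie'."""
    return text.translate(IO_TO_IE)


# The same word spelled with and without 'io' folds to one form.
with_io = "\u0451\u043b\u043a\u0430"
without_io = "\u0435\u043b\u043a\u0430"
same = fold_io(with_io) == fold_io(without_io)
```

The folding would have to be applied to indexed text, to queries, and to the dictionary words alike, otherwise the spellings would still diverge somewhere.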


Danil Lavrentyuk
Communiware.net
Programmer





Webboard: Doc Relevance

2001-05-23 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
DR means the number of unique search words found in the document. It is
always 1 if you search for only one word. However, the most relevant
document is always displayed first.

In 3.2 we want to add the ability to display something like a
percentage.
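
Read literally, the definition of DR given above can be sketched as follows. This is an illustration in Python, not the mnoGoSearch implementation; the function name `doc_relevance` and the simple word splitting are assumptions.

```python
import re


def doc_relevance(query, document):
    """Count the distinct query words that occur at least once in the document."""
    doc_words = set(re.findall(r"\w+", document.lower()))
    query_words = set(re.findall(r"\w+", query.lower()))
    return len(query_words & doc_words)


doc = "search engines index pages; a search engine ranks pages"

# A one-word query can never score above 1, however often the word occurs.
one_word = doc_relevance("search", doc)
two_words = doc_relevance("search pages", doc)
```

Because repeated occurrences are not counted, two documents containing the single search word once and fifty times would get the same value under this reading.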


 While performing a search, I realised that the document
 relevancy $DR is always 1. This is rather weird, cos'
 I always thought that the document relevancy value should
 be derived from the search text. How can it be possible that
 a document which contains more occurrences of the search text
 has the same document relevancy value as documents with fewer occurrences?
 
 How should I configure indexer.conf during indexing
 so that the document relevancy can be taken into account
 when a search is issued?
 
 Any help on the matter is much appreciated.
 
 --
 Jenson

Reply: http://search.mnogo.ru/board/message.php?id=2258





Webboard: compiling 3.1.13 failed on spelld.c

2001-05-23 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Replace socklen_t with int

Thanks for reporting.

 FreeBSD 3.3
 GNU make 3.79.1
 mnoGoSearch 3.1.13
 
 ./configure --with-mysql
 gmake
 ...
 ...
 spelld.c: In function `main':  
 spelld.c:235: `socklen_t' undeclared (first use this function) 
 spelld.c:235: (Each undeclared identifier is reported only once
 spelld.c:235: for each function it appears in.)
 spelld.c:235: parse error before `addrlen' 
 spelld.c:241: `addrlen' undeclared (first use this function)   
 gmake[1]: *** [spelld.o] Error 1   
 gmake[1]: Leaving directory `/usr/local/src/mnogosearch-3.1.13/src'
 gmake: *** [all-recursive] Error 1 
 
 thanks in advance.

Reply: http://search.mnogo.ru/board/message.php?id=2259





Tags

2001-05-23 Thread Douglas Hanks

This question must have been asked a thousand times due to the lack of
documentation:  how does one _implement_ keys?

I understand the theory behind keys, but I do not see how you actually
implement them.

If I have a sub-domain browse.baytrder.org, how would I assign it the tag
browse?  After tagging, is it just a matter of appending t=browse to the
search.cgi URL?

Any help or a link to better documentation appreciated.

-- 
- Doug Hanks = dhanks(at)gnu(dot)org
  Finger dhanks(at)gnu.org for GPG key




Re: Russian letter 'io' ('ё')

2001-05-23 Thread Danil Lavrentyuk

[ On Thu, 24 May 2001, Maxime Zakharov wrote: ]

MZ  I think it's not good to have no choice here, even at the configure stage. :-(
MZ  Some pages may be written using 'io', while others may be written
MZ  with 'ie' in place of every 'io'. There is a tradition of not using 'io' in
MZ  texts.
MZ 
MZ  BTW, it would be good to mention this in the mnoGoSearch documentation, because
MZ  ispell dictionaries can be built with or without 'io' (and the default is,
MZ  AFAIK, without 'io').

MZ In my opinion, it is strongly recommended to make Russian affixes for both
MZ cases, because some web pages may contain 'yo' while others use 'e'.

And how could an indexer determine which of them to use for each page?
And how could the search engine determine whether to use
'io'-containing dictionaries when analyzing a user request?

I don't think this is the way.


Danil Lavrentyuk
Communiware.net
Programmer





Bug report

2001-05-23 Thread Trevor Phillips



UdmSearch version: 3.1.12
Platform:          Dell 2400 (Dual PIII 667, 1Gig RAM)
OS:                Debian Linux 2.1 (Kernel 2.2.19)
Database:
Statistics:


We have a Gartner database that has hundreds of URLs referenced internally as relative
URLs starting with "./", i.e. something like "./../../research/etc...".

mnoGoSearch seems to treat that first single period as a double period, i.e. it reports
a broken link pointing to a URL which is one directory higher than it should be.
Clicking on these links works fine in every browser I've tried.
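
For comparison, the resolution that browsers perform can be reproduced with Python's standard `urllib.parse.urljoin`, where a leading "./" leaves the directory unchanged. The base URL and file names below are hypothetical stand-ins for the Gartner pages.

```python
from urllib.parse import urljoin

# Hypothetical page containing the relative links.
base = "http://intranet.example/gartner/a/b/doc.html"

# A leading "./" refers to the current directory, so both forms resolve identically.
with_dot = urljoin(base, "./../../research/report.html")
without_dot = urljoin(base, "../../research/report.html")

# What the report describes: the "./" acting like an extra "../",
# which lands one directory too high.
one_level_too_high = urljoin(base, "../../../research/report.html")
```

Here `with_dot` and `without_dot` both point below /gartner/, while `one_level_too_high` loses that directory, which is the kind of broken link described in the report.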






Webboard: Cookies Support

2001-05-23 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Hello!

It's still on the TODO list.

One possible solution is probably to hack the function
UdmAddURL() and cut SESS=XXX substrings before inserting into
the database.
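
The idea of cutting SESS=XXX before storing a URL can be sketched as below. This is Python for illustration only, not a patch to UdmAddURL(); the function name `strip_session` and the example URLs are assumptions.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def strip_session(url, param="SESS"):
    """Return url with the given query parameter removed, other parameters kept."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != param
    ]
    # urlencode may re-encode special characters in the remaining values.
    return urlunsplit(parts._replace(query=urlencode(kept)))


# Two visits with different session ids collapse to the same stored URL.
first = strip_session("http://www.example.com/page.php?SESS=ksjfhsjkdf45zefD&id=3")
second = strip_session("http://www.example.com/page.php?id=3&SESS=zzz")
```

Parsing the query string rather than cutting a raw substring avoids leaving a dangling "?" or "&" behind when the session parameter is first or last.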


 I have posted several times asking about cookie support in mnoGoSearch.
 Someone answered that it was on the TODO list.
 
 I would just like to know whether the coders have an idea of when, because
 I'm really interested ;p I'm using sessions (PHP) on my website, and if the
 browser (like the parser) doesn't support cookies, the session is passed in
 the URL (like SESS=ksjfhsjkdf45zefD). The problem is that when I try to delete
 only the session in my database, MySQL tells me that the URL is already
 present. After some research I discovered that (it goes without saying)
 mnoGoSearch treats different sessions as different web pages ...
 
 I hope someone has understood me ;p And thank you if someone has a solution.
 
 Cheers,
 

Reply: http://search.mnogo.ru/board/message.php?id=2262





Re: Webboard: Storing special characters

2001-05-23 Thread Matthew Sullivan

Alexander Barkov wrote:

 Author: Alexander Barkov
 Email: [EMAIL PROTECTED]
 Message:
 Hi!

 Find #define WORDCHAR  ... in the sources, and add the _ sign
 to its content. Then recompile both indexer and search.cgi.

 However, please note that ALL _'s will be considered
 as letters and you will not be able to search for word
 parts around the _ sign.

Alex,

New functionality request for you ...  a selectable set of characters that
are indexed or not depending on whether they are surrounded by letters.

i.e. 234.234 would be indexed as '234.234' and 'lazy dog.' would be indexed as
'lazy' and 'dog' (note no full stop at the end of 'dog').

The list of characters would be selectable, and it would also solve other
problems like 'TCP/IP' being indexed as 'TCP' and 'IP', whereas in most other
cases you would want to ignore the '/' character. The search interface
would search with what it is given, not stripping all the punctuation characters.
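
The requested behaviour can be sketched with a regular expression that keeps a punctuation character only when it sits between word characters. This is a Python illustration of the request, not mnoGoSearch code; `make_tokenizer` and its `joiners` parameter are hypothetical, and only ASCII letters and digits are treated as word characters here.

```python
import re


def make_tokenizer(joiners="./"):
    """Build a tokenizer that keeps the given characters only inside a word."""
    # joiners must not be empty, otherwise the character class is invalid.
    inner = "[" + re.escape(joiners) + "]"
    # A word is a run of letters/digits, optionally continued by
    # <joiner><letters/digits> any number of times.
    pattern = re.compile(r"[A-Za-z0-9]+(?:%s[A-Za-z0-9]+)*" % inner)
    return pattern.findall


tokenize = make_tokenizer()

number = tokenize("234.234")      # the inner '.' is kept
sentence = tokenize("lazy dog.")  # the trailing '.' is dropped
protocol = tokenize("TCP/IP")     # the inner '/' is kept
```

Passing a different `joiners` string corresponds to the selectable character list in the request; the same tokenizer would have to be applied to queries so that 'TCP/IP' is searched as one word.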



Yours

Matthew


