Webboard: How do I exclude subdirs in indexer.conf?
Author: Markus Geiger
Email: [EMAIL PROTECTED]
Message:
Hi,

I'd like to index http://www.mydomain.com/ with all its subdirectories and pages, except subdirectories whose names consist only of digits, such as http://www.mydomain.com/0/ or http://www.mydomain.com/123/.

In my indexer.conf I have:

    Server http://www.mydomain.com/
    Realm Regex NoMatch ^http://www\.mydomain\.com/[0-9]*/

but it doesn't work at all :(. With only the first line everything is fine, but when I add the NoMatch line, pages that do not start with www.mydomain.com are indexed as well.

How do I set up my indexer.conf correctly?

tnx! cu
Markus

Reply: http://search.mnogo.ru/board/message.php?id=2255
___
If you want to unsubscribe send unsubscribe general to [EMAIL PROTECTED]
Russian letter 'io' ('ё')
Hello!

How does mnoGoSearch treat the Russian letter 'io' ('ё')? Does it treat this letter as equal to the Russian 'ie' ('е'), or not?

Should I use this letter in ispell dictionaries or not?

Danil Lavrentyuk
Communiware.net Programmer
Re: Russian letter 'io' ('ё')
Danil Lavrentyuk wrote:
> Hello!
>
> How does mnoGoSearch treat the Russian letter 'io' ('ё')? Does it treat
> this letter as equal to the Russian 'ie' ('е'), or not?
>
> Should I use this letter in ispell dictionaries or not?

It's not equal to 'ie'; it is treated as a separate letter.

In ispell it is treated as a separate letter too.
Webboard: How do I exclude subdirs in indexer.conf?
Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
To index your site, just use only the first command.

> Hi,
>
> I'd like to index http://www.mydomain.com/ with all its subdirectories and
> pages, except subdirectories whose names consist only of digits, such as
> http://www.mydomain.com/0/ or http://www.mydomain.com/123/.
>
> In my indexer.conf I have:
>
>     Server http://www.mydomain.com/
>     Realm Regex NoMatch ^http://www\.mydomain\.com/[0-9]*/
>
> but it doesn't work at all :(. With only the first line everything is fine,
> but when I add the NoMatch line, pages that do not start with
> www.mydomain.com are indexed as well.
>
> How do I set up my indexer.conf correctly?
>
> tnx! cu
> Markus

Reply: http://search.mnogo.ru/board/message.php?id=2256
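The symptom Markus describes (foreign pages being indexed once the NoMatch line is added) is what one would expect if a "does not match" rule is evaluated on its own, since every URL outside the pattern then passes. The following Python sketch only illustrates that logic; it is not mnoGoSearch's implementation, and the function names and the stricter `[0-9]+` variant are assumptions made for the example.

```python
import re

# Pattern copied from the question. Note that [0-9]* also matches zero digits.
NUMERIC_DIR = re.compile(r"^http://www\.mydomain\.com/[0-9]*/")

# Hypothetical stricter variant that requires at least one digit.
NUMERIC_DIR_STRICT = re.compile(r"^http://www\.mydomain\.com/[0-9]+/")

SERVER_PREFIX = "http://www.mydomain.com/"


def nomatch_alone(url):
    """A 'does not match' test by itself: true for ANY URL outside the pattern."""
    return NUMERIC_DIR.search(url) is None


def wanted(url):
    """Intended rule: on the server AND not inside a digits-only top-level directory."""
    return url.startswith(SERVER_PREFIX) and NUMERIC_DIR_STRICT.search(url) is None
```

With these definitions, `nomatch_alone("http://www.other.com/page.html")` is true, which mirrors the unwanted behaviour, while `wanted()` rejects both that URL and `http://www.mydomain.com/123/a.html`.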
Webboard: compiling 3.1.13 failed on spelld.c
Author: Dmitry Kostyuchenko
Email: [EMAIL PROTECTED]
Message:
FreeBSD 3.3
GNU make 3.79.1
mnoGoSearch 3.1.13

    ./configure --with-mysql
    gmake
    ...
    ...
    spelld.c: In function `main':
    spelld.c:235: `socklen_t' undeclared (first use this function)
    spelld.c:235: (Each undeclared identifier is reported only once
    spelld.c:235: for each function it appears in.)
    spelld.c:235: parse error before `addrlen'
    spelld.c:241: `addrlen' undeclared (first use this function)
    gmake[1]: *** [spelld.o] Error 1
    gmake[1]: Leaving directory `/usr/local/src/mnogosearch-3.1.13/src'
    gmake: *** [all-recursive] Error 1

thanks in advance.

Reply: http://search.mnogo.ru/board/message.php?id=2257
Re: Russian letter 'io' ('ё')
[ On Wed, 23 May 2001, Alexander Barkov wrote: ]

AB> How does mnoGoSearch treat the Russian letter 'io' ('ё')?
AB>
AB> Does it treat this letter as equal to the Russian 'ie' ('е')?
AB> Or not?
AB>
AB> Should I use this letter in ispell dictionaries or not?
AB>
AB> It's not equal to 'ie'; it is treated as a separate letter.
AB>
AB> In ispell it is treated as a separate letter too.

I think it's not good to have no choice, not even at the configure stage. :-(
Some pages may be written using 'io', while others may be written with 'ie' in place of every 'io'. There is a tradition of not using 'io' in texts.

BTW, it would be good to mention this in the mnoGoSearch documentation, because ispell dictionaries can be built with or without 'io' (and the default is, AFAIK, without 'io').

Danil Lavrentyuk
Communiware.net Programmer
Webboard: Doc Relevance
Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
DR means the number of unique search words found in the document. It is always 1 if you search for only one word. However, the most relevant document is always displayed first. In 3.2 we want to add the ability to display something like a percentage.

> While performing a search, I realised that the document relevancy $DR is
> always 1. This is rather weird, because I always thought that the document
> relevancy value should be derived from the search text. How is it possible
> that a document which contains more occurrences of the search text has the
> same document relevancy value as documents with fewer occurrences?
>
> How should I configure indexer.conf during indexing so that the document
> relevancy can be taken into account when a search is issued?
>
> Any help on the matter is much appreciated.
>
> -- Jenson

Reply: http://search.mnogo.ru/board/message.php?id=2258
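Read literally, the description above makes DR a count of distinct query words present in a document, independent of how often each word occurs. A minimal Python sketch of that reading follows; it assumes simple whitespace tokenization and a hypothetical function name, and it is not taken from the mnoGoSearch sources.

```python
def doc_relevance(query, document):
    """Count how many distinct query words occur in the document.

    Repeated occurrences of the same word do not raise the score,
    which is why a one-word query can only ever score 0 or 1.
    """
    doc_words = set(document.lower().split())
    query_words = set(query.lower().split())
    return len(query_words & doc_words)
```

For a one-word query such as `doc_relevance("search", "search search search engine")` the result is 1 no matter how many times the word appears, which matches what Jenson observed.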
Webboard: compiling 3.1.13 failed on spelld.c
Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Replace socklen_t with int.

Thanks for reporting.

> FreeBSD 3.3
> GNU make 3.79.1
> mnoGoSearch 3.1.13
>
>     ./configure --with-mysql
>     gmake
>     ...
>     ...
>     spelld.c: In function `main':
>     spelld.c:235: `socklen_t' undeclared (first use this function)
>     spelld.c:235: (Each undeclared identifier is reported only once
>     spelld.c:235: for each function it appears in.)
>     spelld.c:235: parse error before `addrlen'
>     spelld.c:241: `addrlen' undeclared (first use this function)
>     gmake[1]: *** [spelld.o] Error 1
>     gmake[1]: Leaving directory `/usr/local/src/mnogosearch-3.1.13/src'
>     gmake: *** [all-recursive] Error 1
>
> thanks in advance.

Reply: http://search.mnogo.ru/board/message.php?id=2259
Tags
This question must have been asked a thousand times already because of the lack of documentation: how does one _implement_ keys? I understand the theory behind keys, but I do not see how you actually implement them.

If I have a sub-domain browse.baytrder.org, how would I assign it the tag browse? After tagging, is it just a matter of appending t=browse to the search.cgi URL?

Any help or a link to better documentation is appreciated.

--
Doug Hanks = dhanks(at)gnu(dot)org
Finger dhanks(at)gnu.org for GPG key
Re: Russian letter 'io' ('ё')
[ On Thu, 24 May 2001, Maxime Zakharov wrote: ]

MZ> I think it's not good to have no choice, not even at the configure stage. :-(
MZ> Some pages may be written using 'io', while others may be written
MZ> with 'ie' in place of every 'io'. There is a tradition of not using 'io' in
MZ> texts.
MZ>
MZ> BTW, it would be good to mention this in the mnoGoSearch documentation,
MZ> because ispell dictionaries can be built with or without 'io' (and the
MZ> default is, AFAIK, without 'io').
MZ>
MZ> To my mind, I strongly recommend making Russian affixes for both
MZ> cases, because some web pages may contain 'yo', while others use 'e'.

And how could the indexer determine which of them to use for each page? And how could the search engine determine whether or not to use 'io'-containing dictionaries when analyzing a user request?

I don't think this is the way.

Danil Lavrentyuk
Communiware.net Programmer
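One generic way around the question of which spelling to expect on a given page is to fold 'ё' into 'е' on both sides, when indexing and when parsing the query, so neither side has to guess. The Python sketch below illustrates that idea only; it is not a mnoGoSearch feature discussed in this thread, and `fold_io`, `build_index`, and `search` are hypothetical names.

```python
def fold_io(text):
    """Map 'ё'/'Ё' to 'е'/'Е' so both spellings compare equal."""
    return text.replace("ё", "е").replace("Ё", "Е")


def build_index(docs):
    """Build a word -> set of document ids map from folded, lowercased text."""
    index = {}
    for doc_id, text in docs.items():
        for word in fold_io(text).lower().split():
            index.setdefault(word, set()).add(doc_id)
    return index


def search(index, query):
    """Look up a single query word after applying the same folding."""
    return index.get(fold_io(query).lower(), set())
```

Because the same folding is applied to documents and queries, a page written with 'ё' and a page written without it are found by either spelling. The trade-off is that the two letters can no longer be distinguished in search.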
Bug report
UdmSearch version: 3.1.12
Platform: Dell 2400 (Dual PIII 667, 1Gig RAM)
OS: Debian Linux 2.1 (Kernel 2.2.19)
Database:
Statistics:

We have a Gartner database that has hundreds of URLs referenced internally as relative URLs starting with "./", i.e. something like "./../../research/etc...". mnoGoSearch seems to treat that first single period as a double period, i.e. it reports a broken link pointing to a URL that is one directory higher than it should be. Clicking on these links works fine in every browser I've tried.
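The expected resolution of such a link can be checked with Python's standard `urllib.parse.urljoin`, which ignores a leading "./" segment. The base URL and file names below are made up for illustration, and `resolve_misread` only simulates the behaviour described in the report; it is not code from mnoGoSearch.

```python
from urllib.parse import urljoin

# Hypothetical page containing the relative link.
BASE = "http://www.example.com/a/b/c/page.html"


def resolve(base, href):
    """Resolve href against base the way browsers do (RFC 3986 dot-segment removal)."""
    return urljoin(base, href)


def resolve_misread(base, href):
    """Simulate the reported bug: a leading './' is handled as if it were '../'."""
    if href.startswith("./"):
        href = "../" + href[2:]
    return urljoin(base, href)
```

Here `resolve(BASE, "./../../research/index.html")` gives `http://www.example.com/a/research/index.html`, while the misread variant ends up one directory higher, at `http://www.example.com/research/index.html`.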
Webboard: Cookies Support
Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Hello!

It's still on the TODO list. One possible solution is probably to hack the function UdmAddURL() and cut SESS=XXX substrings before inserting into the database.

> I have posted a question here several times about cookie support in
> mnoGoSearch. Someone answered that it was on the TODO list. I would just
> like to know whether the coders have an idea of when, because I'm really
> interested ;p
>
> In fact, it's because I'm using sessions (PHP) on my website, and if the
> browser (like the parser) doesn't support cookies, the session is passed
> in the URL (like SESS=ksjfhsjkdf45zefD). The problem is that when I try to
> strip just the session from the URLs in my database, MySQL tells me that
> the URL is already there.
>
> After some research I discovered that (it goes without saying) mnoGoSearch
> considers different sessions to be different web pages ...
>
> I hope that someone has understood me ;p
> And thank you if someone has a solution
>
> Cheers,

Reply: http://search.mnogo.ru/board/message.php?id=2262
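The idea of cutting the session parameter before a URL is stored can be sketched in Python as follows. This is an illustration of the normalization step only, not a patch for UdmAddURL(); the function name and the example URLs are assumptions, and `SESS` is the parameter name used in the message above.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def strip_session(url, param="SESS"):
    """Return url with every query parameter named `param` removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != param
    ]
    # An empty query string makes urlunsplit drop the '?' entirely.
    return urlunsplit(parts._replace(query=urlencode(kept)))
```

Two URLs that differ only in their `SESS` value normalize to the same string, so they would be stored once instead of once per session.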
Re: Webboard: Storing special characters
Alexander Barkov wrote:
> Author: Alexander Barkov
> Email: [EMAIL PROTECTED]
> Message:
> Hi!
>
> Find #define WORDCHAR ... in the sources and add the _ sign to its content.
> Then recompile both indexer and search.cgi.
>
> However, please note that ALL _'s will then be considered letters, and you
> will not be able to search for the word parts around a _ sign.

Alex,

A new functionality request for you: a selection of characters that, when found, are indexed or not depending on whether they are surrounded by letters.

For example, 234.234 would be indexed as '234.234', and 'lazy dog.' would be indexed as 'lazy' and 'dog' (note: no full stop at the end of 'dog').

The list of characters would be selectable, and it would also solve other problems, like 'TCP/IP' being indexed as 'TCP' and 'IP', whereas in most other cases you would want to ignore the '/' character.

The search interface would search with what it is given, without stripping all the punctuation characters.

Yours,
Matthew Sullivan
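The requested rule can be sketched as a small tokenizer: a character from a configurable list stays inside a token only when it has word characters on both sides, and is dropped otherwise. This Python sketch is one possible reading of the request, not existing mnoGoSearch behaviour; `tokenize` and its `joiners` parameter are hypothetical names.

```python
import re


def tokenize(text, joiners="./"):
    """Split text into tokens, keeping a joiner only between word characters.

    Note that \\w also matches '_' and digits, so '234.234' stays whole.
    """
    if not joiners:
        return re.findall(r"\w+", text)
    joiner_class = "[" + re.escape(joiners) + "]"
    # One run of word characters, optionally followed by joiner + word runs.
    pattern = re.compile(r"\w+(?:" + joiner_class + r"\w+)*")
    return pattern.findall(text)
```

With the default list, `tokenize("lazy dog.")` drops the trailing full stop, while `tokenize("TCP/IP")` keeps the token whole. Passing `joiners="."` makes '/' a separator again, which corresponds to the selectable character list in the request.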