Re[3]: [Fwd: BOUNCE general@mnogosearch.org: Non-member submission from [andreas andi@hostname.de]]

2001-03-01 Thread Sergey Kartashoff

Hi!

Wednesday, February 28, 2001, 9:08:17 PM, you wrote:

a if ($udmconn = udm_alloc_agent("mysql://user@host/search/","crc-multi")) {
a  udm_set_agent_param($udmconn,  UDM_PARAM_PAGE_SIZE,10);

a  [more deleted]
a  ...

a  $result  =  udm_find($udmconn,$q);  // $q contains input from form-field

Could you please insert here (before call to udm_get_res_field):
print $result;

and give me output of this print ?
I have to understand where does result link breaking.

a  echo  udm_get_res_field($result,0,UDM_FIELD_TITLE);  //  ^...
a  this line crashes php!

a  udm_free_res($result);
a  udm_free_agent($udmconn);
a }

a Thats  all  for  now.  The  line above, containing udm_get_res_field()
a should show one single Title of the first found result.

a Its not a problem of the number of the result or anything else.

a The  problem  occurs only with certain search words. I don't know with
a words  cause the problem and why. One very strange thing is, that if i
a try  to  search  for  "mp3"  just as an example word, it crashes. If i
a search for "blabla" it does not crash.

It is very strange. What version of mnogosearch are you using ?
To Alex : what do you think about that ?
This could be a bug in the library.

a I  think,  that  is  a problem with a Segmentation Fault, which occurs
a when having an empy string somewhere - or something like this. Because
a that  problem also occurs when browsing the results pages, for example
a just  clicking  forward  to  the  next  page  of  results.  Under some
a conditions,  there  is  one  result  on  that  page,  which causes the
a function udm_get_res_field() to crash.

a Some  last  information:  i'm  using  dbmode  crc-multi, with phrase 
a crosswords,  CheckMp3Tag  yes  and  IndexMP3TagOnly  yes, currently no
a server table on a linux machine kernel 2.2.16, 1gb ram.

To Alex: Could this be because of bugs in mp3 tags support r something
else ?

a The  complete  testing  database used is nearly empty except for a few
a hundred urls just for testing.

Thank you very much.

-- 
Regards, Sergey aka gluke.


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: php - udm_*() segfault gdb-backtrace

2001-03-01 Thread Sergey Kartashoff

Hi!

Thursday, March 01, 2001, 12:17:38 AM, you wrote:

a ?
a // lets search for the word "mp3"
a $q = "mp3";
a if ($udmconn = udm_alloc_agent("mysql://root@localhost/w3audio/","crc-multi")) {
a udm_set_agent_param($udmconn,  UDM_PARAM_PAGE_SIZE,10); 
a $result = udm_find($udmconn,$q);
a echo udm_get_res_param($result,UDM_PARAM_SEARCHTIME)." sek searchtime\n";
a echo udm_get_res_param($result,UDM_PARAM_FOUND)." results.\n\n";
a // give me 4 results
a $bgn = 0; $end = 3;
a while ($bgn = $end) {
a  echo $bgn." - ";
a  echo udm_get_res_field($result,$bgn,UDM_FIELD_TITLE)."\n";
a  echo "".udm_get_res_field($result,$bgn,UDM_FIELD_URL)."\n\n";
a }
a udm_free_res($result);
a udm_free_agent($udmconn);
a }
?

a then run it on the commandline:

a # ./php test.php
a 0.01 sek searchtime
a 1407 results.

I see no output from echo $bgn." - "; string here.

a Segmentation fault

a *boof*

a now lets go to gdb:

a # gdb ./php
a (gdb) run -q test.php
a Starting program: ./php -q test.php
a 0.01 sek searchtime
a 1407 results.


a Program received signal SIGSEGV, Segmentation fault.
a 0x8066c57 in php_if_udm_get_res_field (ht=3, return_value=0x818ac7c, this_ptr=0x0, 
return_value_used=1) at php_mnogo.c:845
a 845 RETURN_STRING((Res-Doc[row].text),1);

Very strange. It seems that you debugging not a valid core.
Look here: RETURN_STRING((Res-Doc[row].text),1);
but in your php script you are printing TITLE, not TEXT.

a Hopefully this is a small bug and can be fixed soon. Hopefully i could
a help to find it

I hope too 8)

a Please,  if  you  know  what is causing the segmentation fault, let me
a know.

Currently i have no ideas.
Please do a print $result before call to udm_get_res_field to
see if this is a valid result identifier.

-- 
Regards, Sergey aka gluke.


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: BOUNCE general@mnogosearch.org: Non-member submission from [Zenon Panoussis lrh@xs4all.nl]

2001-03-01 Thread Maxime Zakharov

ðÒÉ×ÅÔ,

õ ÎÅÇÏ ÓÌÕÞÁÅÍ ÎÅ ÍÕÌØÔÉÔÒÅÄÎÁÑ ×ÅÒÓÉÑ ÉÓÐÏÌØÚÕÅÔÓÑ ?
ïÞÅÎØ ÐÏÈÏÖÅ ÎÁ ÔÏ, ËÏÇÄÁ ÐÁÍÑÔÉ ÄÌÑ ÎÉÔÉ ÎÅ È×ÁÔÁÌÏ - ÉÍÅÎÎÏ × ÔÏÍ ÖÅ ÍÅÓÔÅ É 
ÔÒÁÐÁÅÔÓÑ.

On Thu, 01 Mar 2001 09:15:37 +0400
Alexander Barkov [EMAIL PROTECTED] wrote:

AB OK. Please check also this:
AB 
AB print realsize
AB print *Doc
AB 
AB 
AB 
AB Zenon Panoussis wrote:
AB   please run the following commands in gdb:
AB  
AB   frame 1
AB   print content_type
AB   print Method
AB   print Doc
AB   print Doc-content
AB   print Doc-url
AB  
AB  #0  0x80600ca in UdmCRC32 (buf=0x4021c03e "", size=4294967295) at crc32.c:97
AB  97  _CRC32_(crc, *p) ;
AB  (gdb) frame 1
AB  #1  0x804d7f8 in UdmIndexNextURL (Indexer=0x807ca50, index_flags=4) at 
indexer.c:1150
AB  1150crc32=UdmCRC32(Doc-content, (size_t)realsize);
AB  (gdb) print content_type
AB  $1 = 0x4021c027 "application/unknown"
AB  (gdb) print Method
AB  $2 = 1
AB  (gdb) print Doc
AB  $3 = (UDM_DOCUMENT *) 0x91ef7d8
AB  (gdb) print Doc-content
AB  $4 = 0x4021c03e ""
AB  (gdb) print Doc-url
AB  $5 = 0x91f0548 "http://www.xs4all.nl/~fishman/ls/."
AB  
AB  See my (bounced) posting from [EMAIL PROTECTED] on
AB  Tue, 27 Feb 2001 15:46:37 +0100 for details about this and
AB  the other URLs that the indexer crashes on.
AB ___
AB If you want to unsubscribe send "unsubscribe general"
AB to [EMAIL PROTECTED]
AB 
AB 

--
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: BOUNCE general@mnogosearch.org: Non-member submission from [Zenon Panoussis lrh@xs4all.nl]

2001-03-01 Thread Alexander Barkov

realsize -1 means that there was an error while donwloading document.

I found that there is not checking in indexer.c for this. Please
find a patch here:
http://gw.udmsearch.izhnet.ru/~bar/crc32.indexer.c.patch.gz
It should the crash.



Take a look into proto.c.  UDM_NET_ERROR (it is -1) is returned only
in two places:

1. in open_host() function, when port is 0.
2. in UdmHTTPGet() function, when select() returns an error.

I have no idea what is happening.



Zenon Panoussis wrote:
 
 Alexander Barkov skrev:
 
 
  OK. Please check also this:
 
  print realsize
  print *Doc
 
 (gdb) frame 1
 #1  0x804d7f8 in UdmIndexNextURL (Indexer=0x807ca50, index_flags=4) at indexer.c:1150
 1150crc32=UdmCRC32(Doc-content, (size_t)realsize);
 (gdb) print realsize
 $1 = -1
 (gdb) print *Doc
 $2 = {url_id = 12018, status = 0, size = 0, rating = 0, order = 0, referrer = 0, tag 
= 0, hops = 3,
   indexed = 0, url = 0x91f0548 "http://www.xs4all.nl/~fishman/ls/.", content_type = 
0x0, title = 0x0,
   keywords = 0x0, description = 0x0, text = 0x0, category = 0x0, content = 
0x4021c03e "",
   last_mod_time = 0, last_index_time = 983253816, next_index_time = 0, crc32 =  0}
 
 Z
 
 --
 oracle@everywhere: The ephemeral source of the eternal truth...
 ___
 If you want to unsubscribe send "unsubscribe general"
 to [EMAIL PROTECTED]
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




RE: Searching multiple tags

2001-03-01 Thread Briggs, Gary

I'm using the PHP frontend at the moment.

And I shall actually continue using it, especially when the PHP 4.0.5 comes
out (=

Gary (-;

 -Original Message-
 From: Laurent LEVIER [SMTP:[EMAIL PROTECTED]]
 Sent: Wednesday, February 28, 2001 6:38 PM
 To:   [EMAIL PROTECTED]; Briggs, Gary; '[EMAIL PROTECTED]'
 Subject:  Re: Searching multiple tags
 
 When searching, the search tool is restricting tags within the select.
 
 So for sure it is possible, but what are you using as searching tool ? CGI
 ? PHP ? PERL ?
 
 At 17:49 28/02/2001 +, Briggs, Gary wrote:
 Is there any way that I can search from multiple tags in one search?
 
 Gary (-;
 ___
 If you want to unsubscribe send "unsubscribe general"
 to [EMAIL PROTECTED]
 
 Laurent LEVIER
 IT Systems  Networks, Unix System Engineer
 Security Specialist
 
 Argosnet Security Server : http://www.Argosnet.com
 "Le Veilleur Technologique", "The Technology Watcher"
 
 ___
 If you want to unsubscribe send "unsubscribe general"
 to [EMAIL PROTECTED]
 
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re[5]: [Fwd: BOUNCE general@mnogosearch.org: Non-member submission from [andreas andi@hostname.de]]

2001-03-01 Thread Sergey Kartashoff

Hi!

Thursday, March 01, 2001, 11:41:31 AM, you wrote:

 It is very strange. What version of mnogosearch are you using ?
 To Alex : what do you think about that ?
 This could be a bug in the library.

a I  deleted  the /var directory completely - and the segmentation fault
a is  gone. So i think the udm_get_res_field function did find something
a from within files, but not database, and crashed.

Have you reindexed all and crash is gone ?
After reindex all your searches works fine ?
Am i right ? So the problem was corrupted cache-mode files ?

-- 
Regards, Sergey aka gluke.


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




RE: Searching multiple tags

2001-03-01 Thread Briggs, Gary

Hmmm.
The php one can't .

Gary (-;

 -Original Message-
 From: Alexander Barkov [SMTP:[EMAIL PROTECTED]]
 Sent: Thursday, March 01, 2001 5:13 AM
 To:   [EMAIL PROTECTED]; Briggs, Gary
 Subject:  Re: Searching multiple tags
 
 "Briggs, Gary" wrote:
  
  Is there any way that I can search from multiple tags in one search?
  
  Gary (-;
 
 
 Just submit several t=XXX  pairs from HTML form.
 At least search.cgi can do it.
 ___
 If you want to unsubscribe send "unsubscribe general"
 to [EMAIL PROTECTED]
 
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: Searching multiple tags

2001-03-01 Thread Alexander Barkov

Sergey, any comments?


"Briggs, Gary" wrote:
 
 Hmmm.
 The php one can't .
 
 Gary (-;
 
  -Original Message-
  From: Alexander Barkov [SMTP:[EMAIL PROTECTED]]
  Sent: Thursday, March 01, 2001 5:13 AM
  To:   [EMAIL PROTECTED]; Briggs, Gary
  Subject:  Re: Searching multiple tags
 
  "Briggs, Gary" wrote:
  
   Is there any way that I can search from multiple tags in one search?
  
   Gary (-;
 
 
  Just submit several t=XXX  pairs from HTML form.
  At least search.cgi can do it.
  ___
  If you want to unsubscribe send "unsubscribe general"
  to [EMAIL PROTECTED]
 
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re[2]: Searching multiple tags

2001-03-01 Thread Sergey Kartashoff

Hi!

Thursday, March 01, 2001, 12:34:17 PM, you wrote:

BG Hmmm.
BG The php one can't .

Php can do this.
Tags are searched against SQL LIKE syntax.
So you can search for example for "ABCD%" tag,
which is actially search by group of tags.

-- 
Regards, Sergey aka gluke.


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




php-mnogo-1.01

2001-03-01 Thread Sergey Kartashoff

Hi!

  From ChangeLog:
  Fixed memory leak in link list destructor.

-- 
Regards, Sergey aka gluke.
 php-mnogo-1.01.tgz


Re[2]: Searching multiple tags

2001-03-01 Thread Sergey Kartashoff

Hi!

Thursday, March 01, 2001, 1:11:17 PM, you wrote:

AB Sergey, any comments?

Already sent. Php does support it.

-- 
Regards, Sergey aka gluke.


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: 3.1.11: stopwords substring search

2001-03-01 Thread Alexander Barkov

"Andrey A. Chernov" wrote:
 
 There  is  apparent conflict between stopwords and substring search, f.e.
 try   to   search  Russian  'ÎÁÄÏ'  (which  is  stopword  by  itself)  in
 {beg,end,sub}string mode - search stopped with no result because stopword
 immediately  found,  but  it  prevents  other (longer) words to be found.
 Please  consider  stopwords  as  not showstoppers in substring search and
 just simple continue search after them.
 


Patch which fixes this is in attachement.
Now searchwords are not checked in the case 
of "substring match".

Thanks for reporting!
 seatchtool.c.patch.gz


Webboard: 3.1.11: stopwords substring search

2001-03-01 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
 There  is  apparent conflict between stopwords and substring search, f.e. try   to   
search  Russian  'íàäî'  (which  is  stopword  by  itself)  in {beg,end,sub}string 
mode - search stopped with no result because stopword immediately  found,  but  it  
prevents  other (longer) words to be found. Please  consider  stopwords  as  not 
showstoppers in substring search and just simple continue search after them.
 


Fixed, thanks!

Reply: http://search.mnogo.ru/board/message.php?id=1584

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: bugs or what?

2001-03-01 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
 First, I've tried to set up mnogosearch to my home directory, since I'm not a root 
on a system. The first íåñîîòâåòñòâèå was that for bult-in database we need to 
specify --with-biult-in keyword in configure - that's not described in manuals, 
how-to-install and so on... I've setted up and I've found the second problem - the 
indexer was not working - it wrote 
 Indexer[11730]: indexer from mnogosearch-3.1.11/UdmDB started with 
'/home/user/mnogo/etc/indexer.conf-dist'
 Indexer[11730]: [1] Error: ''
 Where can be trouble - config leaved untouched.

Please find a patch here:
http://gw.udmsearch.izhnet.ru/~bar/patches/files.c.patch.gz

Thanks for reporting!

Reply: http://search.mnogo.ru/board/message.php?id=1585

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: bugs or what?

2001-03-01 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
 I found that problem source - it was that mnogosearch tried to use SQL in stopword 
section, but I used a built-in database and when I commented that string indexer was 
working... But I want to ask you, how to index a ftp - I gave quot;-u 
ftp://URLquot; but indexer worked 0 sec adn finished...


Add Server command for it into indexer.conf, for example:

Server ftp://ftp.cdrom.com/

Reply: http://search.mnogo.ru/board/message.php?id=1586

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Indexer gets killed for not apparent reason - ongoing topic

2001-03-01 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Is core file produced?


  I know I have posted this message before but I got no replies, the problem is that 
indexer gets killed after the var/dict.txt files reaches ~7MB, I have more then 
enough space on the device (36GB) and it's error-free. Any help would be appreaciated.
  Here's the info:
 
 MnogoSearch vs. is 3.1.11 
 OS is IRIX 6.5.8m 
 Platform is a 100 MHZ IP22 with R4600 processor (SGI Indi) 
 DB is a built-in DB 
 GDB results are (there is no core dumped): 
 
 Starting program: /site/www/search/mnogosearch/sbin/indexer -a -v 5 
 indexer from mnogosearch-3.1.11/UdmDB started with 
'/site/www/search/mnogosearch/etc/indexer.conf' 
 [1] http://spinoza.tau.ac.il/robots.txt 
 
 lt;snipped lots of indexed .html filesgt; 
 
 
 [1] quot;hanoch1-evo.htmlquot;: Allow by default 
 [1] quot;hanoch1-heb.htmlquot;: Allow by default 
 [1] quot;kasher-hb.htmlquot;: Allow by default 
 
 Program terminated with signal SIGKILL, Killed. 
 The program no longer exists. 
 Current language: auto; currently asm
 
 
  Thanks,
 
  Nir

Reply: http://search.mnogo.ru/board/message.php?id=1587

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: indexer inserts unwanted url into database

2001-03-01 Thread john

Author: john
Email: [EMAIL PROTECTED]
Message:
I don't want to index full server,only some directories ,so i am giving commands like:

Server  http://www.raaga.ch/de/dienst/
Server  http://www.raaga.ch/de/kontakt/
Server  http://www.raaga.ch/de/ueberuns/

inside these directories i have other directories also,e.g
Server  http://www.raaga.ch/de/dienst/reply
Server  http://www.raaga.ch/de/dienst/answers
Server  http://www.raaga.ch/de/dienst/post


But in results page it is generating url corresponding to all these server commands,I 
want to avoid this,please help.e.g
http://www.raaga.ch/de/dienst/
http://www.raaga.ch/de/kontakt/
http://www.raaga.ch/de/ueberuns/
http://www.raaga.ch/de/dienst/reply
http://www.raaga.ch/de/dienst/answers
http://www.raaga.ch/de/dienst/post





Reply: http://search.mnogo.ru/board/message.php?id=1590

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




searching in url.description possible?

2001-03-01 Thread andreas

hi,

i'm  currently  trying to index a few MP3-files using the Id3-Tag. The
Id3-Tag   is   read  and  inserted  in  the  url-table  in  the  field
'description' for example.

When searching for a word, only the dict-tables are checked, right?

How  can  i  manage  the  Id3-Tag  information  to  get written to the
dict-tables  too?  Without that feature, it makes no sence to even try
to index MP3-files - or am i wrong?

Any ideas welcome

Andy


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




php-mnogo-1.03

2001-03-01 Thread Sergey Kartashoff

Hi!

 Added UDM_PARAM_CROSS_WORDS agent session parameter to use crosswords
 feature available since mnogosearch 3.1.11.

-- 
Regards, Sergey aka gluke.
 php-mnogo-1.03.tgz


Webboard: indexer inserts unwanted url into database

2001-03-01 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Use either robots.txt file or allow/disallow commands.

 I don't want to index full server,only some directories ,so i am giving commands 
like:
 
 Server  http://www.raaga.ch/de/dienst/
 Server  http://www.raaga.ch/de/kontakt/
 Server  http://www.raaga.ch/de/ueberuns/
 
 inside these directories i have other directories also,e.g
 Server  http://www.raaga.ch/de/dienst/reply
 Server  http://www.raaga.ch/de/dienst/answers
 Server  http://www.raaga.ch/de/dienst/post
 
 
 But in results page it is generating url corresponding to all these server 
commands,I want to avoid this,please help.e.g
 http://www.raaga.ch/de/dienst/
 http://www.raaga.ch/de/kontakt/
 http://www.raaga.ch/de/ueberuns/
 http://www.raaga.ch/de/dienst/reply
 http://www.raaga.ch/de/dienst/answers
 http://www.raaga.ch/de/dienst/post
 
 
 
 

Reply: http://search.mnogo.ru/board/message.php?id=1593

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: bugs or what?

2001-03-01 Thread DUKe

Author: DUKe
Email: 
Message:
I understand this, but if I want to index much more sites, should I include in 
'server' line them?

Reply: http://search.mnogo.ru/board/message.php?id=1594

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: php-mnogo-1.01

2001-03-01 Thread Alexander Barkov

Old PHP front-end uses SQL functions to access mnoGoSearch data.

New front-end is native module for PHP:

  http://www.php.net/manual/en/ref.mnogo.php



"Briggs, Gary" wrote:
 
 This is probably a REALLY stupid question, but what's the difference between
 this and the PHP frontend that appears on the mnogo homepage?
 
 Thanks,
 Gary (-;
 
  -Original Message-
  From: Sergey Kartashoff [SMTP:[EMAIL PROTECTED]]
  Sent: Thursday, March 01, 2001 12:04 PM
  To:   [EMAIL PROTECTED]
  Cc:   devel-ru
  Subject:  php-mnogo-1.01
 
  Hi!
 
From ChangeLog:
Fixed memory leak in link list destructor.
 
  --
  Regards, Sergey aka gluke.  File: php-mnogo-1.01.tgz 
 ___
 If you want to unsubscribe send "unsubscribe general"
 to [EMAIL PROTECTED]
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re[2]: php-mnogo-1.01

2001-03-01 Thread Sergey Kartashoff

Hi!

Thursday, March 01, 2001, 4:50:11 PM, you wrote:

BG This is probably a REALLY stupid question, but what's the difference between
BG this and the PHP frontend that appears on the mnogo homepage?

This is not frontend.
This is a mnogosearch php extension which adds native mnogosearch
function into php.

-- 
Regards, Sergey aka gluke.


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]