Re: just searching within a site (win32)

2001-05-09 Thread Ramil Kalimullin

Hi!

> 1) is there any way to search for matches within just a site ?

Please take a look into help (chapter 13, 14)

> besides that, pls. anyone to verify that the category filter in the
indexer
> tab does NOT work ?

We'll check it.

> and many times the indexer just hangs while selecting (not the first time)

Please describe this in more detail.

Regards, Ramil.

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Optimizing indexer

2001-05-09 Thread Mario Gray

Author: Mario Gray
Email: [EMAIL PROTECTED]
Message:
The blank to normal text links are nearly 1:2
which means that half of the good(OK) data are blank
and wont get indexed until next period, furthermore,
other status codes get thrown into the mix causing 
less pages to be accurately accounted for. 
For instance, 
503, only says server down, but indexer assigns this itself
whenever the system (not the remote host) cannot handle the traffic
itself.  This causes indexer to not store cache/ndict words

I am making some modification, but if anyone has any questions
or comments please reply!

Thanx


Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




mnogosearch on intranet

2001-05-09 Thread Florin Andrei

Suppose i have an intranet with several thousands hosts; well, maybe not
all of them are running web servers, but there still are several
hundreds httpd's out there...
Is mnogosearch the right tool to search and index such a network?

-- 
Florin Andrei

"Imagine working in a secure environment and finding the string
_NSAKEY in the OS binaries without a good explanation." - Alan Cox

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-09 Thread Danil Lavrentyuk

[ On Wed, 9 May 2001, Maxime Zakharov wrote: ]

MZ> > And what if a site having many texts uploaded by users?
MZ> > Have I manualy edit all they satting "lang" attributes? :)
MZ> > Have I demand it from uploader? They will not.
MZ>
MZ> Users may upload big mega gifs as .html files :)

It would be an obvious fraud...

MZ> Let talk about W3C recommendations.

... but ignoring of far-away-placed committee's recomendations could be a
simply laziness.
Not all of the software use all of the recomendations.
Not all of users know all of the recomedations. Even not all of users think on
using such recomendations.

Text could be converted to HTML from someone another text fromat.
Who, for example, will check for foreign phrases such text like big books
which consists of many volumes (like "Amber" by Zhilazny or "Wheel Of Time" by
Jordan or even bigger)? :)

Let's tall about real world where we would have to index multilanguage texts
without "lang" attributes.

MZ> > What if I have to index texts placed somewhere in the internet, not locally?
MZ> > What if a site contains texts of many books (something like www.lib.ry, for
MZ> > example)?
MZ>
MZ> Sometime, without explicit language definition it's impossible uniquely
MZ> select language for a word.
MZ> For example, word 'test' may be english or german.

I know.
Think it is real (but hard, I see) to make a system which could guess what the
text's language is. It could use 2 steps:
1) Create a list of encodings this text could be written in (symply by
testing, is all of the word's characters are aplhas in this encoding). Here we
could think that a two or more successive "foreign" words are from the same
language.
2) Check (using ispell tables) all the languages which use encondigs from list
(created above), looking for one where this words are correct.
3) (optoinal) If there more then one language suitable, select one that was
seelcted for the previous phrase.

OK. This method does not gurantee that selection will be correct always. But
in the most cases it will.

Yes, I know, this method is not too quick... But it is better then no any
method at all. Any way it is good to make it able to turn it of in the
indexer.conf file or by a command line option.


Danil Lavrentyuk
Communiware.net
Programmer

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




No Subject

2001-05-09 Thread Mark J. Hershenson


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Optimizing indexer

2001-05-09 Thread Mario Gray

Author: Mario Gray
Email: [EMAIL PROTECTED]
Message:
Hello,
   I am using indexer in a commercial environment.  I have about 2.5mega-urls but alot 
of them are blank descriptions with blank titles, that get shown in the results page. 
So before I make any major modifications to indexer, does someone have any suggestions 
for me in order to get less blank hits?

 Thanx


Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




just searching within a site (win32)

2001-05-09 Thread La Rocca Network

Hi !

1) is there any way to search for matches within just a site ?

besides that, pls. anyone to verify that the category filter in the indexer
tab does NOT work ?
and many times the indexer just hangs while selecting (not the first time)

thanks,
regards,

Nelson





___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-09 Thread Maxime Zakharov

Danil Lavrentyuk wrote:
> MZ> Testing. This is simple..
> MZ>
> MZ> will considered as english text, regardless document language.
> 
> And what if a site having many texts uploaded by users?
> Have I manualy edit all they satting "lang" attributes? :)
> Have I demand it from uploader? They will not.

Users may upload big mega gifs as .html files :) Let talk about W3C
recommendations.

> What if I have to index texts placed somewhere in the internet, not locally?
> What if a site contains texts of many books (something like www.lib.ry, for
> example)?

Sometime, without explicit language definition it's impossible uniquely
select language for a word.
For example, word 'test' may be english or german.

-- 
Maxime Zakharov   http://sochi.net.ru/~maxime/
 Sochi, Russia   http://www.sochi.com/
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: need to decode Intag field

2001-05-09 Thread souissi tijani

Author: souissi tijani
Email: [EMAIL PROTECTED]
Message:
hello 
can somebody help me understand the meaning of the intag field 
I need to know how is it calculated 
thanks a lot

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-09 Thread Danil Lavrentyuk

[ On Wed, 9 May 2001, Maxime Zakharov wrote: ]

MZ> > SK> DL> Will it simply think these wirds having an incorrect sepelling and (in 
case of
MZ> > SK> DL> IspellIncorrectFactor 1) use they 'as is' in indexing?
MZ> > SK>
MZ> > SK> It can correctly deterimine the multilanguage documents.
MZ> >
MZ> > How will it determine, what the language to use as "secondary".
MZ> > I.e. how can it guess, for example, have I english, french or latin phrase in
MZ> > russian text?
MZ>
MZ> by HTML attribute "lang".
MZ> For example,
MZ>
MZ> Testing. This is simple..
MZ>
MZ> will considered as english text, regardless document language.

And what if a site having many texts uploaded by users?
Have I manualy edit all they satting "lang" attributes? :)
Have I demand it from uploader? They will not.
What if I have to index texts placed somewhere in the internet, not locally?
What if a site contains texts of many books (something like www.lib.ry, for
example)?


Danil Lavrentyuk
Communiware.net
Programmer

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: resulting url when using frameset

2001-05-09 Thread Al

Author: Al
Email: [EMAIL PROTECTED]
Message:
Is there a simple way to give as answer to a search the frameset file
rather than the file appearing in the tags http://search.mnogo.ru/board/message.php?id=2168>

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-09 Thread Maxime Zakharov

Danil Lavrentyuk wrote:
> 
> SK> DL> Will it simply think these wirds having an incorrect sepelling and (in case 
>of
> SK> DL> IspellIncorrectFactor 1) use they 'as is' in indexing?
> SK>
> SK> It can correctly deterimine the multilanguage documents.
> 
> How will it determine, what the language to use as "secondary".
> I.e. how can it guess, for example, have I english, french or latin phrase in
> russian text?

by HTML attribute "lang".
For example,

Testing. This is simple..

will considered as english text, regardless document language.

-- 
Maxime Zakharov   http://sochi.net.ru/~maxime/
 Sochi, Russia   http://www.sochi.com/
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-09 Thread Danil Lavrentyuk

[ On Wed, 9 May 2001, Sergey Kartashoff wrote: ]

SK> DL> Could mnoGoSearch to correctly index, for example, english words in russian
SK> DL> text?
SK>
SK> Yes, it should.
SK>
SK> DL> Will it simply think these wirds having an incorrect sepelling and (in case of
SK> DL> IspellIncorrectFactor 1) use they 'as is' in indexing?
SK>
SK> It can correctly deterimine the multilanguage documents.

How will it determine, what the language to use as "secondary".
I.e. how can it guess, for example, have I english, french or latin phrase in
russian text?


Danil Lavrentyuk
Communiware.net
Programmer

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: Webboard: PHP+mnogosearch

2001-05-09 Thread Sebastian Nohn

On Wed, 9 May 2001, Martin wrote:

> Author: Martin
> Email: [EMAIL PROTECTED]
> Message:
> PHP doesn't seem to come with any facility for installing mnogosearch
> support. This must be covered by mnogosearch. Where is it documented?

Try PHP 4.0.5

Regards,
   Sebastian Nohn
-- 
NeT&Trade GmbH   email : [EMAIL PROTECTED]
TechnoPark der GMD   www   : http://www.nettrade.de/
Rathausallee 10  tel   : +49 22 41 - 20 40 33
53757 Sankt Augustin fax   : +49 22 41 - 20 40 34
Germany  mobile: +49 173 - 5 96 43 77




___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Long URLs and 3.2 branch

2001-05-09 Thread Markus

Author: Markus
Email: [EMAIL PROTECTED]
Message:
Hello All!

I just stumpled across a problem, is hopefully going to be solved. In
the mnogo 3.1 branch URLs which are longer than 128 bytes are
obviously not supported. I found a mail that says, this will be
tackled in the 3.2 branch.

Question:
Is there a timeline for the 3.2 branch? When will it be released?
or
Does anyone have a patch, which solves the problem for mnogosearch
with mysql?

Thanks for your answers,

Markus


Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-09 Thread Sergey Kartashoff

Hi!

Tuesday, May 08, 2001, 6:30:44 PM, you wrote:

DL> Could mnoGoSearch to correctly index, for example, english words in russian
DL> text?

Yes, it should.

DL> Will it simply think these wirds having an incorrect sepelling and (in case of
DL> IspellIncorrectFactor 1) use they 'as is' in indexing?

It can correctly deterimine the multilanguage documents.

-- 
Regards, Sergey aka gluke.


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]