Webboard: Perl Frontend (3.1)

2001-05-10 Thread Steve

Author: Steve
Email: [EMAIL PROTECTED]
Message:
I've just somewhat resolved my issue # 2 involving the output format, by simply 
removing the other two formats (short and URL) from the template... However, I'm still 
interested in a better way to control this (compatible with the original intention -- 
which was to allow for the dynamic control of the output format, possibly with a 
drop-down on the search form...)

I'm sure if I spend some time in the source I'll be able to get my head wrapped around 
this. However, I'm new to the sources and any pointing in the right direction would be 
greatly appreciated!

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: mnogosearch on intranet

2001-05-10 Thread Florin Andrei

On 10 May 2001 14:39:19 +0500, Alexander Barkov wrote:
> Florin Andrei wrote:
> > 
> > Suppose i have an intranet with several thousands hosts; well, maybe not
> > all of them are running web servers, but there still are several
> > hundreds httpd's out there...
> > Is mnogosearch the right tool to search and index such a network?
> > 
> 
> The number of hosts itself is not a problem. What affects search speed
> is
> aggregate amount of documents.

Ok.

What is the maximum total size of indexed documents that is manageable
by mnogosearch without marked performance degradation?

-- 
Florin Andrei

"Imagine working in a secure environment and finding the string
_NSAKEY in the OS binaries without a good explanation." - Alan Cox

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Perl Frontend (3.1)

2001-05-10 Thread Steve

Author: Steve
Email: [EMAIL PROTECTED]
Message:
While testing the Perl frontend I've run into two issues:

[1] The results appear to be returned basically with the order reversed compared to 
the C CGI (search.cgi). In addition, the top (#1) result returned by the C CGI is 
returned as the 17th result by the Perl frontend in my test?? (I performed the same 
query with both the standard C CGI and the Perl frontend, which returned 27 results: # 
1 on C interface was # 17 on Perl, # 2 on C was # 27 on perl, # 3 on C was # 26 on 
perl, and so on...) Does anyone have any comments regarding this behaviour??

[2] So far I can only get the results returned in the URL only output format with the 
Perl frontend. The C frontend returns in the long format and both frontend are using 
the same search.htm template file (perhaps I'm doing something wrong or have missed a 
config option)?

Is the PHP frontend preferred over the Perl frontend for some reason? It would be 
difficult for me to recompile the PHP interpreter to include mnoGo support, which is 
why I find the Perl frontend attractive.

Thanks!

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: indexer hangups

2001-05-10 Thread Mario Gray

Author: Mario Gray
Email: [EMAIL PROTECTED]
Message:
Hello, I am trying to deply an mnogo search on our corporate network.  But I am having 
problems.
The indexer seems to delete urls even though I set  "deletebad no"
I also use a list of urls to index (from a text file), but indexer
only seems to index SOME of them, and jumps around, or
indexes ALL of them, but deletes a lot of them in the sql table 
and inserts the word data in such a way that search.cgi comes up
with a lot of "0" rec_id which outputs as "no title" on the search page.

Is there a configuration Option I left out?
 

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Disease Information

2001-05-10 Thread Anonymous

Author: Anonymous
Email: Anonymous
Message:
How to cure Skitsofrenia, first step is always focus, with the right amount of open 
shakras and focus you would be able to bend the voices in your head (they're often 
caused by telepathic people with a hardon for suffer, the next step is to regress the 
damage they've done to your brain, without the actual damage no one would believe its 
a desiese hence its sometimes there
theres many ways of getting a man made desiese these days, the best known to me is 
pissing off religion, of course you'll never get any one to admit this its really 
quite tight

the final way of getting it is an odd one, take drugs.

the second way that i've found still exists by pissing people off, but this time the 
beefs with the Eric Martin Mental Health Facility in Victoria B.C, not only did i get 
my brain warped buy some over developed nurse but i got my life force drained by a 
satan worshiper, who more than likly also had a tie to my getting reliesed.
Weird shit happens a little too much in my town.

I hope this has helped you a little, and remember sometimes its real and sometimes its 
not.

PS: In the starange case that you really do have this disease seek hollistic medicine

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: just searching within a site (win32)

2001-05-10 Thread Ramil Kalimullin

Hi!

We've fixed this bug.

New version will be available tomorrow.

Regards,
Ramil.

- Original Message -
From: "La Rocca Network" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>; "Ramil Kalimullin" <[EMAIL PROTECTED]>
Sent: Thursday, May 10, 2001 7:01 PM
Subject: Re: just searching within a site (win32)


> Ramil:
>
> Usually we use both.
> We are indexing a large number of servers. Different paths from same
server
> may fit in different categories.
>
> Nelson
>
> - Original Message -
> From: "Ramil Kalimullin" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>; "La Rocca Network" <[EMAIL PROTECTED]>
> Sent: Thursday, May 10, 2001 6:26 AM
> Subject: Re: just searching within a site (win32)
>
>
> > Hi!
> >
> > Do you use different servers with different categories
> > (for example
> > Server http://server1/Category 01
> > Server http://server2/Category 02
> > )
> > or different paths at the same server with different categories
> > (for example
> > Server http://server/path1/Category 01
> > Server http://server/path2/Category 02
> > )?
> >
> > Ramil.
> >
> > - Original Message -
> > From: "La Rocca Network" <[EMAIL PROTECTED]>
> > To: <[EMAIL PROTECTED]>; "souissi tijani"
> <[EMAIL PROTECTED]>
> > Sent: Wednesday, May 09, 2001 10:20 PM
> > Subject: just searching within a site (win32)
> >
> >
> > > Hi !
> > >
> > > 1) is there any way to search for matches within just a site ?
> > >
> > > besides that, pls. anyone to verify that the category filter in the
> > indexer
> > > tab does NOT work ?
> > > and many times the indexer just hangs while selecting (not the first
> time)
> > >
> > > thanks,
> > > regards,
> > >
> > > Nelson
> > >
> > >
> > >
> > >
> > >
> > > ___
> > > If you want to unsubscribe send "unsubscribe general"
> > > to [EMAIL PROTECTED]
> > >
> > >
> >
> > ___
> > If you want to unsubscribe send "unsubscribe general"
> > to [EMAIL PROTECTED]
> >
>
>
>

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: just searching within a site (win32)

2001-05-10 Thread La Rocca Network

Ramil:

Usually we use both.
We are indexing a large number of servers. Different paths from same server
may fit in different categories.

Nelson

- Original Message -
From: "Ramil Kalimullin" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>; "La Rocca Network" <[EMAIL PROTECTED]>
Sent: Thursday, May 10, 2001 6:26 AM
Subject: Re: just searching within a site (win32)


> Hi!
>
> Do you use different servers with different categories
> (for example
> Server http://server1/Category 01
> Server http://server2/Category 02
> )
> or different paths at the same server with different categories
> (for example
> Server http://server/path1/Category 01
> Server http://server/path2/Category 02
> )?
>
> Ramil.
>
> - Original Message -
> From: "La Rocca Network" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>; "souissi tijani"
<[EMAIL PROTECTED]>
> Sent: Wednesday, May 09, 2001 10:20 PM
> Subject: just searching within a site (win32)
>
>
> > Hi !
> >
> > 1) is there any way to search for matches within just a site ?
> >
> > besides that, pls. anyone to verify that the category filter in the
> indexer
> > tab does NOT work ?
> > and many times the indexer just hangs while selecting (not the first
time)
> >
> > thanks,
> > regards,
> >
> > Nelson
> >
> >
> >
> >
> >
> > ___
> > If you want to unsubscribe send "unsubscribe general"
> > to [EMAIL PROTECTED]
> >
> >
>
> ___
> If you want to unsubscribe send "unsubscribe general"
> to [EMAIL PROTECTED]
>


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: mnogosearch on intranet

2001-05-10 Thread Alexander Barkov

Florin Andrei wrote:
> 
> Suppose i have an intranet with several thousands hosts; well, maybe not
> all of them are running web servers, but there still are several
> hundreds httpd's out there...
> Is mnogosearch the right tool to search and index such a network?
> 

The number of hosts itself is not a problem. What affects search speed
is
aggregate amount of documents.
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-10 Thread Alexander Barkov


3.2.x branch will have language guesser. It's already implemented
and work very fine for single-language pages or even "mostly
single-language"
pages. I hope first release of 3.2.x will be available in May.



Danil Lavrentyuk wrote:
> 
> [ On Wed, 9 May 2001, Maxime Zakharov wrote: ]
> 
> MZ> > And what if a site having many texts uploaded by users?
> MZ> > Have I manualy edit all they satting "lang" attributes? :)
> MZ> > Have I demand it from uploader? They will not.
> MZ>
> MZ> Users may upload big mega gifs as .html files :)
> 
> It would be an obvious fraud...
> 
> MZ> Let talk about W3C recommendations.
> 
> ... but ignoring of far-away-placed committee's recomendations could be a
> simply laziness.
> Not all of the software use all of the recomendations.
> Not all of users know all of the recomedations. Even not all of users think on
> using such recomendations.
> 
> Text could be converted to HTML from someone another text fromat.
> Who, for example, will check for foreign phrases such text like big books
> which consists of many volumes (like "Amber" by Zhilazny or "Wheel Of Time" by
> Jordan or even bigger)? :)
> 
> Let's tall about real world where we would have to index multilanguage texts
> without "lang" attributes.
> 
> MZ> > What if I have to index texts placed somewhere in the internet, not locally?
> MZ> > What if a site contains texts of many books (something like www.lib.ry, for
> MZ> > example)?
> MZ>
> MZ> Sometime, without explicit language definition it's impossible uniquely
> MZ> select language for a word.
> MZ> For example, word 'test' may be english or german.
> 
> I know.
> Think it is real (but hard, I see) to make a system which could guess what the
> text's language is. It could use 2 steps:
> 1) Create a list of encodings this text could be written in (symply by
> testing, is all of the word's characters are aplhas in this encoding). Here we
> could think that a two or more successive "foreign" words are from the same
> language.
> 2) Check (using ispell tables) all the languages which use encondigs from list
> (created above), looking for one where this words are correct.
> 3) (optoinal) If there more then one language suitable, select one that was
> seelcted for the previous phrase.
> 
> OK. This method does not gurantee that selection will be correct always. But
> in the most cases it will.
> 
> Yes, I know, this method is not too quick... But it is better then no any
> method at all. Any way it is good to make it able to turn it of in the
> indexer.conf file or by a command line option.
> 
> 
> Danil Lavrentyuk
> Communiware.net
> Programmer
> 
> ___
> If you want to unsubscribe send "unsubscribe general"
> to [EMAIL PROTECTED]
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: need to decode Intag field

2001-05-10 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
When phrase yes:

It is combined using word position and it's weight:

  pos*0x1+weight

When phrase no word appearance count  is used instead of it's pos:

  count*0x1+weight

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: just searching within a site (win32)

2001-05-10 Thread Ramil Kalimullin

Hi!

Do you use different servers with different categories
(for example
Server http://server1/Category 01
Server http://server2/Category 02
)
or different paths at the same server with different categories
(for example
Server http://server/path1/Category 01
Server http://server/path2/Category 02
)?

Ramil.

- Original Message -
From: "La Rocca Network" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>; "souissi tijani" <[EMAIL PROTECTED]>
Sent: Wednesday, May 09, 2001 10:20 PM
Subject: just searching within a site (win32)


> Hi !
>
> 1) is there any way to search for matches within just a site ?
>
> besides that, pls. anyone to verify that the category filter in the
indexer
> tab does NOT work ?
> and many times the indexer just hangs while selecting (not the first time)
>
> thanks,
> regards,
>
> Nelson
>
>
>
>
>
> ___
> If you want to unsubscribe send "unsubscribe general"
> to [EMAIL PROTECTED]
>
>

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: resulting url when using frameset

2001-05-10 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
> Is there a simple way to give as answer to a search the frameset file
> rather than the file appearing in the tags  

Unfortunatelly it's not implemented.


Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Long URLs and 3.2 branch

2001-05-10 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Hello!

To fix this in 3.1.x:

1. Change SQL url table structure, make url field longer.
2. Change UDM_URLSIZE definition in udm_common.h
3. Recompile



> Hello All!
> 
> I just stumpled across a problem, is hopefully going to be solved. In
> the mnogo 3.1 branch URLs which are longer than 128 bytes are
> obviously not supported. I found a mail that says, this will be
> tackled in the 3.2 branch.
> 
> Question:
> Is there a timeline for the 3.2 branch? When will it be released?
> or
> Does anyone have a patch, which solves the problem for mnogosearch
> with mysql?
> 
> Thanks for your answers,
> 
> Markus
> 

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-10 Thread Danil Lavrentyuk

[ On Thu, 10 May 2001, Alexander Barkov wrote: ]

AB> > Could mnoGoSearch to correctly index, for example, english words in russian
AB> > text?
AB>
AB> It can.
AB>
AB> > Will it simply think these wirds having an incorrect sepelling and (in case of
AB> > IspellIncorrectFactor 1) use they 'as is' in indexing?
AB>
AB> Yes. But you may add English ispell files too.

Hmmm..

If I'll add English ispell files - will it take words of 'latin' laters as
english words (when such words are occur in a russian text)?


Danil Lavrentyuk
Communiware.net
Programmer

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: Errors During Make

2001-05-10 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
> RH 5.2 i686 linux
> Getting these errors during make
> AM_PROG_LIBTOOL not found in lib
> AM_DISABLE_SHARED not found in lib
> What do you think?

Which version of msearch are you using? 
Is it taken from CVS? If so, probably you have
to upgrade automake and autoconf.



Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: htdb and his first entry

2001-05-10 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
Try  this indexer.conf command
  URLWeight 0


> hello,
> my question: how do i get ride of the first entry in the url table with all the 
>other urls produced by htdblist inside? why? because this entry come out als first 
>when somebody search for a word included in the url!
> thanks,
> manu :-)

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Webboard: indexing multiple sites

2001-05-10 Thread Alexander Barkov

Author: Alexander Barkov
Email: [EMAIL PROTECTED]
Message:
You may use URL limits. Take a look into default search.htm.
SELECT NAME=ul  is responsible for them.


> I need to know like I can index several sites, and that the finder allows to look 
>for me in: 
> - All the sites 
> - Each site in particular form 
> - Some section of some site .
> Example:
> I've the folowing URLs to index:
> - http://www.tercera.cl/
> - http://www.tercera.cl/sitios/
> - http://www.tercera.cl/casos/
> - http://www.lacuarta.cl/
> - http://www.lacuarta.cl/temas/
> - http://www.lacuarta.cl/sitios/
> - http://www.deportivo.cl/
> - http://www.mouse.cl/
> and some others...
> Someone can send me the indexer.conf and some search.htm?
> thanks a lot.
> 
> 
> 
> 
> 

Reply: 

___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: multilanguage text

2001-05-10 Thread Alexander Barkov

Danil Lavrentyuk wrote:
> 
> Hello!
> 
> Could mnoGoSearch to correctly index, for example, english words in russian
> text?

It can.

> Will it simply think these wirds having an incorrect sepelling and (in case of
> IspellIncorrectFactor 1) use they 'as is' in indexing?

Yes. But you may add English ispell files too.
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: Few random things

2001-05-10 Thread Alexander Barkov

"Briggs, Gary" wrote:
> 
> Has anyone here got a way of indexing powerpoint or visio documents?
> 
> Changing the document is not viable; I need a way to get the strings out of
> it.
> 
> "strings" is not too bad on powerpoint, but for visio it's not worth the
> effort.


You may use so called external parser - any program which can convert
visio documents into text or html. Check doc/parsers.txt


> Also, Is there any way to convert documents with this in them:
>  
> ?
> I'd ideally like to convert them to something more standard... Can I do
> this?


What format do you want to get after convertion?


> As in, I can't change anything. At all. I need a way to do all these things
> in the search engine.
___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]




Re: Bug report

2001-05-10 Thread Ramil Kalimullin

Hello!

Please try attached SQL script.

Regards,
Ramil.

- Original Message -
From: "Molara Federico" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Thursday, May 10, 2001 1:02 PM
Subject: Bug report


>
>
> UdmSearch version: mnoGoSearch for Windows 3.1.12.11 (trial)
> Platform:  P II - 200 Mhz - 128 Mb RAM
> OS:Win98
> Database:  MsSql 6.5 - SP 5a
> Statistics:
>
>
> When I start indexing, the program stop immediately, whit the following
ODBC error:
>
> [SQL Server Driver][SQL Server]The column content_type in table url may
not be null[SQL Server Driver][SQL Server]The column title in table url may
not be null[]
>
> The tables (created with the ms_sql.sql script) on the MsSql DB are empty,
> and I\'m using the MS Data Access 2.6 (SqlServer driver ver.
2000.80.194.00)
>
>
> ___
> If you want to unsubscribe send "unsubscribe general"
> to [EMAIL PROTECTED]
>
>

 ms_sql.sql


Bug report

2001-05-10 Thread Molara Federico



UdmSearch version: mnoGoSearch for Windows 3.1.12.11 (trial)
Platform:  P II - 200 Mhz - 128 Mb RAM
OS:Win98
Database:  MsSql 6.5 - SP 5a
Statistics:


When I start indexing, the program stop immediately, whit the following ODBC error:

[SQL Server Driver][SQL Server]The column content_type in table url may not be 
null[SQL Server Driver][SQL Server]The column title in table url may not be null[]

The tables (created with the ms_sql.sql script) on the MsSql DB are empty,
and I\'m using the MS Data Access 2.6 (SqlServer driver ver. 2000.80.194.00)


___
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]