RE: search engine

2009-11-16 Thread Neil Aggarwal
Jill:

 Is there any search engine you would recommend that could 
 search public, and non public( page needs login) pages?

If your pages are HTML, you can use something like 
HtDig:
http://www.htdig.org/

If your pages are part of a web app, I have done
this in the past:
1. Write some code to pull the text content from
each page and store them in a MySQL table
with a full text index.
2. When your users perform a search, you run
a full text search query and return
the result.

I hope this helps,
  Neil

--
Neil Aggarwal, (281)846-8957, http://UnmeteredVPS.net
Host your tomcat app on a CentOS VPS for only $25/month!
Unmetered bandwidth, 7 day no risk trial, Google Checkout


-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



RE: search engine

2009-11-16 Thread Jill Han
There are .html, .php, .jsp, .pdf pages on the apache server.

Thanks,

Jill
-Original Message-
From: Neil Aggarwal [mailto:n...@jammconsulting.com] 
Sent: Monday, November 16, 2009 9:15 AM
To: 'Tomcat Users List'
Subject: RE: search engine
X-HOSTLOC: alverno.edu/10.0.60.10

Jill:

 Is there any search engine you would recommend that could 
 search public, and non public( page needs login) pages?

If your pages are HTML, you can use something like 
HtDig:
http://www.htdig.org/

If your pages are part of a web app, I have done
this in the past:
1. Write some code to pull the text content from
each page and store them in a MySQL table
with a full text index.
2. When your users perform a search, you run
a full text search query and return
the result.

I hope this helps,
  Neil

--
Neil Aggarwal, (281)846-8957, http://UnmeteredVPS.net
Host your tomcat app on a CentOS VPS for only $25/month!
Unmetered bandwidth, 7 day no risk trial, Google Checkout


-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org


-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



Re: search engine

2009-11-16 Thread Konstantin Kolinko
2009/11/16 Jill Han jill@alverno.edu:
 Sorry, for the non-tomcat issue, but I still hope I can get helps here.
 Is there any search engine you would recommend that could search public, and 
 non public( page needs login) pages?

 Thanks as always,

 Jill


Maybe you should look at
http://lucene.apache.org/

I have not used it yet, but at least they have more knowledge.


Best regards,
Konstantin Kolinko

-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



Re: search engine

2009-11-16 Thread Pid

On 16/11/2009 14:34, Jill Han wrote:

Sorry, for the non-tomcat issue, but I still hope I can get helps here.
Is there any search engine you would recommend that could search public, and 
non public( page needs login) pages?

Thanks as always,

Jill



If you have a question we recommend you start by starting a new email to 
the list, rather than by replying to an existing email, which is called 
'thread hijacking'.



p




-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org



Re: search engine

2009-11-16 Thread André Warnier

Jill Han wrote:

Sorry, for the non-tomcat issue, but I still hope I can get helps here.


You are right, this is totally off-topic for this list.
But even so,


Is there any search engine you would recommend that could search public,


You mean, like Google, Yahoo etc.. ?

 and non public( page needs login) pages?


How would it do that ? ask you each time it encounters a page with a 
login ? How would it even determine that this page asks for a login ?
(Well ok, if it requires a Basic authentication, then maybe it could, 
but still).


-
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org