On 6 May 00, at 14:20, abhishek shodhan wrote:

I think there are several applications out there that do this for you
already. Do a web search or look on Gamelan or similar Java
repositories. That's easier than writing your own.

> Hi all...
>
>      I need to develop a search servlet that will search for content both in
> my site and outside. How do I search the Meta Tags of pages?
You'll need to obtain a java class that can parse HTML. Then you'll
simply need to look for when you crawl across a meta-tag.

Unless your organization has a strict policy for how it defines meta-
tag information, meta-tags are likely to be useless for you. Just
simply having them there doesn't gurantee success. And I'd
imagine that 90% or more of your site doesn't even have any real
meta-tags (e.g. meta-tags populated with information besides the
content-generator information that standard Web development tools
like FrontPage put in an HTML page for you).

Building a search engine is non-trivial. Sure any Java programmer
worth 1/4 of their salary can figure out how to retrieve a Web page
and then parse the HTML and through that stuff into a database for
searching. But devloping true search engine requires much more
skill than that. That's why I would advise you to look into an
existing package, preferablly one that has been worked by
information professionals (e.g. people with library science training).
> And, for the rest of the web, do I need to tie up with Major Search
> Engines???
That would be much easier (and cheaper) than trying to build your
own Web crawler that crawls the entire Internet.

Mark
>
> Please advise me.
>
> Thanks!!
>
> Abhishek Shodhan.
>
>
>
> ____________________________________________________________________
> Get free email and a permanent address at http://www.amexmail.com/?A=1
>
> ===========================================================================
> To unsubscribe: mailto [EMAIL PROTECTED] with body: "signoff JSP-INTEREST".
> Some relevant FAQs on JSP/Servlets can be found at:
>
>  http://java.sun.com/products/jsp/faq.html
>  http://www.esperanto.org.nz/jsp/jspfaq.html
>  http://www.jguru.com/jguru/faq/faqpage.jsp?name=JSP
>  http://www.jguru.com/jguru/faq/faqpage.jsp?name=Servlets
>
>

===========================================================================
To unsubscribe: mailto [EMAIL PROTECTED] with body: "signoff JSP-INTEREST".
Some relevant FAQs on JSP/Servlets can be found at:

 http://java.sun.com/products/jsp/faq.html
 http://www.esperanto.org.nz/jsp/jspfaq.html
 http://www.jguru.com/jguru/faq/faqpage.jsp?name=JSP
 http://www.jguru.com/jguru/faq/faqpage.jsp?name=Servlets

Reply via email to