Re: Forrest - Lucene Help needed

2005-11-08 Thread David Crossley
Karthik Navaneethan wrote:
> Is there any way for raw HTML included as static content in the
> src/documentation/content directory to be indexed and searched by
> Lucene? The default site created using Forrest seed appears to index
> and search only the xml files.

You would need to add a project sitemap that performs operations
similar to Forrest's default search, which is handled by the
main/webapp/search.xmap sitemap. As a quick fix, you could add
extra matches to main/webapp/search.xmap to also handle your
specific raw html docs.

That is quite a complex sitemap task, so you would need
to learn about sitemaps before tackling that.

Your sitemap might need to transform your html using JTidy.
There are examples of that in the Forrest sitemaps too.
See also the FAQ at lucene.apache.org.
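
A rough, untested sketch of what such a project sitemap match might look
like. It assumes that Cocoon's JTidy-based "html" generator is declared in
the Forrest sitemaps, that the {project:content} and {forrest:stylesheets}
variables resolve as shown, and that an html2document.xsl stylesheet is
available. The rawhtml/ directory is purely hypothetical. Check every name
against your own Forrest version before relying on it.

```xml
<?xml version="1.0"?>
<!-- Hypothetical project sitemap, e.g. src/documentation/sitemap.xmap -->
<map:sitemap xmlns:map="http://apache.org/cocoon/sitemap/1.0">
  <map:pipelines>
    <map:pipeline>
      <!-- Expose an XML view of each raw HTML page under the
           hypothetical rawhtml/ directory -->
      <map:match pattern="rawhtml/**.xml">
        <!-- Assumption: the "html" generator cleans the page with JTidy
             and emits well-formed XML -->
        <map:generate type="html" src="{project:content}rawhtml/{1}.html"/>
        <!-- Assumption: this stylesheet converts the tidied HTML to
             Forrest's document format -->
        <map:transform src="{forrest:stylesheets}/html2document.xsl"/>
        <map:serialize type="xml"/>
      </map:match>
    </map:pipeline>
  </map:pipelines>
</map:sitemap>
```

Whether the indexing pipelines in search.xmap pick up such an XML view
without further changes is not something this sketch establishes.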

If any of that is helpful, then we should add some
tips to:
http://forrest.apache.org/docs/searching.html

-David


Re: Forrest-Lucene Help needed

2005-11-09 Thread David Crossley
Please do not send multiple requests without waiting for
answers. The world is round: some people are asleep and
others are busy. Someone will try to help you sooner or
later, so don't panic.

More below ...

Karthik Manimaran wrote:
> Hi,
>  I am trying to make the raw HTML included as static content in
> src/documentation/content directory be indexed and searched by Lucene.
>  I have a directory src/documentation/content/javadocs which contains
> javadocs for some components as html/other files.
>  I need pointers to the changes that must be made in search.xmap. I guess
> tweaking the following map should bring about the desired output, but I
> have been unable to succeed.

Don't guess. If you want to do such advanced things
with Forrest, then you will need to take some time
to learn the Cocoon sitemap.
http://forrest.apache.org/docs/project-sitemap.html
has some good references directly to the Cocoon docs.

I noticed in your subsequent posting to our "dev" list
that you had multiple generators and serializers.
Cocoon does not work like that: each matched pipeline
has a single generator and a single serializer. You will
get seriously frustrated by just guessing. Try some simple
enhancements first.
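
To illustrate the shape Cocoon expects, here is a minimal, untested
fragment that would sit inside a map:pipeline element of a sitemap. It is
not the snippet from either posting. The pattern, the "html" generator and
the {project:content} variable are assumptions to verify against your
installation.

```xml
<map:match pattern="javadocs/**.html">
  <!-- One generator starts the pipeline -->
  <map:generate type="html" src="{project:content}javadocs/{1}.html"/>
  <!-- Zero or more map:transform steps may follow here -->
  <!-- One serializer ends the pipeline -->
  <map:serialize type="html"/>
</map:match>
```

With a pattern like this, {1} stands for whatever the ** wildcard matched
in the request path.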

What you have below is closer to working than what you
sent to the dev list.

>   
> 
> 

Your files would have the .html extension, I presume.
Also, you placed the files in content/javadocs/
So perhaps (I didn't try it) ...

However make sure that your browser request is
localhost:/javadocs/blah.html

> 

There is a syntax error with that extra space. Anyway, it will
be exactly the path in the test above.

By the way, be careful about making changes to the core
of Forrest. It is better to use a project sitemap.

-David

>  Thanks and regards,
> Karthik.