Let me see if I understand something. Search normally finds resources based on their properties. When you use CONTAINS in your search, it checks against the actual contents of the resource for a match. Does that sound right?
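
To make sure I have the distinction right, here is a minimal sketch of the two kinds of DASL request body as I understand them from the DASL basicsearch grammar: one that compares a property value, and one that uses CONTAINS against the content. The class name, the helper names and the /slide/files scope are my own placeholders, not anything from Slide, and the strings are not XML-escaped.

```java
public class DaslQueries {
    // Wraps a <D:where> condition in a DASL basicsearch request body.
    // Note: arguments are inserted as-is, with no XML escaping (sketch only).
    static String basicSearch(String scopeHref, String whereClause) {
        return "<?xml version=\"1.0\"?>\n"
             + "<D:searchrequest xmlns:D=\"DAV:\">\n"
             + "  <D:basicsearch>\n"
             + "    <D:select><D:prop><D:displayname/></D:prop></D:select>\n"
             + "    <D:from><D:scope>\n"
             + "      <D:href>" + scopeHref + "</D:href>\n"
             + "      <D:depth>infinity</D:depth>\n"
             + "    </D:scope></D:from>\n"
             + "    <D:where>" + whereClause + "</D:where>\n"
             + "  </D:basicsearch>\n"
             + "</D:searchrequest>\n";
    }

    // Property search: matches resources whose getcontenttype equals the literal.
    static String propertyQuery(String scopeHref, String contentType) {
        return basicSearch(scopeHref,
            "<D:eq><D:prop><D:getcontenttype/></D:prop>"
          + "<D:literal>" + contentType + "</D:literal></D:eq>");
    }

    // Content search: matches resources whose content contains the text.
    static String containsQuery(String scopeHref, String text) {
        return basicSearch(scopeHref, "<D:contains>" + text + "</D:contains>");
    }

    public static void main(String[] args) {
        // "/slide/files" is a hypothetical collection path.
        System.out.println(propertyQuery("/slide/files", "application/msword"));
        System.out.println(containsQuery("/slide/files", "budget"));
    }
}
```

Either body would be sent with the WebDAV SEARCH method; only the where clause differs between the two.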

DASL is the protocol-level search, while org.apache.slide.search is the Java API for doing searches. Can you run all the same types of searches with either one?


From: <[EMAIL PROTECTED]> Reply-To: "Slide Developers Mailing List" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Subject: RE: Full Text Search for MS Word and Excel files? Date: Tue, 24 Feb 2004 17:01:20 +0100

Great if you could volunteer for Extractor!

Well, I'm hoping I'll get to do something like that...



From: <[EMAIL PROTECTED]> Reply-To: "Slide Developers Mailing List" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Subject: RE: Full Text Search for MS Word and Excel files? Date: Tue, 24 Feb 2004 17:01:20 +0100

Two possible ways: Write a Lucene index for those properties, then
you may query them as if they were content, or get the properties
from the extractor into the NodeProperties. If this is done, they
can be queried by DASL as if they were PROPPATCHed into the
WebDAV resource.


So, with the Lucene index, I get something like this?


webapp --> org.apache.slide.search --> ContentStore --> WordDocIndexer

Now, if I pull the text out of a Word doc and create a Lucene index with it, does that mean my index is a new piece of content? Or is my index just a property of the original Word doc? Or is an index just something separate that is only related to searches that use CONTAINS?
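
To make the third possibility concrete, here is a minimal sketch of what I imagine by "something separate": a structure that maps terms from the extracted text back to resource URIs and is only consulted for CONTAINS-style lookups. This is only my guess at the model. It uses plain Java collections, not the Lucene or Slide APIs, and the class name, method names and URIs are hypothetical.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

public class TextIndexSketch {
    // term -> URIs of the resources whose extracted text contains that term
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Index the text extracted from one resource; the resource itself is untouched.
    void add(String uri, String extractedText) {
        for (String term : extractedText.toLowerCase().split("\\W+")) {
            if (term.isEmpty()) {
                continue;
            }
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(uri);
        }
    }

    // Single-term lookup, standing in for a CONTAINS search.
    Set<String> contains(String term) {
        return postings.getOrDefault(term.toLowerCase(), Collections.emptySet());
    }

    public static void main(String[] args) {
        TextIndexSketch index = new TextIndexSketch();
        // Hypothetical URIs and text, as an extractor might hand them over.
        index.add("/files/report.doc", "Quarterly budget report");
        index.add("/files/notes.doc", "Meeting notes about the budget");
        System.out.println(index.contains("budget"));
        System.out.println(index.contains("quarterly"));
    }
}
```

In this picture the index is neither new content nor a property of the Word doc; it lives beside the store and only holds pointers back to the resources.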

regards,

Ryan Rhodes






