Yes a profilenet is what Mark describes. In our Verity profilenet we got ~50.000 profiles (queries) the performance is fine around 20-25 documents / second.
>From what we can tell the matches are accurate unfortunately I don't have any >ideas on how verity does this under the hood so I don't know if there is any >approximation involved. We do however get information about the each query >that hits such as score and words (with their position within the document). We need this kind of functionality since we are monitoring the incoming documents for our customers. --- Answer to Marks first mail--- MoreLikeThis: I get a feeling that it would be very hard to do this using this kind of query, how do I index the queries with operators such as NOT, NEAR and WildCard? Taxonomy/Classification I'm totally lost here ;) Any one that knows what to look for in this case? MemoryIndex: We have ran benchmarks using this technique it wasn't enough, if I recall we could run like 10.000 profiles with good performance. And as you say it doesn't scale well at all. / Regards Marcus -----Ursprungligt meddelande----- Från: Mark Miller [mailto:[EMAIL PROTECTED] Skickat: den 17 januari 2008 13:58 Till: java-user@lucene.apache.org Ämne: Re: Inverted search / Search on profilenet Verity, autonomy, whatever, has a what they call a reverse query system called profilenet. A profile is just a query (or I guess more than one query?) and you can setup a bunch of them. Then you supply the document and you will get the matching queries as well as a score. They say its the opposite of doing a search with a query and getting back docs. Instead you do a search with a doc and get back these queries. They claim it can be used for things like taxonomy/classification among other things. I don't know how true this is to a real reverse query system as that would seem to be kind of slow -- my guess is its a bit of an approximation. - Mark Endre Stølsvik wrote: > > May I ask: What IS a profilenet? I ask since this obviously is > something that you two hit off on right away, while I haven't heard of..! > > > Thanks, > Endre. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]