Re: Question on jcr:deref usage

Marcel Reutegger Fri, 24 Nov 2006 08:59:11 -0800

Lei Zhou wrote:

Thanks Marcel!
So it seems that due to the limitation of JCR (no aggregation query support), it would be much slower to support this type of application than with an RDBMS. Is that a correct assessment?

An RDBMS certainly provides a wider range of operations through SQL than JCR with the current set of XPath or SQL syntax. Depending on your needs, some of the queries won't be possible in JCR, but others will simply be obsolete. E.g. in JCR you don't have to execute a query to follow a reference; you simply call the method Property.getNode().

Also, to articulate, if I have to present users with a query result view that is categorized (or grouped) by ProductName, I'd have to do the following:
1. Run query #1
//element(*, Document)[EMAIL PROTECTED] = 'Manual' and jcr:contains(@description,'maintenance')]
2. Iterate through the entire RowIterator (may have thousands of entries), use Java code to create an aggregated collection of ProductName/ProductReference pairs
    (since JCR doesn't have this type of query),
3. No "Order By" clause is used because the ProductReferences won't be in the same order as
    the ProductNames, so manual sorting is required in Java post-processing


The same can be achieved in one step:

//element(*, Document)[EMAIL PROTECTED] = 'Manual' and jcr:contains(@description,'maintenance')]/jcr:deref(@ProductReference, *) order by @ProductName


This will return a list, ordered by product name, of the products that have matches.

4. Depending on which category the user has selected to expand, run query #2, limiting results to that single product category:
    (query #2)
//element(*, Document)[EMAIL PROTECTED] = 'Manual' and jcr:contains(@description,'maintenance') and @ProductReference = '<uuid-of-Product-#1>']


Correct.

5. Again, product names have to be de-referenced manually, and ordering has to be moved from
    the query to the Java post-processing

This step I don't understand. What's the purpose of this step and why is it needed? Isn't all the information already available?

I'm fairly new to JCR and Jackrabbit. I've found them very helpful in many aspects of managing content. But I do feel that certain improvements could make Jackrabbit a better choice for enterprise use.

#1. In the many years of enterprise application development, I've seen a lot of our content-based applications in need of support for complicated search, e.g. search by arbitrary combination of document properties, and grouping of search results (it is not uncommon to see 2, even 3 levels of nested grouping).
-- Aggregations and Joins are definitely a big plus for querying a complicated content model.

Such requirements are also discussed in the expert group of JSR 283. You can comment on the current spec and post enhancement wishes to [EMAIL PROTECTED]

I've seen posts mentioning use of Node references to compensate for the lack of SQL Join, but what if I need to perform a search like below (ProductNames, Regions and AvailableFors would most likely be categories that are referenced by all documents):
     FIND all manuals
     THAT (ProductName is 'TV' or 'VCR' or 'DVD')
     and (Region is 'North America' or 'Europe')
     and (AvailableFor is 'distributor' or 'repairHouse')
     GROUP BY Region, ProductName

Such a query is certainly not possible with the current set of XPath or SQL in JCR. You would have to break up the query into multiple queries, e.g. retrieve the uuids for products with names 'TV', 'VCR' and 'DVD' and use those uuids in a query. The same applies to Region and AvailableFor.
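
To illustrate the second half of that approach, here is a minimal sketch in plain Java of assembling the follow-up XPath statement once the uuids have been retrieved. It only builds the query string; running it against a repository is left out. The class and method names, the placeholder uuid values, and the RegionReference property name are assumptions for illustration (only ProductReference appears in this thread), and the uuids are assumed to contain no quote characters.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical helper, not part of JCR or Jackrabbit.
class FollowUpQuery {

    // Builds "(@prop = 'a' or @prop = 'b')" from uuids fetched by an earlier query.
    static String anyOf(String property, List<String> uuids) {
        if (uuids.isEmpty()) {
            throw new IllegalArgumentException("no uuids for " + property);
        }
        return uuids.stream()
                .map(u -> "@" + property + " = '" + u + "'")
                .collect(Collectors.joining(" or ", "(", ")"));
    }

    // Joins all predicates with "and" inside a single element() test.
    static String buildQuery(String nodeType, List<String> predicates) {
        return "//element(*, " + nodeType + ")["
                + String.join(" and ", predicates) + "]";
    }

    public static void main(String[] args) {
        // Placeholder uuids; in practice they come from the first round of queries.
        String query = buildQuery("Document", Arrays.asList(
                "jcr:contains(@description,'maintenance')",
                anyOf("ProductReference", Arrays.asList("uuid-tv", "uuid-vcr", "uuid-dvd")),
                anyOf("RegionReference", Arrays.asList("uuid-na", "uuid-eu"))));
        System.out.println(query);
    }
}
```

The resulting string would be passed to the query manager like any other XPath statement; one such group of "or" comparisons is needed per referenced category.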


IMO XQuery would be a nice fit for those requirements.
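
Since the current query syntax has no grouping, the GROUP BY Region, ProductName part of the request above would be left to application code. A minimal sketch of that post-processing in plain Java follows. The Hit class is a hypothetical holder for values already read from each result (region name, product name, document title); nothing here is JCR or Jackrabbit API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical post-processing helper, not part of JCR or Jackrabbit.
class GroupResults {

    // Values assumed to have been read from one query result beforehand.
    static final class Hit {
        final String region;
        final String productName;
        final String title;

        Hit(String region, String productName, String title) {
            this.region = region;
            this.productName = productName;
            this.title = title;
        }
    }

    // Two-level grouping: region -> product name -> document titles.
    // TreeMap keeps both levels sorted by key, so no separate sort is needed.
    static Map<String, Map<String, List<String>>> group(List<Hit> hits) {
        Map<String, Map<String, List<String>>> byRegion = new TreeMap<>();
        for (Hit h : hits) {
            byRegion.computeIfAbsent(h.region, r -> new TreeMap<>())
                    .computeIfAbsent(h.productName, p -> new ArrayList<>())
                    .add(h.title);
        }
        return byRegion;
    }
}
```

A third level of nesting would just add one more computeIfAbsent step. For result sets with thousands of entries, every hit still has to be read once, which is the cost the original question is concerned about.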

#2. For the RDBMS-based repository, the current DB schema is not very convincing for large enterprise-level applications. A more normalized schema might help both performance and #1, but yes, more DB-level code may be needed (for performance's sake) and that may limit the portability of the product.

I'm not sure that's really the case. Usually a normalized schema means lower performance. There were attempts to create a persistence manager using a normalized schema, but in the end the currently used schema turned out to be the most practical one.


regards
 marcel
