Re: Plans for SolrDocument

Chris Hostetter Thu, 30 Aug 2007 14:39:31 -0700

imho, SolrDocument's role is best kept as a kind of value object used only
for communications with remote clients.  I don't think it should appear much
in the underlying codebase, where org.apache.lucene.Document is (i) already
used heavily, and (ii) does everything you need it to, including storing
arbitrary objects and permitting multivalued fields, albeit by different

I don't tend to follow discussions of updating internals that closely
(that's more the forte of Mike, Yonik and Ryan) but one thing you left off
your list is schema awareness and FieldType encoding.  The lucene Document
API has a lot of flaws in it, and exposes a lot of options that have no
meaning in Solr because the schema controls those options per field/type
... my understanding about the intended use of SolrDocument in the
internals is to provide a clean solid API for people writing
UpdateHandlers to use when specifying field values (without being
distracted by the other misleading methods in the lucene Document class)
and to provide a clear abstraction between real values and field type
encoded values.

ie: UpdateHandlers construct SolrDocuments which always contain real
values, so these UpdateHandlers can be largely ignorant about schema
information.  There is one mechanism for converting a SolrDocument to a
lucene Document; this method knows about the schema, the field types, and
the field options.  All low level code then uses the lucene Document.
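
To make that split concrete, here is a rough sketch of the idea.  It is not
Solr's actual code: every class and method name in it (SimpleSolrDoc,
SchemaField, FieldType.toInternal, IndexField, toIndexDocument) is a
hypothetical stand-in, and the zero-padded integer encoding is only an
illustration, not the encoding any real Solr field type uses.  The point is
just that the update-handler side only ever touches real values, and a
single conversion step is the only place that consults the schema.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class SchemaConversionSketch {

    /** Hypothetical stand-in for a field type: encodes a real value for the index. */
    interface FieldType {
        String toInternal(Object realValue);
    }

    /** Hypothetical stand-in for the per-field options the schema controls. */
    static final class SchemaField {
        final String name;
        final FieldType type;
        final boolean stored;
        final boolean indexed;

        SchemaField(String name, FieldType type, boolean stored, boolean indexed) {
            this.name = name;
            this.type = type;
            this.stored = stored;
            this.indexed = indexed;
        }
    }

    /** Hypothetical stand-in for SolrDocument: field names mapped to real values only. */
    static final class SimpleSolrDoc {
        private final Map<String, List<Object>> fields = new LinkedHashMap<>();

        void addField(String name, Object value) {
            fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
        }

        Map<String, List<Object>> getFields() {
            return fields;
        }
    }

    /** Hypothetical stand-in for a low-level index field: encoded value plus options. */
    static final class IndexField {
        final String name;
        final String encodedValue;
        final boolean stored;
        final boolean indexed;

        IndexField(String name, String encodedValue, boolean stored, boolean indexed) {
            this.name = name;
            this.encodedValue = encodedValue;
            this.stored = stored;
            this.indexed = indexed;
        }
    }

    /** The one conversion point: the only code here that looks at the schema. */
    static List<IndexField> toIndexDocument(SimpleSolrDoc doc, Map<String, SchemaField> schema) {
        List<IndexField> out = new ArrayList<>();
        for (Map.Entry<String, List<Object>> entry : doc.getFields().entrySet()) {
            SchemaField sf = schema.get(entry.getKey());
            if (sf == null) {
                throw new IllegalArgumentException("unknown field: " + entry.getKey());
            }
            // A multivalued field becomes one low-level field per value.
            for (Object realValue : entry.getValue()) {
                out.add(new IndexField(sf.name, sf.type.toInternal(realValue), sf.stored, sf.indexed));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, SchemaField> schema = new LinkedHashMap<>();
        schema.put("id", new SchemaField("id", v -> v.toString(), true, true));
        // Illustrative encoding only: zero-pad integers to a fixed width.
        schema.put("popularity", new SchemaField("popularity",
                v -> String.format(Locale.ROOT, "%010d", v), false, true));

        // The update-handler side: real values, no schema knowledge.
        SimpleSolrDoc doc = new SimpleSolrDoc();
        doc.addField("id", "doc-1");
        doc.addField("popularity", 42);

        for (IndexField f : toIndexDocument(doc, schema)) {
            System.out.println(f.name + " = " + f.encodedValue
                    + " (stored=" + f.stored + ", indexed=" + f.indexed + ")");
        }
    }
}
```

Note that the code building the document never sets stored/indexed flags or
encodes anything itself; those decisions live only in the schema and the
conversion method.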



-Hoss
