[
https://issues.apache.org/jira/browse/CONNECTORS-1066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14163794#comment-14163794
]
Karl Wright commented on CONNECTORS-1066:
-----------------------------------------
bq. It is wrong to put blank to RepositoryDocument in repository connector.
I am not sure I understand this. Perhaps you can clarify?
The way I am thinking is this: In a job, we do not know either full details of
the repository connector or output connector. We do not know if the repository
can include blank values or not, and we do not know if the output index treats
blank values the same as null values. Therefore, the *framework* must not make
any judgements here. But still, it is reasonable to have an *option* to remove
blank metadata values from a document, because as Alessandro points out, most
of the time these are just confusing to people. I believe that this should not
happen silently, but since it is common, it makes sense for it to happen
*easily*, without a lot of configuration work.
My suggestion to add this functionality to the Metadata Adjuster transformer
would achieve this goal, especially if the default configuration of the
Metadata Adjuster transformer would be to remove blanks. There would have to
be a checkbox that you could uncheck to leave blanks in the metadata.
> Don't send empty fields to Solr
> -------------------------------
>
> Key: CONNECTORS-1066
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1066
> Project: ManifoldCF
> Issue Type: Improvement
> Components: Lucene/SOLR connector
> Affects Versions: Manifold 1.7.1
> Reporter: Karl Wright
> Assignee: Karl Wright
> Labels: solr
> Fix For: ManifoldCF 2.0
>
> Attachments: CONNECTORS-1066.patch
>
>
> Don't transmit empty fields to Solr.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)