Hello people,

I was just wondering how to avoid that the content-type string is split in to multiple values. For example: If a document has the content-type: "Application/pdf" it is broken into three pieces "Application/pdf", "Application", "pdf" in the solr filed type.

I am not sure if this is done by nutch, or if it is an index topic in solr.

Sure someone knows the answer to that.

Thank you.

Reply via email to