1. SolrCell (ExtractingRequestHandler) - extract and index content from rich documents, such as PDF, Office docs, HTML (uses Tika)
2. Clustering - for result clustering.
3. Language identification (two update processors) - analyzes text of fields to determine language code.

None of those is mandatory, which is why they have separate libs.

-- Jack Krupansky

-----Original Message----- From: Raheel Hasan
Sent: Wednesday, June 05, 2013 5:57 AM
To: solr-user@lucene.apache.org
Subject: Files included from the default SolrConfig

Hi,

I am trying to optimize solr.

The default solrConfig that comes with solr>collection1 has a lot of libs
included I dont really need. Perhaps if someone could help we identifying
the purpose. (I only import from DIH):

Please tell me whats in these:
contrib/extraction/lib
solr-cell-

contrib/clustering/lib
solr-clustering-

contrib/langid/lib/
solr-langid


--
Regards,
Raheel Hasan

Reply via email to