1. SolrCell (ExtractingRequestHandler) - extract and index content from rich
documents, such as PDF, Office docs, HTML (uses Tika)
2. Clustering - for result clustering.
3. Language identification (two update processors) - analyzes text of fields
to determine language code.
None of those is mandatory, which is why they have separate libs.
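If you only import through DIH, you can comment out the matching <lib>
directives in solrconfig.xml. Here is a rough sketch, assuming the stock
Solr 4.x example config. The dir paths and regex values below are
assumptions, so compare them with what your own file contains rather than
copying them:

```xml
<!-- solrconfig.xml: contrib <lib> directives disabled for a DIH-only setup.
     The dir/regex values are assumed from the stock example config
     and may differ in your install. -->

<!-- 1. SolrCell (ExtractingRequestHandler) plus its Tika dependencies
<lib dir="../../../contrib/extraction/lib" regex=".*\.jar" />
<lib dir="../../../dist/" regex="solr-cell-\d.*\.jar" />
-->

<!-- 2. Result clustering
<lib dir="../../../contrib/clustering/lib/" regex=".*\.jar" />
<lib dir="../../../dist/" regex="solr-clustering-\d.*\.jar" />
-->

<!-- 3. Language identification update processors
<lib dir="../../../contrib/langid/lib/" regex=".*\.jar" />
<lib dir="../../../dist/" regex="solr-langid-\d.*\.jar" />
-->

<!-- Keep whichever <lib> line loads the DataImportHandler jar. -->
```

If a request handler, search component, or update processor chain elsewhere
in the config still refers to classes from these jars, disable or remove it
as well, otherwise the class may fail to load when it is first used.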
-- Jack Krupansky
-----Original Message-----
From: Raheel Hasan
Sent: Wednesday, June 05, 2013 5:57 AM
To: solr-user@lucene.apache.org
Subject: Files included from the default SolrConfig
Hi,
I am trying to optimize Solr.
The default solrconfig.xml that comes with solr>collection1 includes a lot
of libs I don't really need. Perhaps someone could help me identify their
purpose (I only import from DIH).
Please tell me what's in these:
contrib/extraction/lib
solr-cell-
contrib/clustering/lib
solr-clustering-
contrib/langid/lib/
solr-langid
--
Regards,
Raheel Hasan