Magnus Ebbesson created CONNECTORS-475:
------------------------------------------

             Summary: A Hydra Output Connector
                 Key: CONNECTORS-475
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-475
             Project: ManifoldCF
          Issue Type: New Feature
            Reporter: Magnus Ebbesson
            Priority: Minor


Hydra Processing Framework was recently released into the wild. 

Hydra offers to solve the the missing piece into creating great consolidated 
search solutions. 

What is Hydra?
When working with free text search using for example Apache Solr the quality of 
the data in the index is a key factor of the quality of the results delivered. 
Hydra is designed to give the search solution the tools necessary to modify the 
data that is to be indexed in an efficient and flexible way. This is done by 
providing a scalable and efficient pipeline which the documents will have to 
pass through before being indexed into the search engine.

Architecturally Hydra sits in between the search engine and the source 
integration. A common use-case would be to use Apache Manifold CF to crawl a 
folder on a filesystem and send the documents to hydra which in turn will 
process and dispatch processed documents to Solr for indexing.

More information and code to the framework on
https://github.com/Findwise/Hydra


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to