[ https://issues.apache.org/jira/browse/CONNECTORS-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tugba Dogan updated CONNECTORS-1162:
------------------------------------
    Attachment: Documentation.zip

Hi Karl,

I attached screenshots and the required document. I also fixed the exception handling. Can you check it? Here is the commit link:
https://github.com/tugbadogan/manifoldcf/commit/f69946bf35bea88c2ac853fa158dc69b0dc4231b

I searched for embedded Kafka server and ZooKeeper examples and found this:
https://gist.github.com/fjavieralba/7930018

I will try to implement an integration test using these code pieces, but I'm not sure whether it is feasible.

> Apache Kafka Output Connector
> -----------------------------
>
>                 Key: CONNECTORS-1162
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1162
>             Project: ManifoldCF
>          Issue Type: Wish
>    Affects Versions: ManifoldCF 1.8.1, ManifoldCF 2.0.1
>            Reporter: Rafa Haro
>            Assignee: Karl Wright
>              Labels: gsoc, gsoc2015
>             Fix For: ManifoldCF 2.3
>
>         Attachments: 1.JPG, 2.JPG, Documentation.zip
>
>
> Kafka is a distributed, partitioned, replicated commit log service. It
> provides the functionality of a messaging system, but with a unique design. A
> single Kafka broker can handle hundreds of megabytes of reads and writes per
> second from thousands of clients.
> Apache Kafka is used for a number of use cases. One of them is as a feeding
> system for streaming Big Data processes in either Apache Spark or Hadoop
> environments. A Kafka output connector could be used to stream or dispatch
> crawled documents or metadata into a Big Data processing pipeline.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)