[ https://issues.apache.org/jira/browse/CONNECTORS-1162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595261#comment-14595261 ]
Tugba Dogan commented on CONNECTORS-1162: ----------------------------------------- Hi Karl, I implemented the ingestion activity for Kafka output. Now, I will test it with different document repositories. Here is the commit link: https://github.com/tugbadogan/manifoldcf/commit/72eaed077b970624b730201f520cdfd3d0daec5a I have a question about something. In Kafka api, send() method works asynchronously as I understand from the following javadoc: http://kafka.apache.org/082/javadoc/index.html?org/apache/kafka/clients/producer/KafkaProducer.html So, I don't understand whether send operation is successful or not after calling the method. Can you suggest any way to deal with this situation ? > Apache Kafka Output Connector > ----------------------------- > > Key: CONNECTORS-1162 > URL: https://issues.apache.org/jira/browse/CONNECTORS-1162 > Project: ManifoldCF > Issue Type: Wish > Affects Versions: ManifoldCF 1.8.1, ManifoldCF 2.0.1 > Reporter: Rafa Haro > Assignee: Karl Wright > Labels: gsoc, gsoc2015 > Fix For: ManifoldCF 1.10, ManifoldCF 2.2 > > Attachments: 1.JPG, 2.JPG > > > Kafka is a distributed, partitioned, replicated commit log service. It > provides the functionality of a messaging system, but with a unique design. A > single Kafka broker can handle hundreds of megabytes of reads and writes per > second from thousands of clients. > Apache Kafka is being used for a number of uses cases. One of them is to use > Kafka as a feeding system for streaming BigData processes, both in Apache > Spark or Hadoop environment. A Kafka output connector could be used for > streaming or dispatching crawled documents or metadata and put them in a > BigData processing pipeline -- This message was sent by Atlassian JIRA (v6.3.4#6332)