[ https://issues.apache.org/jira/browse/CTAKES-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14146792#comment-14146792 ]
jay vyas commented on CTAKES-314:
---------------------------------
Thanks Pei. So far the data flow looks like this:
{noformat}
twitter-stream-1 -> spark_streaming
other-data-stream-2 -> spark_streaming
... -> spark_streaming
spark_streaming -> aggregatedRDD -> ctakes -> topic_categorized_tweets
{noformat}
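To make the flow above concrete, here is a rough sketch in plain Python. It does not use the Spark Streaming or cTAKES APIs: the lists stand in for one micro-batch from each stream, and `categorize`, `run_batch`, and `TOPIC_KEYWORDS` are hypothetical names where a keyword match takes the place of the cTAKES step.

```python
from collections import defaultdict
from itertools import chain

# Hypothetical keyword lists; a real pipeline would call cTAKES here instead.
TOPIC_KEYWORDS = {
    "medication": {"aspirin", "ibuprofen", "dose"},
    "symptom": {"headache", "fever", "cough"},
}


def categorize(text):
    """Stand-in for the cTAKES step: return topics whose keywords appear in text."""
    words = set(text.lower().split())
    topics = sorted(t for t, kws in TOPIC_KEYWORDS.items() if words & kws)
    return topics or ["uncategorized"]


def run_batch(*streams):
    """Union one batch from every input stream, then group the texts by topic."""
    aggregated = chain.from_iterable(streams)  # plays the role of the aggregated RDD
    by_topic = defaultdict(list)
    for text in aggregated:
        for topic in categorize(text):
            by_topic[topic].append(text)
    return dict(by_topic)


# One micro-batch per source (made-up sample data).
twitter_stream_1 = ["Took aspirin for my headache", "Nice weather today"]
other_data_stream_2 = ["Fever and cough since Monday"]

topic_categorized_tweets = run_batch(twitter_stream_1, other_data_stream_2)
```

In a real job the union and grouping would presumably be done by Spark over each batch, with the per-record function wrapping a cTAKES pipeline; this only shows the shape of the data at each arrow.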
What can we do with it other than categorize the topics? I know there are lots of
possibilities.
> BigTop/Hadoop cTAKES integration
> --------------------------------
>
> Key: CTAKES-314
> URL: https://issues.apache.org/jira/browse/CTAKES-314
> Project: cTAKES
> Issue Type: New Feature
> Reporter: Pei Chen
> Fix For: future enhancement
>
> Attachments: Napkin_cTAKES_Hadoop.JPG
>
>
> Placeholder to create a simple application that can take in different data
> sources (public forums, Twitter, etc.) and scale up cTAKES using the
> BigTop/Hadoop ecosystem.