[
https://issues.apache.org/jira/browse/CTAKES-314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
jay vyas updated CTAKES-314:
----------------------------
Attachment: spark-streaming-twitter.tar
heres the patch . just untar it and the README should explain everything .
Let me know if it looks good ! Should have a tree like this under
ctakes-examples (or wherever you decide to put the sparkStreaming app).
{noformat}
[jay@rhbd ctakes]$ tree
.
├── build.sbt
├── org
│ └── apache
│ └── ctakes
│ └── dictionary
│ └── lookup
│ └── LookupDesc_Db.xml
├── README.md
├── spark-streaming-twitter.tar
├── src
│ ├── main
│ │ ├── java
│ │ ├── resources
│ │ └── scala
│ │ └── sparkapps
│ │ ├── ctakes
│ │ │ ├── CTakesExample.scala
│ │ │ ├── CTakesTwitterStreamingApp.scala
│ │ │ ├── Parser.scala
│ │ │ ├── TermAnalyzer.scala
│ │ │ ├── TwitterInputDStreamCTakes.scala
│ │ │ ├── TwitterUtilsJ.scala
│ │ │ └── Utils.scala
│ │ └── SparkApp1.scala
│ └── test
│ ├── java
│ ├── resources
│ └── scala
│ ├── TestSpark.scala
│ └── TestStreaming.scala
└── twitter
{noformat}
> BigTop/Hadoop cTAKES integration
> --------------------------------
>
> Key: CTAKES-314
> URL: https://issues.apache.org/jira/browse/CTAKES-314
> Project: cTAKES
> Issue Type: New Feature
> Affects Versions: 3.2.0
> Reporter: Pei Chen
> Fix For: 3.2.3
>
> Attachments: Napkin_cTAKES_Hadoop.JPG, fix.diff,
> spark-streaming-twitter.tar
>
>
> Placeholder to-
> Create a simple application that can take in different datasources (public
> forums, twitter, etc.), scale up cTAKES using BigTop/Hadoop ecosystem.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)