[
https://issues.apache.org/jira/browse/BAHIR-117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15998509#comment-15998509
]
ASF GitHub Bot commented on BAHIR-117:
--------------------------------------
Github user c-w commented on the issue:
https://github.com/apache/bahir/pull/43
Here are the relevant excerpts from the build log for the failure:
07:49:06 [INFO] Reactor Summary:
07:49:06 [INFO]
07:49:06 [INFO] Apache Bahir - Parent POM .......................... SUCCESS [ 4.278 s]
07:49:06 [INFO] Apache Bahir - Spark SQL Cloudant DataSource ....... SUCCESS [ 16.878 s]
07:49:06 [INFO] Apache Bahir - Spark Streaming Akka ................ SUCCESS [ 27.164 s]
**07:49:06 [INFO] Apache Bahir - Spark SQL Streaming Akka ............ FAILURE [01:16 min]**
07:49:06 [INFO] Apache Bahir - Spark Streaming MQTT ................ SKIPPED
07:49:06 [INFO] Apache Bahir - Spark SQL Streaming MQTT ............ SKIPPED
07:49:06 [INFO] Apache Bahir - Spark Streaming Twitter ............. SKIPPED
07:49:06 [INFO] Apache Bahir - Spark Streaming ZeroMQ .............. SKIPPED
07:49:06 [INFO] Apache Bahir - Spark Extensions Distribution ....... SKIPPED
07:49:06 # A fatal error has been detected by the Java Runtime Environment:
07:49:06 #
07:49:06 # SIGSEGV (0xb) at pc=0x00007f100f8f4988, pid=5934, tid=139706863331072
07:49:06 #
07:49:06 # JRE version: OpenJDK Runtime Environment (8.0_91-b14) (build 1.8.0_91-b14)
07:49:06 # Java VM: OpenJDK 64-Bit Server VM (25.91-b14 mixed mode linux-amd64 compressed oops)
07:49:06 # Problematic frame:
**07:49:06 # C [librocksdbjni8312626221310549185.so+0x1f8988] rocksdb::GetColumnFamilyID(rocksdb::ColumnFamilyHandle*)+0x8**
Not sure what to make of this. When I run the tests locally on Windows 10
with JDK 1.8.0_131 and Maven 3.5.0, everything passes ([full build
log](https://pastebin.com/dsCNg1XN)). Any ideas, @lresende?
> Expand filtering options for TwitterInputDStream
> ------------------------------------------------
>
> Key: BAHIR-117
> URL: https://issues.apache.org/jira/browse/BAHIR-117
> Project: Bahir
> Issue Type: Improvement
> Components: Spark Streaming Connectors
> Reporter: Clemens Wolff
>
> Currently, the TwitterInputDStream only supports filtering by keywords [1]
> which corresponds to the "track" option in the Twitter API [2]. The Twitter
> API supports many more ways to receive a filtered stream (e.g. get Tweets in
> a particular location [3]). It would be very useful to expose these
> additional filtering options in this library.
> Proposal: add a new public method to TwitterUtils which follows the same
> interface as createStream [4] but which takes a FilterQuery [5] object as
> argument. In this way, we give full filtering flexibility to our users.
> I'm currently working on Project Fortis, a social data analysis platform for
> the United Nations [6]. The extra filtering options would be very useful for
> my project so I'm happy to implement this and create a pull request.
> [1] https://github.com/apache/bahir/blob/fd4c35fc9f7ebb57464d231cf5d66e7bc4096a1b/streaming-twitter/src/main/scala/org/apache/spark/streaming/twitter/TwitterInputDStream.scala#L44
> [2] https://dev.twitter.com/streaming/overview/request-parameters#track
> [3] https://dev.twitter.com/streaming/overview/request-parameters#locations
> [4] https://github.com/apache/bahir/blob/fd4c35fc9f7ebb57464d231cf5d66e7bc4096a1b/streaming-twitter/src/main/scala/org/apache/spark/streaming/twitter/TwitterUtils.scala#L39
> [5] http://twitter4j.org/javadoc/twitter4j/FilterQuery.html
> [6] https://fortis-web.azurewebsites.net/#/site/ocha/
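For illustration only, the proposal quoted above might look roughly like the Scala sketch below. The method name `createFilteredStream`, the trait `FilteredTwitterStreams` and the object `FilterQueryExample` are hypothetical and not part of TwitterUtils in the linked revision; the parameter list just mirrors createStream [4] with the keyword list swapped for a FilterQuery [5]. The sketch assumes twitter4j 4.x, where `track` and `locations` take varargs, and the search terms and bounding box are made-up values.

```scala
import org.apache.spark.storage.StorageLevel
import org.apache.spark.streaming.StreamingContext
import org.apache.spark.streaming.dstream.ReceiverInputDStream
import twitter4j.{FilterQuery, Status}
import twitter4j.auth.Authorization

// Hypothetical signature for the proposed method (an assumption, not the
// existing API): same shape as createStream, but the filter is a FilterQuery
// instead of a Seq[String] of keywords.
trait FilteredTwitterStreams {
  def createFilteredStream(
      ssc: StreamingContext,
      twitterAuth: Option[Authorization],
      query: FilterQuery,
      storageLevel: StorageLevel = StorageLevel.MEMORY_AND_DISK_SER_2
  ): ReceiverInputDStream[Status]
}

object FilterQueryExample {
  // Combines the existing "track" keywords with a "locations" bounding box.
  // Each corner is a (longitude, latitude) pair: south-west first, then
  // north-east, as the Twitter streaming API expects.
  val query: FilterQuery = new FilterQuery()
    .track("spark", "bahir")
    .locations(Array(-74.0, 40.0), Array(-73.0, 41.0))
}
```

A caller would pass `FilterQueryExample.query` where a keyword list is passed to createStream today, so any further filter that FilterQuery supports becomes available without adding more overloads.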
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)