[jira] [Created] (STREAMS-680) improve performance/efficiency of twitter search providers

2020-10-20 Thread Steve Blackmon (Jira)
Steve Blackmon created STREAMS-680:
--

 Summary: improve performance/efficiency of twitter search providers
 Key: STREAMS-680
 URL: https://issues.apache.org/jira/browse/STREAMS-680
 Project: Streams
  Issue Type: Task
Reporter: Steve Blackmon
Assignee: Steve Blackmon


improve performance and memory efficiency of twitter search providers



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (STREAMS-679) add support for accessing and searching with twitter derived fields

2020-10-20 Thread Steve Blackmon (Jira)


[ 
https://issues.apache.org/jira/browse/STREAMS-679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217852#comment-17217852
 ] 

Steve Blackmon commented on STREAMS-679:


more details on these fields:

https://developer.twitter.com/en/docs/twitter-api/v1/enrichments/overview/profile-geo

> add support for accessing and searching with twitter derived fields
> ---
>
> Key: STREAMS-679
> URL: https://issues.apache.org/jira/browse/STREAMS-679
> Project: Streams
>  Issue Type: Task
>Reporter: Steve Blackmon
>Assignee: Steve Blackmon
>Priority: Major
>
> add support for accessing and searching with twitter derived fields



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (STREAMS-679) add support for accessing and searching with twitter derived fields

2020-10-20 Thread Steve Blackmon (Jira)
Steve Blackmon created STREAMS-679:
--

 Summary: add support for accessing and searching with twitter 
derived fields
 Key: STREAMS-679
 URL: https://issues.apache.org/jira/browse/STREAMS-679
 Project: Streams
  Issue Type: Task
Reporter: Steve Blackmon
Assignee: Steve Blackmon


add support for accessing and searching with twitter derived fields



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[streams] branch STREAMS-676 created (now f47c42c)

2020-10-20 Thread sblackmon
This is an automated email from the ASF dual-hosted git repository.

sblackmon pushed a change to branch STREAMS-676
in repository https://gitbox.apache.org/repos/asf/streams.git.


  at f47c42c  resolves STREAMS-676

This branch includes the following new commits:

 new f47c42c  resolves STREAMS-676

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




[streams] 01/01: resolves STREAMS-676

2020-10-20 Thread sblackmon
This is an automated email from the ASF dual-hosted git repository.

sblackmon pushed a commit to branch STREAMS-676
in repository https://gitbox.apache.org/repos/asf/streams.git

commit f47c42c5df3410387aba291dad48a4558869242f
Author: sblackmon 
AuthorDate: Mon Oct 12 17:33:49 2020 -0500

resolves STREAMS-676

remove unnecessary keying and use GlobalWindow in 
streams-examples-flink/flink-twitter-collection
---
 .../flink-twitter-collection/pom.xml   | 15 ++
 .../main/jsonschema/StreamsFlinkConfiguration.json |  4 +--
 .../apache/streams/examples/flink/FlinkBase.scala  | 11 
 .../collection/FlinkTwitterFollowingPipeline.scala | 13 +
 .../collection/FlinkTwitterPostsPipeline.scala | 19 +
 .../FlinkTwitterUserInformationPipeline.scala  | 32 --
 .../FollowingCollectorFlatMapFunction.scala|  8 +++---
 .../TimelineCollectorFlatMapFunction.scala |  8 +++---
 8 files changed, 55 insertions(+), 55 deletions(-)

diff --git 
a/streams-examples/streams-examples-flink/flink-twitter-collection/pom.xml 
b/streams-examples/streams-examples-flink/flink-twitter-collection/pom.xml
index 467536b..dee8dfc 100644
--- a/streams-examples/streams-examples-flink/flink-twitter-collection/pom.xml
+++ b/streams-examples/streams-examples-flink/flink-twitter-collection/pom.xml
@@ -95,6 +95,11 @@
 
 
 org.apache.streams
+streams-converters
+${project.version}
+
+
+org.apache.streams
 streams-provider-twitter
 ${project.version}
 
@@ -402,6 +407,16 @@
 
 
 org.apache.streams
+streams-provider-twitter
+${project.version}
+
+
+org.apache.streams
+streams-converters
+${project.version}
+
+
+org.apache.streams
 streams-persist-hdfs
 ${project.version}
 
diff --git 
a/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/jsonschema/StreamsFlinkConfiguration.json
 
b/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/jsonschema/StreamsFlinkConfiguration.json
index 7c6291e..b45ec14 100644
--- 
a/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/jsonschema/StreamsFlinkConfiguration.json
+++ 
b/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/jsonschema/StreamsFlinkConfiguration.json
@@ -12,10 +12,10 @@
   },
   "additionalProperties": false,
   "properties": {
-"test": {
+"local": {
   "type": "boolean"
 },
-"local": {
+"test": {
   "type": "boolean"
 }
   }
diff --git 
a/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/scala/org/apache/streams/examples/flink/FlinkBase.scala
 
b/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/scala/org/apache/streams/examples/flink/FlinkBase.scala
index 980f5eb..a2123b8 100644
--- 
a/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/scala/org/apache/streams/examples/flink/FlinkBase.scala
+++ 
b/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/scala/org/apache/streams/examples/flink/FlinkBase.scala
@@ -29,6 +29,9 @@ import org.apache.flink.streaming.api.CheckpointingMode
 import 
org.apache.flink.streaming.api.functions.sink.filesystem.bucketassigners.BasePathBucketAssigner
 import 
org.apache.flink.streaming.api.functions.sink.filesystem.rollingpolicies.DefaultRollingPolicy
 import org.apache.flink.streaming.api.scala.StreamExecutionEnvironment
+import org.apache.flink.streaming.api.scala.function.AllWindowFunction
+import org.apache.flink.streaming.api.windowing.windows.GlobalWindow
+import org.apache.flink.util.Collector
 import org.apache.streams.config.{ComponentConfigurator, StreamsConfigurator}
 import org.apache.streams.flink.{FlinkBatchConfiguration, 
FlinkStreamingConfiguration, StreamsFlinkConfiguration}
 import org.apache.streams.hdfs.{HdfsConfiguration, HdfsReaderConfiguration, 
HdfsWriterConfiguration}
@@ -46,6 +49,14 @@ object FlinkBase {
   input.substring(input.lastIndexOf(':')+1)
 else input
   }
+
+  class idListWindowFunction extends AllWindowFunction[String, List[String], 
GlobalWindow] {
+override def apply(window: GlobalWindow, input: Iterable[String], out: 
Collector[List[String]]): Unit = {
+  if( input.nonEmpty )
+out.collect(input.map(id => toProviderId(id)).toList)
+}
+  }
+
 }
 
 trait FlinkBase {
diff --git 
a/streams-examples/streams-examples-flink/flink-twitter-collection/src/main/scala/org/apache/streams/examples/flink/twitter/collection/FlinkTwitterFollowingPipeline.scala
 

[jira] [Created] (STREAMS-678) RdfFreemarkerCli.java should employ StreamsConfigurator

2020-10-20 Thread Steve Blackmon (Jira)
Steve Blackmon created STREAMS-678:
--

 Summary: RdfFreemarkerCli.java should employ StreamsConfigurator
 Key: STREAMS-678
 URL: https://issues.apache.org/jira/browse/STREAMS-678
 Project: Streams
  Issue Type: Task
Reporter: Steve Blackmon


RdfFreemarkerCli.java should employ StreamsConfigurator instead of 
ConfigFactory.load() making it easier for wrapper code to initialize it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)