[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=319197=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-319197 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 26/Sep/19 20:18 Start Date: 26/Sep/19 20:18 Worklog Time Spent: 10m Work Description: pabloem commented on pull request #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 319197) Time Spent: 1.5h (was: 1h 20m) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Assignee: John Patoch >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 1.5h > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=319196=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-319196 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 26/Sep/19 20:17 Start Date: 26/Sep/19 20:17 Worklog Time Spent: 10m Work Description: pabloem commented on issue #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633#issuecomment-535669628 Thanks! This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 319196) Time Spent: 1h 20m (was: 1h 10m) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Assignee: John Patoch >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 1h 20m > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=319126=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-319126 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 26/Sep/19 18:20 Start Date: 26/Sep/19 18:20 Worklog Time Spent: 10m Work Description: angulartist commented on issue #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633#issuecomment-535626785 The file is in fact accessible. I've deleted the previous file and updated the comment. I left the default original input as it is, because we just want a lighter alternative xoxo This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 319126) Time Spent: 1h 10m (was: 1h) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Assignee: John Patoch >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 1h 10m > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=318566=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-318566 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 25/Sep/19 20:09 Start Date: 25/Sep/19 20:09 Worklog Time Spent: 10m Work Description: pabloem commented on issue #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633#issuecomment-535191247 Alright, I've uploaded the file to `gs://apache-beam-samples/game/small/gaming_data.csv`. Can you check that it's accessible? If so, for this PR, could you: - Remove the file - Update the documentation to point to `gs://apache-beam-samples/game/small/gaming_data.csv` for the small dataset? This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 318566) Time Spent: 1h (was: 50m) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Assignee: John Patoch >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 1h > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=318082=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-318082 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 25/Sep/19 05:02 Start Date: 25/Sep/19 05:02 Worklog Time Spent: 10m Work Description: angulartist commented on issue #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633#issuecomment-534853172 Yeah that sounds like a good idea, file is kinda small so friends will be able to download it :fire: This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 318082) Time Spent: 50m (was: 40m) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Assignee: John Patoch >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 50m > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=317930=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-317930 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 25/Sep/19 00:38 Start Date: 25/Sep/19 00:38 Worklog Time Spent: 10m Work Description: pabloem commented on issue #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633#issuecomment-534801347 I can push your file to the GCS bucket. I am just wondering if you think that's a good idea? If so, we'd have to amend the docstring. This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 317930) Time Spent: 40m (was: 0.5h) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Assignee: John Patoch >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 40m > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=317929=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-317929 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 25/Sep/19 00:37 Start Date: 25/Sep/19 00:37 Worklog Time Spent: 10m Work Description: pabloem commented on issue #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633#issuecomment-534801219 Thanks for helping to generate the data. Maybe we should push it to a GCS bucket instead of keeping it in the Github repo? This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 317929) Time Spent: 0.5h (was: 20m) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Assignee: John Patoch >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 0.5h > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=316503=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-316503 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 23/Sep/19 07:49 Start Date: 23/Sep/19 07:49 Worklog Time Spent: 10m Work Description: angulartist commented on issue #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633#issuecomment-533993501 retest this please This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking --- Worklog Id: (was: 316503) Time Spent: 20m (was: 10m) > Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples" > --- > > Key: BEAM-1296 > URL: https://issues.apache.org/jira/browse/BEAM-1296 > Project: Beam > Issue Type: Wish > Components: examples-java >Reporter: Keiji Yoshida >Priority: Trivial > Labels: ccoss2019, newbie, starter > Time Spent: 20m > Remaining Estimate: 0h > > A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam > Mobile Gaming Pipeline Examples" is so huge (about 12 GB) and it takes long > time to download the dataset. It might pose difficulties to Apache Beam > beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" > quickly. > How about providing a small dataset (say less than 1 GB) for this examples? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Work logged] (BEAM-1296) Providing a small dataset for "Apache Beam Mobile Gaming Pipeline Examples"
[ https://issues.apache.org/jira/browse/BEAM-1296?focusedWorklogId=316315=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-316315 ] ASF GitHub Bot logged work on BEAM-1296: Author: ASF GitHub Bot Created on: 22/Sep/19 19:45 Start Date: 22/Sep/19 19:45 Worklog Time Spent: 10m Work Description: angulartist commented on pull request #9633: [BEAM-1296] Providing a small dataset for "Apache Beam Mobile Gaming … URL: https://github.com/apache/beam/pull/9633 …Pipeline Examples" A dataset "gs://apache-beam-samples/game/gaming_data*.csv" for "Apache Beam Mobile Gaming Pipeline Examples" is so huge (about 2 chunks of ~12 GB) and it takes long time to download the dataset. It might pose difficulties to Apache Beam beginners who want to try "Apache Beam Mobile Gaming Pipeline Examples" quickly. Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily: - [ ] [**Choose reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and mention them in a comment (`R: @username`). - [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue. - [ ] If this contribution is large, please file an Apache [Individual Contributor License Agreement](https://www.apache.org/licenses/icla.pdf). Post-Commit Tests Status (on master branch) Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark --- | --- | --- | --- | --- | --- | --- | --- Go | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/) | --- | --- | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/) | --- | --- | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/) Java | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/) | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/) | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/) | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/) | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/) | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/) | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/) Python | [![Build Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build Status](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/)[![Build