[jira] [Work logged] (GOBBLIN-1060) Fix YarnAppLauncher resource existence checking with wrong fs object

2020-02-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1060?focusedWorklogId=393142=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-393142
 ]

ASF GitHub Bot logged work on GOBBLIN-1060:
---

Author: ASF GitHub Bot
Created on: 26/Feb/20 02:34
Start Date: 26/Feb/20 02:34
Worklog Time Spent: 10m 
  Work Description: codecov-io commented on issue #2899: [GOBBLIN-1060]Fix 
wrong fileSystem object in YarnApplauncher
URL: 
https://github.com/apache/incubator-gobblin/pull/2899#issuecomment-591042125
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=h1)
 Report
   > Merging 
[#2899](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-gobblin/commit/0de109b6941e636f4a35069752ef1e3c4188054d?src=pr=desc)
 will **decrease** coverage by `<.01%`.
   > The diff coverage is `0%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/graphs/tree.svg?width=650=4MgURJ0bGc=150=pr)](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2899  +/-   ##
   
   - Coverage 45.86%   45.85%   -0.01% 
   + Complexity 9170 9167   -3 
   
 Files  1932 1932  
 Lines 7278972792   +3 
 Branches   8022 8022  
   
   - Hits  3338633382   -4 
   - Misses3634236352  +10 
   + Partials   3061 3058   -3
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree) 
| Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...rg/apache/gobblin/yarn/GobblinYarnAppLauncher.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi15YXJuL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3lhcm4vR29iYmxpbllhcm5BcHBMYXVuY2hlci5qYXZh)
 | `21.02% <0%> (-0.16%)` | `8 <0> (ø)` | |
   | 
[...in/java/org/apache/gobblin/cluster/HelixUtils.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1jbHVzdGVyL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL2NsdXN0ZXIvSGVsaXhVdGlscy5qYXZh)
 | `38.26% <0%> (-3.48%)` | `14% <0%> (-1%)` | |
   | 
[...e/gobblin/runtime/locks/ZookeeperBasedJobLock.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvbG9ja3MvWm9va2VlcGVyQmFzZWRKb2JMb2NrLmphdmE=)
 | `63.33% <0%> (-1.12%)` | `15% <0%> (-1%)` | |
   | 
[...main/java/org/apache/gobblin/yarn/YarnService.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi15YXJuL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3lhcm4vWWFyblNlcnZpY2UuamF2YQ==)
 | `15.32% <0%> (-0.84%)` | `4% <0%> (-1%)` | |
   | 
[.../apache/gobblin/runtime/api/JobExecutionState.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvYXBpL0pvYkV4ZWN1dGlvblN0YXRlLmphdmE=)
 | `80.37% <0%> (+0.93%)` | `24% <0%> (ø)` | :arrow_down: |
   | 
[.../org/apache/gobblin/metrics/RootMetricContext.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1tZXRyaWNzLWxpYnMvZ29iYmxpbi1tZXRyaWNzLWJhc2Uvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2dvYmJsaW4vbWV0cmljcy9Sb290TWV0cmljQ29udGV4dC5qYXZh)
 | `79.68% <0%> (+1.56%)` | `16% <0%> (+1%)` | :arrow_up: |
   | 
[...lin/restli/throttling/ZookeeperLeaderElection.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1yZXN0bGkvZ29iYmxpbi10aHJvdHRsaW5nLXNlcnZpY2UvZ29iYmxpbi10aHJvdHRsaW5nLXNlcnZpY2Utc2VydmVyL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3Jlc3RsaS90aHJvdHRsaW5nL1pvb2tlZXBlckxlYWRlckVsZWN0aW9uLmphdmE=)
 | `72.22% <0%> (+2.22%)` | `13% <0%> (ø)` | :arrow_down: |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=footer).
 Last update 
[0de109b...5c71800](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please 

[GitHub] [incubator-gobblin] codecov-io edited a comment on issue #2899: [GOBBLIN-1060]Fix wrong fileSystem object in YarnApplauncher

2020-02-25 Thread GitBox
codecov-io edited a comment on issue #2899: [GOBBLIN-1060]Fix wrong fileSystem 
object in YarnApplauncher
URL: 
https://github.com/apache/incubator-gobblin/pull/2899#issuecomment-591042125
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=h1)
 Report
   > Merging 
[#2899](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-gobblin/commit/0de109b6941e636f4a35069752ef1e3c4188054d?src=pr=desc)
 will **decrease** coverage by `<.01%`.
   > The diff coverage is `0%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/graphs/tree.svg?width=650=4MgURJ0bGc=150=pr)](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2899  +/-   ##
   
   - Coverage 45.86%   45.85%   -0.01% 
   + Complexity 9170 9167   -3 
   
 Files  1932 1932  
 Lines 7278972792   +3 
 Branches   8022 8022  
   
   - Hits  3338633382   -4 
   - Misses3634236352  +10 
   + Partials   3061 3058   -3
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree) 
| Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...rg/apache/gobblin/yarn/GobblinYarnAppLauncher.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi15YXJuL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3lhcm4vR29iYmxpbllhcm5BcHBMYXVuY2hlci5qYXZh)
 | `21.02% <0%> (-0.16%)` | `8 <0> (ø)` | |
   | 
[...in/java/org/apache/gobblin/cluster/HelixUtils.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1jbHVzdGVyL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL2NsdXN0ZXIvSGVsaXhVdGlscy5qYXZh)
 | `38.26% <0%> (-3.48%)` | `14% <0%> (-1%)` | |
   | 
[...e/gobblin/runtime/locks/ZookeeperBasedJobLock.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvbG9ja3MvWm9va2VlcGVyQmFzZWRKb2JMb2NrLmphdmE=)
 | `63.33% <0%> (-1.12%)` | `15% <0%> (-1%)` | |
   | 
[...main/java/org/apache/gobblin/yarn/YarnService.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi15YXJuL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3lhcm4vWWFyblNlcnZpY2UuamF2YQ==)
 | `15.32% <0%> (-0.84%)` | `4% <0%> (-1%)` | |
   | 
[.../apache/gobblin/runtime/api/JobExecutionState.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvYXBpL0pvYkV4ZWN1dGlvblN0YXRlLmphdmE=)
 | `80.37% <0%> (+0.93%)` | `24% <0%> (ø)` | :arrow_down: |
   | 
[.../org/apache/gobblin/metrics/RootMetricContext.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1tZXRyaWNzLWxpYnMvZ29iYmxpbi1tZXRyaWNzLWJhc2Uvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2dvYmJsaW4vbWV0cmljcy9Sb290TWV0cmljQ29udGV4dC5qYXZh)
 | `79.68% <0%> (+1.56%)` | `16% <0%> (+1%)` | :arrow_up: |
   | 
[...lin/restli/throttling/ZookeeperLeaderElection.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi1yZXN0bGkvZ29iYmxpbi10aHJvdHRsaW5nLXNlcnZpY2UvZ29iYmxpbi10aHJvdHRsaW5nLXNlcnZpY2Utc2VydmVyL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3Jlc3RsaS90aHJvdHRsaW5nL1pvb2tlZXBlckxlYWRlckVsZWN0aW9uLmphdmE=)
 | `72.22% <0%> (+2.22%)` | `13% <0%> (ø)` | :arrow_down: |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=footer).
 Last update 
[0de109b...5c71800](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Work logged] (GOBBLIN-1050) Verify requester when updating/deleting FlowConfig

2020-02-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1050?focusedWorklogId=393115=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-393115
 ]

ASF GitHub Bot logged work on GOBBLIN-1050:
---

Author: ASF GitHub Bot
Created on: 26/Feb/20 01:42
Start Date: 26/Feb/20 01:42
Worklog Time Spent: 10m 
  Work Description: arjun4084346 commented on issue #2890: [GOBBLIN-1050] 
Verify requester when updating/deleting FlowConfig
URL: 
https://github.com/apache/incubator-gobblin/pull/2890#issuecomment-591191796
 
 
   I did not like the idea of updating signature of update at multiple places. 
It made the code non-uniform. Place of requesterList as a parameter in update() 
is not quite justified in my opinion.
   Is it possible to do checkRequester() in FlowConfigsResource::update/delete 
instead of doing in resource handlers' update/delete ?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 393115)
Time Spent: 40m  (was: 0.5h)

> Verify requester when updating/deleting FlowConfig
> --
>
> Key: GOBBLIN-1050
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1050
> Project: Apache Gobblin
>  Issue Type: Improvement
>Reporter: Jack Moseley
>Priority: Major
>  Time Spent: 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [incubator-gobblin] arjun4084346 commented on issue #2890: [GOBBLIN-1050] Verify requester when updating/deleting FlowConfig

2020-02-25 Thread GitBox
arjun4084346 commented on issue #2890: [GOBBLIN-1050] Verify requester when 
updating/deleting FlowConfig
URL: 
https://github.com/apache/incubator-gobblin/pull/2890#issuecomment-591191796
 
 
   I did not like the idea of updating signature of update at multiple places. 
It made the code non-uniform. Place of requesterList as a parameter in update() 
is not quite justified in my opinion.
   Is it possible to do checkRequester() in FlowConfigsResource::update/delete 
instead of doing in resource handlers' update/delete ?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [incubator-gobblin] sv2000 commented on a change in pull request #2899: [GOBBLIN-1060]Fix wrong fileSystem object in YarnApplauncher

2020-02-25 Thread GitBox
sv2000 commented on a change in pull request #2899: [GOBBLIN-1060]Fix wrong 
fileSystem object in YarnApplauncher
URL: https://github.com/apache/incubator-gobblin/pull/2899#discussion_r384128391
 
 

 ##
 File path: 
gobblin-yarn/src/main/java/org/apache/gobblin/yarn/GobblinYarnAppLauncher.java
 ##
 @@ -608,21 +608,22 @@ private Resource 
prepareContainerResource(GetNewApplicationResponse newApplicati
 Path appMasterWorkDir = new Path(appWorkDir, 
GobblinYarnConfigurationKeys.APP_MASTER_WORK_DIR_NAME);
 
 Map appMasterResources = Maps.newHashMap();
+FileSystem localFs = FileSystem.getLocal(this.yarnConfiguration);
 
 Review comment:
   Maybe it is safer to use FileSystem.getLocal(new Configuration()) since we 
are dependent of fs.defaultFS not being configured in yarnConfiguration. 


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Work logged] (GOBBLIN-1060) Fix YarnAppLauncher resource existence checking with wrong fs object

2020-02-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1060?focusedWorklogId=392932=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-392932
 ]

ASF GitHub Bot logged work on GOBBLIN-1060:
---

Author: ASF GitHub Bot
Created on: 25/Feb/20 21:11
Start Date: 25/Feb/20 21:11
Worklog Time Spent: 10m 
  Work Description: sv2000 commented on pull request #2899: 
[GOBBLIN-1060]Fix wrong fileSystem object in YarnApplauncher
URL: https://github.com/apache/incubator-gobblin/pull/2899#discussion_r384128391
 
 

 ##
 File path: 
gobblin-yarn/src/main/java/org/apache/gobblin/yarn/GobblinYarnAppLauncher.java
 ##
 @@ -608,21 +608,22 @@ private Resource 
prepareContainerResource(GetNewApplicationResponse newApplicati
 Path appMasterWorkDir = new Path(appWorkDir, 
GobblinYarnConfigurationKeys.APP_MASTER_WORK_DIR_NAME);
 
 Map appMasterResources = Maps.newHashMap();
+FileSystem localFs = FileSystem.getLocal(this.yarnConfiguration);
 
 Review comment:
   Maybe it is safer to use FileSystem.getLocal(new Configuration()) since we 
are dependent of fs.defaultFS not being configured in yarnConfiguration. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 392932)
Time Spent: 0.5h  (was: 20m)

> Fix YarnAppLauncher resource existence checking with wrong fs object
> 
>
> Key: GOBBLIN-1060
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1060
> Project: Apache Gobblin
>  Issue Type: Bug
>Reporter: Lei Sun
>Priority: Major
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (GOBBLIN-1060) Fix YarnAppLauncher resource existence checking with wrong fs object

2020-02-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1060?focusedWorklogId=392881=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-392881
 ]

ASF GitHub Bot logged work on GOBBLIN-1060:
---

Author: ASF GitHub Bot
Created on: 25/Feb/20 20:03
Start Date: 25/Feb/20 20:03
Worklog Time Spent: 10m 
  Work Description: codecov-io commented on issue #2899: [GOBBLIN-1060]Fix 
wrong fileSystem object in YarnApplauncher
URL: 
https://github.com/apache/incubator-gobblin/pull/2899#issuecomment-591042125
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=h1)
 Report
   > Merging 
[#2899](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-gobblin/commit/0de109b6941e636f4a35069752ef1e3c4188054d?src=pr=desc)
 will **decrease** coverage by `<.01%`.
   > The diff coverage is `0%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/graphs/tree.svg?width=650=4MgURJ0bGc=150=pr)](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree)
   
   ```diff
   @@ Coverage Diff @@
   ## master   #2899  +/-   ##
   ===
   - Coverage  4.12%   4.12%   -0.01% 
 Complexity  750 750  
   ===
 Files  19321932  
 Lines 72789   72792   +3 
 Branches   80228022  
   ===
 Hits   30023002  
   - Misses69469   69472   +3 
 Partials318 318
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree) 
| Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...rg/apache/gobblin/yarn/GobblinYarnAppLauncher.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi15YXJuL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3lhcm4vR29iYmxpbllhcm5BcHBMYXVuY2hlci5qYXZh)
 | `0% <0%> (ø)` | `0 <0> (ø)` | :arrow_down: |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=footer).
 Last update 
[0de109b...0abc430](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 392881)
Time Spent: 20m  (was: 10m)

> Fix YarnAppLauncher resource existence checking with wrong fs object
> 
>
> Key: GOBBLIN-1060
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1060
> Project: Apache Gobblin
>  Issue Type: Bug
>Reporter: Lei Sun
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [incubator-gobblin] codecov-io commented on issue #2899: [GOBBLIN-1060]Fix wrong fileSystem object in YarnApplauncher

2020-02-25 Thread GitBox
codecov-io commented on issue #2899: [GOBBLIN-1060]Fix wrong fileSystem object 
in YarnApplauncher
URL: 
https://github.com/apache/incubator-gobblin/pull/2899#issuecomment-591042125
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=h1)
 Report
   > Merging 
[#2899](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-gobblin/commit/0de109b6941e636f4a35069752ef1e3c4188054d?src=pr=desc)
 will **decrease** coverage by `<.01%`.
   > The diff coverage is `0%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/graphs/tree.svg?width=650=4MgURJ0bGc=150=pr)](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree)
   
   ```diff
   @@ Coverage Diff @@
   ## master   #2899  +/-   ##
   ===
   - Coverage  4.12%   4.12%   -0.01% 
 Complexity  750 750  
   ===
 Files  19321932  
 Lines 72789   72792   +3 
 Branches   80228022  
   ===
 Hits   30023002  
   - Misses69469   69472   +3 
 Partials318 318
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=tree) 
| Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...rg/apache/gobblin/yarn/GobblinYarnAppLauncher.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2899/diff?src=pr=tree#diff-Z29iYmxpbi15YXJuL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3lhcm4vR29iYmxpbllhcm5BcHBMYXVuY2hlci5qYXZh)
 | `0% <0%> (ø)` | `0 <0> (ø)` | :arrow_down: |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=footer).
 Last update 
[0de109b...0abc430](https://codecov.io/gh/apache/incubator-gobblin/pull/2899?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [incubator-gobblin] autumnust opened a new pull request #2899: [GOBBLIN-1060]Fix wrong fileSystem object in YarnApplauncher

2020-02-25 Thread GitBox
autumnust opened a new pull request #2899: [GOBBLIN-1060]Fix wrong fileSystem 
object in YarnApplauncher
URL: https://github.com/apache/incubator-gobblin/pull/2899
 
 
   Dear Gobblin maintainers,
   
   Please accept this PR. I understand that it will not be reviewed until I 
have checked off all the steps below!
   
   
   ### JIRA
   - [ ] My PR addresses the following [Gobblin 
JIRA](https://issues.apache.org/jira/browse/GOBBLIN/) issues and references 
them in the PR title. For example, "[GOBBLIN-XXX] My Gobblin PR"
   - https://issues.apache.org/jira/browse/GOBBLIN-1060
   
   
   ### Description
   The existence checking of files to be localized in yarn appmaster / 
container need to happen against local fs object. Currently it is using HDFS fs 
object which is incorrect. 
   
   
   ### Tests
   - [ ] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   
   
   ### Commits
   - [ ] My commits all reference JIRA issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
   1. Subject is separated from body by a blank line
   2. Subject is limited to 50 characters
   3. Subject does not end with a period
   4. Subject uses the imperative mood ("add", not "adding")
   5. Body wraps at 72 characters
   6. Body explains "what" and "why", not "how"
   
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Work logged] (GOBBLIN-1060) Fix YarnAppLauncher resource existence checking with wrong fs object

2020-02-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1060?focusedWorklogId=392835=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-392835
 ]

ASF GitHub Bot logged work on GOBBLIN-1060:
---

Author: ASF GitHub Bot
Created on: 25/Feb/20 19:00
Start Date: 25/Feb/20 19:00
Worklog Time Spent: 10m 
  Work Description: autumnust commented on pull request #2899: 
[GOBBLIN-1060]Fix wrong fileSystem object in YarnApplauncher
URL: https://github.com/apache/incubator-gobblin/pull/2899
 
 
   Dear Gobblin maintainers,
   
   Please accept this PR. I understand that it will not be reviewed until I 
have checked off all the steps below!
   
   
   ### JIRA
   - [ ] My PR addresses the following [Gobblin 
JIRA](https://issues.apache.org/jira/browse/GOBBLIN/) issues and references 
them in the PR title. For example, "[GOBBLIN-XXX] My Gobblin PR"
   - https://issues.apache.org/jira/browse/GOBBLIN-1060
   
   
   ### Description
   The existence checking of files to be localized in yarn appmaster / 
container need to happen against local fs object. Currently it is using HDFS fs 
object which is incorrect. 
   
   
   ### Tests
   - [ ] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   
   
   ### Commits
   - [ ] My commits all reference JIRA issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
   1. Subject is separated from body by a blank line
   2. Subject is limited to 50 characters
   3. Subject does not end with a period
   4. Subject uses the imperative mood ("add", not "adding")
   5. Body wraps at 72 characters
   6. Body explains "what" and "why", not "how"
   
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 392835)
Remaining Estimate: 0h
Time Spent: 10m

> Fix YarnAppLauncher resource existence checking with wrong fs object
> 
>
> Key: GOBBLIN-1060
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1060
> Project: Apache Gobblin
>  Issue Type: Bug
>Reporter: Lei Sun
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (GOBBLIN-1058) Make emitTrackingEvents method accept additional PartitionsToTags map for ease of extension on metrics

2020-02-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1058?focusedWorklogId=392832=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-392832
 ]

ASF GitHub Bot logged work on GOBBLIN-1058:
---

Author: ASF GitHub Bot
Created on: 25/Feb/20 18:58
Start Date: 25/Feb/20 18:58
Worklog Time Spent: 10m 
  Work Description: asfgit commented on pull request #2898: [GOBBLIN-1058] 
Refactor method emitting GTE for ease of adding new tags 
URL: https://github.com/apache/incubator-gobblin/pull/2898
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 392832)
Time Spent: 50m  (was: 40m)

> Make emitTrackingEvents method accept additional PartitionsToTags map for 
> ease of extension on metrics
> --
>
> Key: GOBBLIN-1058
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1058
> Project: Apache Gobblin
>  Issue Type: Improvement
>Reporter: Lei Sun
>Priority: Major
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (GOBBLIN-1060) Fix YarnAppLauncher resource existence checking with wrong fs object

2020-02-25 Thread Lei Sun (Jira)
Lei Sun created GOBBLIN-1060:


 Summary: Fix YarnAppLauncher resource existence checking with 
wrong fs object
 Key: GOBBLIN-1060
 URL: https://issues.apache.org/jira/browse/GOBBLIN-1060
 Project: Apache Gobblin
  Issue Type: Bug
Reporter: Lei Sun






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [incubator-gobblin] asfgit closed pull request #2898: [GOBBLIN-1058] Refactor method emitting GTE for ease of adding new tags

2020-02-25 Thread GitBox
asfgit closed pull request #2898: [GOBBLIN-1058] Refactor method emitting GTE 
for ease of adding new tags 
URL: https://github.com/apache/incubator-gobblin/pull/2898
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Work logged] (GOBBLIN-1058) Make emitTrackingEvents method accept additional PartitionsToTags map for ease of extension on metrics

2020-02-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1058?focusedWorklogId=392811=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-392811
 ]

ASF GitHub Bot logged work on GOBBLIN-1058:
---

Author: ASF GitHub Bot
Created on: 25/Feb/20 18:33
Start Date: 25/Feb/20 18:33
Worklog Time Spent: 10m 
  Work Description: codecov-io commented on issue #2898: [GOBBLIN-1058] 
Refactor method emitting GTE for ease of adding new tags 
URL: 
https://github.com/apache/incubator-gobblin/pull/2898#issuecomment-590601056
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=h1)
 Report
   > Merging 
[#2898](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-gobblin/commit/a580d6e4beb0ea53e7971caa489f1fe508159ec2?src=pr=desc)
 will **increase** coverage by `0.01%`.
   > The diff coverage is `25%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/graphs/tree.svg?width=650=4MgURJ0bGc=150=pr)](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=tree)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2898  +/-   ##
   
   + Coverage 45.84%   45.85%   +0.01% 
   - Complexity 9151 9168  +17 
   
 Files  1929 1932   +3 
 Lines 7264972789 +140 
 Branches   7999 8022  +23 
   
   + Hits  3330433379  +75 
   - Misses3630536351  +46 
   - Partials   3040 3059  +19
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=tree) 
| Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...source/extractor/extract/kafka/KafkaExtractor.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4ta2Fma2EtY29tbW9uL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NvdXJjZS9leHRyYWN0b3IvZXh0cmFjdC9rYWZrYS9LYWZrYUV4dHJhY3Rvci5qYXZh)
 | `0% <0%> (ø)` | `0 <0> (ø)` | :arrow_down: |
   | 
[...in/source/extractor/extract/kafka/KafkaSource.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4ta2Fma2EtY29tbW9uL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NvdXJjZS9leHRyYWN0b3IvZXh0cmFjdC9rYWZrYS9LYWZrYVNvdXJjZS5qYXZh)
 | `0% <0%> (ø)` | `0 <0> (ø)` | :arrow_down: |
   | 
[...ctor/extract/kafka/KafkaExtractorStatsTracker.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4ta2Fma2EtY29tbW9uL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NvdXJjZS9leHRyYWN0b3IvZXh0cmFjdC9rYWZrYS9LYWZrYUV4dHJhY3RvclN0YXRzVHJhY2tlci5qYXZh)
 | `85.97% <28.57%> (+31.48%)` | `29 <0> (+4)` | :arrow_up: |
   | 
[.../org/apache/gobblin/metrics/RootMetricContext.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tZXRyaWNzLWxpYnMvZ29iYmxpbi1tZXRyaWNzLWJhc2Uvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2dvYmJsaW4vbWV0cmljcy9Sb290TWV0cmljQ29udGV4dC5qYXZh)
 | `73.43% <0%> (-6.25%)` | `15% <0%> (-1%)` | |
   | 
[...in/java/org/apache/gobblin/cluster/HelixUtils.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1jbHVzdGVyL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL2NsdXN0ZXIvSGVsaXhVdGlscy5qYXZh)
 | `38.26% <0%> (-4.22%)` | `14% <0%> (-1%)` | |
   | 
[...org/apache/gobblin/runtime/api/FsSpecConsumer.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvYXBpL0ZzU3BlY0NvbnN1bWVyLmphdmE=)
 | `60% <0%> (-3.83%)` | `6% <0%> (+1%)` | |
   | 
[...org/apache/gobblin/runtime/api/FsSpecProducer.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvYXBpL0ZzU3BlY1Byb2R1Y2VyLmphdmE=)
 | `72.54% <0%> (-3.07%)` | `9% <0%> (+1%)` | |
   | 
[...gobblin/azkaban/AzkabanGobblinYarnAppLauncher.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4tYXprYWJhbi9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvZ29iYmxpbi9hemthYmFuL0F6a2FiYW5Hb2JibGluWWFybkFwcExhdW5jaGVyLmphdmE=)
 | `30.55% <0%> (-2.78%)` | `2% <0%> (ø)` | |
   | 
[...anagement/copy/replication/ConfigBasedDataset.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1kYXRhLW1hbmFnZW1lbnQvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2dvYmJsaW4vZGF0YS9tYW5hZ2VtZW50L2NvcHkvcmVwbGljYXRpb24vQ29uZmlnQmFzZWREYXRhc2V0LmphdmE=)
 | `68.87% <0%> (-1.68%)` | `10% <0%> (ø)` | |
   | 

[GitHub] [incubator-gobblin] codecov-io edited a comment on issue #2898: [GOBBLIN-1058] Refactor method emitting GTE for ease of adding new tags

2020-02-25 Thread GitBox
codecov-io edited a comment on issue #2898: [GOBBLIN-1058] Refactor method 
emitting GTE for ease of adding new tags 
URL: 
https://github.com/apache/incubator-gobblin/pull/2898#issuecomment-590601056
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=h1)
 Report
   > Merging 
[#2898](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-gobblin/commit/a580d6e4beb0ea53e7971caa489f1fe508159ec2?src=pr=desc)
 will **increase** coverage by `0.01%`.
   > The diff coverage is `25%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/graphs/tree.svg?width=650=4MgURJ0bGc=150=pr)](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=tree)
   
   ```diff
   @@ Coverage Diff  @@
   ## master#2898  +/-   ##
   
   + Coverage 45.84%   45.85%   +0.01% 
   - Complexity 9151 9168  +17 
   
 Files  1929 1932   +3 
 Lines 7264972789 +140 
 Branches   7999 8022  +23 
   
   + Hits  3330433379  +75 
   - Misses3630536351  +46 
   - Partials   3040 3059  +19
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/incubator-gobblin/pull/2898?src=pr=tree) 
| Coverage Δ | Complexity Δ | |
   |---|---|---|---|
   | 
[...source/extractor/extract/kafka/KafkaExtractor.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4ta2Fma2EtY29tbW9uL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NvdXJjZS9leHRyYWN0b3IvZXh0cmFjdC9rYWZrYS9LYWZrYUV4dHJhY3Rvci5qYXZh)
 | `0% <0%> (ø)` | `0 <0> (ø)` | :arrow_down: |
   | 
[...in/source/extractor/extract/kafka/KafkaSource.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4ta2Fma2EtY29tbW9uL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NvdXJjZS9leHRyYWN0b3IvZXh0cmFjdC9rYWZrYS9LYWZrYVNvdXJjZS5qYXZh)
 | `0% <0%> (ø)` | `0 <0> (ø)` | :arrow_down: |
   | 
[...ctor/extract/kafka/KafkaExtractorStatsTracker.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4ta2Fma2EtY29tbW9uL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3NvdXJjZS9leHRyYWN0b3IvZXh0cmFjdC9rYWZrYS9LYWZrYUV4dHJhY3RvclN0YXRzVHJhY2tlci5qYXZh)
 | `85.97% <28.57%> (+31.48%)` | `29 <0> (+4)` | :arrow_up: |
   | 
[.../org/apache/gobblin/metrics/RootMetricContext.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tZXRyaWNzLWxpYnMvZ29iYmxpbi1tZXRyaWNzLWJhc2Uvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2dvYmJsaW4vbWV0cmljcy9Sb290TWV0cmljQ29udGV4dC5qYXZh)
 | `73.43% <0%> (-6.25%)` | `15% <0%> (-1%)` | |
   | 
[...in/java/org/apache/gobblin/cluster/HelixUtils.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1jbHVzdGVyL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL2NsdXN0ZXIvSGVsaXhVdGlscy5qYXZh)
 | `38.26% <0%> (-4.22%)` | `14% <0%> (-1%)` | |
   | 
[...org/apache/gobblin/runtime/api/FsSpecConsumer.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvYXBpL0ZzU3BlY0NvbnN1bWVyLmphdmE=)
 | `60% <0%> (-3.83%)` | `6% <0%> (+1%)` | |
   | 
[...org/apache/gobblin/runtime/api/FsSpecProducer.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvYXBpL0ZzU3BlY1Byb2R1Y2VyLmphdmE=)
 | `72.54% <0%> (-3.07%)` | `9% <0%> (+1%)` | |
   | 
[...gobblin/azkaban/AzkabanGobblinYarnAppLauncher.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1tb2R1bGVzL2dvYmJsaW4tYXprYWJhbi9zcmMvbWFpbi9qYXZhL29yZy9hcGFjaGUvZ29iYmxpbi9hemthYmFuL0F6a2FiYW5Hb2JibGluWWFybkFwcExhdW5jaGVyLmphdmE=)
 | `30.55% <0%> (-2.78%)` | `2% <0%> (ø)` | |
   | 
[...anagement/copy/replication/ConfigBasedDataset.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1kYXRhLW1hbmFnZW1lbnQvc3JjL21haW4vamF2YS9vcmcvYXBhY2hlL2dvYmJsaW4vZGF0YS9tYW5hZ2VtZW50L2NvcHkvcmVwbGljYXRpb24vQ29uZmlnQmFzZWREYXRhc2V0LmphdmE=)
 | `68.87% <0%> (-1.68%)` | `10% <0%> (ø)` | |
   | 
[...e/gobblin/runtime/locks/ZookeeperBasedJobLock.java](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree#diff-Z29iYmxpbi1ydW50aW1lL3NyYy9tYWluL2phdmEvb3JnL2FwYWNoZS9nb2JibGluL3J1bnRpbWUvbG9ja3MvWm9va2VlcGVyQmFzZWRKb2JMb2NrLmphdmE=)
 | `63.33% <0%> (-1.12%)` | `15% <0%> (-1%)` | |
   | ... and [19 
more](https://codecov.io/gh/apache/incubator-gobblin/pull/2898/diff?src=pr=tree-more)
 | |
   
   --
   

[jira] [Updated] (GOBBLIN-1059) Failed to run job for MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
Priority: Trivial  (was: Critical)

> Failed to run job for MySQL to HDFS ingestion
> -
>
> Key: GOBBLIN-1059
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1059
> Project: Apache Gobblin
>  Issue Type: Bug
>  Components: gobblin-sql
> Environment: linux subsystem, hadoop 2.3.0,
>Reporter: Chhaya Vankhede
>Assignee: Shirshanka Das
>Priority: Trivial
>  Labels: beginner
>
> I am tring to ingest data from mysql to hdfs but getting below error.  The 
> job file and properties are as below
> mysql.pull
> 
> #Job properties
>  job.name=GobblinMySql
>  job.group=MySql
>  job.description=Data pull from MySql
> #Extract properties
>  extract.namespace=demodb
>  extract.table.type=snapshot_only
>  extract.table.name=user
> #Property to consider the extract as full dump
>  extract.is.full=true
> #Source properties
>  source.querybased.schema=user
>  source.entity=user
>  source.querybased.extract.type=snapshot   
> ```
>  mysql.propertes
>  ```
> #Source properties - source class to extract data from Mysql Source
>  source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
> #Source properties
>  source.max.number.of.partitions=1
>  source.querybased.partition.interval=1
>  source.querybased.is.compression=true
>  source.querybased.watermark.type=timestamp
> #Source connection properties
>  source.conn.driver=com.mysql.jdbc.Driver
>  source.conn.username=root
>  source.conn.password=root
>  source.conn.host=localhost
>  source.conn.port=3306
>  source.conn.timeout=1500
> #Converter properties - Record from mysql source will be processed by the 
> below series of converters
>  
> converter.classes=org.apache.gobblin.converter.avro.JsonIntermediateToAvroConverter
> #date columns format
>  converter.avro.timestamp.format=-MM-dd HH:mm:ss'.0'
>  converter.avro.date.format=-MM-dd
>  converter.avro.time.format=HH:mm:ss
> #Qualitychecker properties
>  
> qualitychecker.task.policies=org.apache.gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy
>  qualitychecker.task.policy.types=OPTIONAL,OPTIONAL
> #Publisher properties
>  data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher
> below is error log
> ```
> GMT ERROR [JobScheduler-0] 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 - Failed 
> to run job GobblinMySql
>  org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
>  at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
>  at 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
>  at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  at java.lang.Thread.run(Thread.java:748)
>  Caused by: java.lang.RuntimeException: Failed to create job launcher: 
> java.lang.ClassNotFoundException: 
> org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68)
>  at 
> org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450)
>  at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443)
>  ... 4 more
>  Caused by: java.lang.ClassNotFoundException: 
> org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
>  at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:419)
>  at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:352)
>  at java.lang.Class.forName0(Native Method)
>  at java.lang.Class.forName(Class.java:264)
>  at org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240)
>  at org.apache.gobblin.runtime.JobContext.(JobContext.java:172)
>  at 
> org.apache.gobblin.runtime.AbstractJobLauncher.(AbstractJobLauncher.java:201)
>  at 
> org.apache.gobblin.runtime.local.LocalJobLauncher.(LocalJobLauncher.java:86)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144)
>  ... 9 more
>  2020-02-25 08:02:45 GMT ERROR [JobScheduler-0] 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 - Failed 
> to run job GobblinMySql
>  

[jira] [Updated] (GOBBLIN-1059) Failed to run job for MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
Priority: Critical  (was: Major)

> Failed to run job for MySQL to HDFS ingestion
> -
>
> Key: GOBBLIN-1059
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1059
> Project: Apache Gobblin
>  Issue Type: Bug
>  Components: gobblin-sql
> Environment: linux subsystem, hadoop 2.3.0,
>Reporter: Chhaya Vankhede
>Assignee: Shirshanka Das
>Priority: Critical
>  Labels: beginner
>
> I am tring to ingest data from mysql to hdfs but getting below error.  The 
> job file and properties are as below
> mysql.pull
> 
> #Job properties
>  job.name=GobblinMySql
>  job.group=MySql
>  job.description=Data pull from MySql
> #Extract properties
>  extract.namespace=demodb
>  extract.table.type=snapshot_only
>  extract.table.name=user
> #Property to consider the extract as full dump
>  extract.is.full=true
> #Source properties
>  source.querybased.schema=user
>  source.entity=user
>  source.querybased.extract.type=snapshot   
> ```
>  mysql.propertes
>  ```
> #Source properties - source class to extract data from Mysql Source
>  source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
> #Source properties
>  source.max.number.of.partitions=1
>  source.querybased.partition.interval=1
>  source.querybased.is.compression=true
>  source.querybased.watermark.type=timestamp
> #Source connection properties
>  source.conn.driver=com.mysql.jdbc.Driver
>  source.conn.username=root
>  source.conn.password=root
>  source.conn.host=localhost
>  source.conn.port=3306
>  source.conn.timeout=1500
> #Converter properties - Record from mysql source will be processed by the 
> below series of converters
>  
> converter.classes=org.apache.gobblin.converter.avro.JsonIntermediateToAvroConverter
> #date columns format
>  converter.avro.timestamp.format=-MM-dd HH:mm:ss'.0'
>  converter.avro.date.format=-MM-dd
>  converter.avro.time.format=HH:mm:ss
> #Qualitychecker properties
>  
> qualitychecker.task.policies=org.apache.gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy
>  qualitychecker.task.policy.types=OPTIONAL,OPTIONAL
> #Publisher properties
>  data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher
> below is error log
> ```
> GMT ERROR [JobScheduler-0] 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 - Failed 
> to run job GobblinMySql
>  org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
>  at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
>  at 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
>  at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  at java.lang.Thread.run(Thread.java:748)
>  Caused by: java.lang.RuntimeException: Failed to create job launcher: 
> java.lang.ClassNotFoundException: 
> org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68)
>  at 
> org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450)
>  at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443)
>  ... 4 more
>  Caused by: java.lang.ClassNotFoundException: 
> org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
>  at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:419)
>  at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:352)
>  at java.lang.Class.forName0(Native Method)
>  at java.lang.Class.forName(Class.java:264)
>  at org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240)
>  at org.apache.gobblin.runtime.JobContext.(JobContext.java:172)
>  at 
> org.apache.gobblin.runtime.AbstractJobLauncher.(AbstractJobLauncher.java:201)
>  at 
> org.apache.gobblin.runtime.local.LocalJobLauncher.(LocalJobLauncher.java:86)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144)
>  ... 9 more
>  2020-02-25 08:02:45 GMT ERROR [JobScheduler-0] 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 - Failed 
> to run job GobblinMySql
>  

[jira] [Updated] (GOBBLIN-1059) Failed to run job for MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
Summary: Failed to run job for MySQL to HDFS ingestion  (was: MySQL to HDFS 
ingestion)

> Failed to run job for MySQL to HDFS ingestion
> -
>
> Key: GOBBLIN-1059
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1059
> Project: Apache Gobblin
>  Issue Type: Bug
>  Components: gobblin-sql
> Environment: linux subsystem, hadoop 2.3.0,
>Reporter: Chhaya Vankhede
>Assignee: Shirshanka Das
>Priority: Major
>  Labels: beginner
>
> I am tring to ingest data from mysql to hdfs but getting below error.  The 
> job file and properties are as below
> mysql.pull
> 
> #Job properties
>  job.name=GobblinMySql
>  job.group=MySql
>  job.description=Data pull from MySql
> #Extract properties
>  extract.namespace=demodb
>  extract.table.type=snapshot_only
>  extract.table.name=user
> #Property to consider the extract as full dump
>  extract.is.full=true
> #Source properties
>  source.querybased.schema=user
>  source.entity=user
>  source.querybased.extract.type=snapshot   
> ```
>  mysql.propertes
>  ```
> #Source properties - source class to extract data from Mysql Source
>  source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
> #Source properties
>  source.max.number.of.partitions=1
>  source.querybased.partition.interval=1
>  source.querybased.is.compression=true
>  source.querybased.watermark.type=timestamp
> #Source connection properties
>  source.conn.driver=com.mysql.jdbc.Driver
>  source.conn.username=root
>  source.conn.password=root
>  source.conn.host=localhost
>  source.conn.port=3306
>  source.conn.timeout=1500
> #Converter properties - Record from mysql source will be processed by the 
> below series of converters
>  
> converter.classes=org.apache.gobblin.converter.avro.JsonIntermediateToAvroConverter
> #date columns format
>  converter.avro.timestamp.format=-MM-dd HH:mm:ss'.0'
>  converter.avro.date.format=-MM-dd
>  converter.avro.time.format=HH:mm:ss
> #Qualitychecker properties
>  
> qualitychecker.task.policies=org.apache.gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy
>  qualitychecker.task.policy.types=OPTIONAL,OPTIONAL
> #Publisher properties
>  data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher
> below is error log
> ```
> GMT ERROR [JobScheduler-0] 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 - Failed 
> to run job GobblinMySql
>  org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
>  at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
>  at 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
>  at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  at java.lang.Thread.run(Thread.java:748)
>  Caused by: java.lang.RuntimeException: Failed to create job launcher: 
> java.lang.ClassNotFoundException: 
> org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68)
>  at 
> org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450)
>  at org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443)
>  ... 4 more
>  Caused by: java.lang.ClassNotFoundException: 
> org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
>  at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:419)
>  at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352)
>  at java.lang.ClassLoader.loadClass(ClassLoader.java:352)
>  at java.lang.Class.forName0(Native Method)
>  at java.lang.Class.forName(Class.java:264)
>  at org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240)
>  at org.apache.gobblin.runtime.JobContext.(JobContext.java:172)
>  at 
> org.apache.gobblin.runtime.AbstractJobLauncher.(AbstractJobLauncher.java:201)
>  at 
> org.apache.gobblin.runtime.local.LocalJobLauncher.(LocalJobLauncher.java:86)
>  at 
> org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144)
>  ... 9 more
>  2020-02-25 08:02:45 GMT ERROR [JobScheduler-0] 
> org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner 637 

[jira] [Updated] (GOBBLIN-1059) MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
  Docs Text:   (was: I am tring to ingest data from mysql to hdfs but 
getting below error.  The job file and properties are as below

mysql.pull
```

# Job properties
job.name=GobblinMySql
job.group=MySql
job.description=Data pull from MySql

# Extract properties
extract.namespace=demodb
extract.table.type=snapshot_only
extract.table.name=user

# Property to consider the extract as full dump
extract.is.full=true

# Source properties
source.querybased.schema=user
source.entity=user
source.querybased.extract.type=snapshot   

```
mysql.propertes
```
# Source properties - source class to extract data from Mysql Source
source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource

# Source properties
source.max.number.of.partitions=1
source.querybased.partition.interval=1
source.querybased.is.compression=true
source.querybased.watermark.type=timestamp

# Source connection properties
source.conn.driver=com.mysql.jdbc.Driver
source.conn.username=root
source.conn.password=root
source.conn.host=localhost
source.conn.port=3306
source.conn.timeout=1500

# Converter properties - Record from mysql source will be processed by the 
below series of converters
converter.classes=org.apache.gobblin.converter.avro.JsonIntermediateToAvroConverter

# date columns format
converter.avro.timestamp.format=-MM-dd HH:mm:ss'.0'
converter.avro.date.format=-MM-dd
converter.avro.time.format=HH:mm:ss

# Qualitychecker properties
qualitychecker.task.policies=org.apache.gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy
qualitychecker.task.policy.types=OPTIONAL,OPTIONAL

# Publisher properties
data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher
```

below is error log

```
GMT ERROR [JobScheduler-0] 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner  637 - Failed 
to run job GobblinMySql
org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
at 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Failed to create job launcher: 
java.lang.ClassNotFoundException: 
org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68)
at 
org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450)
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443)
... 4 more
Caused by: java.lang.ClassNotFoundException: 
org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:419)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352)
at java.lang.ClassLoader.loadClass(ClassLoader.java:352)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:264)
at 
org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240)
at org.apache.gobblin.runtime.JobContext.(JobContext.java:172)
at 
org.apache.gobblin.runtime.AbstractJobLauncher.(AbstractJobLauncher.java:201)
at 
org.apache.gobblin.runtime.local.LocalJobLauncher.(LocalJobLauncher.java:86)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144)
... 9 more
2020-02-25 08:02:45 GMT ERROR [JobScheduler-0] 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner  637 - Failed 
to run job GobblinMySql
org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
at 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Failed to create job launcher: 

[jira] [Updated] (GOBBLIN-1059) MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
Description: (was: using command 
`bin/gobblin.sh service standalone start --jars 
/mnt/C/Users/name/incubator-gobblin/build/gobblin-sql/libs/gobblin-sql-0.15.0.jar`

How to solve it?)

> MySQL to HDFS ingestion
> ---
>
> Key: GOBBLIN-1059
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1059
> Project: Apache Gobblin
>  Issue Type: Bug
>  Components: gobblin-sql
> Environment: linux subsystem, hadoop 2.3.0,
>Reporter: Chhaya Vankhede
>Assignee: Shirshanka Das
>Priority: Blocker
>  Labels: beginner
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (GOBBLIN-1059) MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
Priority: Major  (was: Blocker)

> MySQL to HDFS ingestion
> ---
>
> Key: GOBBLIN-1059
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1059
> Project: Apache Gobblin
>  Issue Type: Bug
>  Components: gobblin-sql
> Environment: linux subsystem, hadoop 2.3.0,
>Reporter: Chhaya Vankhede
>Assignee: Shirshanka Das
>Priority: Major
>  Labels: beginner
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (GOBBLIN-1059) MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
Description: 
using command 
`bin/gobblin.sh service standalone start --jars 
/mnt/C/Users/name/incubator-gobblin/build/gobblin-sql/libs/gobblin-sql-0.15.0.jar`

How to solve it?

> MySQL to HDFS ingestion
> ---
>
> Key: GOBBLIN-1059
> URL: https://issues.apache.org/jira/browse/GOBBLIN-1059
> Project: Apache Gobblin
>  Issue Type: Bug
>  Components: gobblin-sql
> Environment: linux subsystem, hadoop 2.3.0,
>Reporter: Chhaya Vankhede
>Assignee: Shirshanka Das
>Priority: Blocker
>  Labels: beginner
>
> using command 
> `bin/gobblin.sh service standalone start --jars 
> /mnt/C/Users/name/incubator-gobblin/build/gobblin-sql/libs/gobblin-sql-0.15.0.jar`
> How to solve it?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (GOBBLIN-1059) MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)


 [ 
https://issues.apache.org/jira/browse/GOBBLIN-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chhaya Vankhede updated GOBBLIN-1059:
-
  Docs Text: 
I am tring to ingest data from mysql to hdfs but getting below error.  The job 
file and properties are as below

mysql.pull
```

# Job properties
job.name=GobblinMySql
job.group=MySql
job.description=Data pull from MySql

# Extract properties
extract.namespace=demodb
extract.table.type=snapshot_only
extract.table.name=user

# Property to consider the extract as full dump
extract.is.full=true

# Source properties
source.querybased.schema=user
source.entity=user
source.querybased.extract.type=snapshot   

```
mysql.propertes
```
# Source properties - source class to extract data from Mysql Source
source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource

# Source properties
source.max.number.of.partitions=1
source.querybased.partition.interval=1
source.querybased.is.compression=true
source.querybased.watermark.type=timestamp

# Source connection properties
source.conn.driver=com.mysql.jdbc.Driver
source.conn.username=root
source.conn.password=root
source.conn.host=localhost
source.conn.port=3306
source.conn.timeout=1500

# Converter properties - Record from mysql source will be processed by the 
below series of converters
converter.classes=org.apache.gobblin.converter.avro.JsonIntermediateToAvroConverter

# date columns format
converter.avro.timestamp.format=-MM-dd HH:mm:ss'.0'
converter.avro.date.format=-MM-dd
converter.avro.time.format=HH:mm:ss

# Qualitychecker properties
qualitychecker.task.policies=org.apache.gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy
qualitychecker.task.policy.types=OPTIONAL,OPTIONAL

# Publisher properties
data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher
```

below is error log

```
GMT ERROR [JobScheduler-0] 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner  637 - Failed 
to run job GobblinMySql
org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
at 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Failed to create job launcher: 
java.lang.ClassNotFoundException: 
org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68)
at 
org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450)
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443)
... 4 more
Caused by: java.lang.ClassNotFoundException: 
org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:419)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352)
at java.lang.ClassLoader.loadClass(ClassLoader.java:352)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:264)
at 
org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240)
at org.apache.gobblin.runtime.JobContext.(JobContext.java:172)
at 
org.apache.gobblin.runtime.AbstractJobLauncher.(AbstractJobLauncher.java:201)
at 
org.apache.gobblin.runtime.local.LocalJobLauncher.(LocalJobLauncher.java:86)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144)
... 9 more
2020-02-25 08:02:45 GMT ERROR [JobScheduler-0] 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner  637 - Failed 
to run job GobblinMySql
org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
at 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Failed to create job launcher: 

[jira] [Created] (GOBBLIN-1059) MySQL to HDFS ingestion

2020-02-25 Thread Chhaya Vankhede (Jira)
Chhaya Vankhede created GOBBLIN-1059:


 Summary: MySQL to HDFS ingestion
 Key: GOBBLIN-1059
 URL: https://issues.apache.org/jira/browse/GOBBLIN-1059
 Project: Apache Gobblin
  Issue Type: Bug
  Components: gobblin-sql
 Environment: linux subsystem, hadoop 2.3.0,
Reporter: Chhaya Vankhede
Assignee: Shirshanka Das


I am tring to ingest data from mysql to hdfs but getting below error.  The job 
file and properties are as below

mysql.pull
```

# Job properties
job.name=GobblinMySql
job.group=MySql
job.description=Data pull from MySql

# Extract properties
extract.namespace=demodb
extract.table.type=snapshot_only
extract.table.name=user

# Property to consider the extract as full dump
extract.is.full=true

# Source properties
source.querybased.schema=user
source.entity=user
source.querybased.extract.type=snapshot   

```
mysql.propertes
```
# Source properties - source class to extract data from Mysql Source
source.class=org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource

# Source properties
source.max.number.of.partitions=1
source.querybased.partition.interval=1
source.querybased.is.compression=true
source.querybased.watermark.type=timestamp

# Source connection properties
source.conn.driver=com.mysql.jdbc.Driver
source.conn.username=root
source.conn.password=root
source.conn.host=localhost
source.conn.port=3306
source.conn.timeout=1500

# Converter properties - Record from mysql source will be processed by the 
below series of converters
converter.classes=org.apache.gobblin.converter.avro.JsonIntermediateToAvroConverter

# date columns format
converter.avro.timestamp.format=-MM-dd HH:mm:ss'.0'
converter.avro.date.format=-MM-dd
converter.avro.time.format=HH:mm:ss

# Qualitychecker properties
qualitychecker.task.policies=org.apache.gobblin.policies.count.RowCountPolicy,gobblin.policies.schema.SchemaCompatibilityPolicy
qualitychecker.task.policy.types=OPTIONAL,OPTIONAL

# Publisher properties
data.publisher.type=org.apache.gobblin.publisher.BaseDataPublisher
```

below is error log

```
GMT ERROR [JobScheduler-0] 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner  637 - Failed 
to run job GobblinMySql
org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
at 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Failed to create job launcher: 
java.lang.ClassNotFoundException: 
org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:158)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:107)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:85)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:68)
at 
org.apache.gobblin.scheduler.JobScheduler.buildJobLauncher(JobScheduler.java:450)
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:443)
... 4 more
Caused by: java.lang.ClassNotFoundException: 
org.apache.gobblin.source.extractor.extract.jdbc.MysqlSource
at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
at java.lang.ClassLoader.loadClass(ClassLoader.java:419)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352)
at java.lang.ClassLoader.loadClass(ClassLoader.java:352)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:264)
at 
org.apache.gobblin.runtime.JobContext.createSource(JobContext.java:240)
at org.apache.gobblin.runtime.JobContext.(JobContext.java:172)
at 
org.apache.gobblin.runtime.AbstractJobLauncher.(AbstractJobLauncher.java:201)
at 
org.apache.gobblin.runtime.local.LocalJobLauncher.(LocalJobLauncher.java:86)
at 
org.apache.gobblin.runtime.JobLauncherFactory.newJobLauncher(JobLauncherFactory.java:144)
... 9 more
2020-02-25 08:02:45 GMT ERROR [JobScheduler-0] 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner  637 - Failed 
to run job GobblinMySql
org.apache.gobblin.runtime.JobException: Failed to run job GobblinMySql
at 
org.apache.gobblin.scheduler.JobScheduler.runJob(JobScheduler.java:445)
at 
org.apache.gobblin.scheduler.JobScheduler$NonScheduledJobRunner.run(JobScheduler.java:635)
at