[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-04-03 Thread viirya
Github user viirya commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-89326238
  
I compared the performance of the old and new code, but found no significant 
improvement.

Data: 6000 rows of `complexFieldAndType1` from `TestJsonData`. The test calls 
`jsonRDD` to create two JSON RDDs.

old:
29 seconds, 714 milliseconds
29 seconds, 654 milliseconds
29 seconds, 590 milliseconds
29 seconds, 556 milliseconds
29 seconds, 461 milliseconds


new:
29 seconds, 272 milliseconds
29 seconds, 358 milliseconds
28 seconds, 991 milliseconds
29 seconds, 118 milliseconds
28 seconds, 926 milliseconds

Because this PR doesn't bring much performance improvement, I am closing it 
now.
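
For reference, the size of the difference can be worked out from the timings listed above. The snippet below is only an illustrative sketch of that arithmetic, not part of the original benchmark; the variable names are made up for the example, and the values are the five old and five new timings converted to milliseconds.

```python
from statistics import mean

# Timings copied from the lists above, converted to milliseconds.
old_ms = [29714, 29654, 29590, 29556, 29461]
new_ms = [29272, 29358, 28991, 29118, 28926]

mean_old = mean(old_ms)
mean_new = mean(new_ms)

# Relative improvement of the new code over the old code, in percent.
improvement_pct = (mean_old - mean_new) / mean_old * 100

print(f"old mean: {mean_old / 1000:.3f} s")       # old mean: 29.595 s
print(f"new mean: {mean_new / 1000:.3f} s")       # new mean: 29.133 s
print(f"improvement: {improvement_pct:.2f}%")     # improvement: 1.56%
```

On these numbers the new code is roughly 0.46 seconds (about 1.6%) faster on average, which is consistent with the conclusion that the gain is small.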


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-04-03 Thread viirya
Github user viirya closed the pull request at:

https://github.com/apache/spark/pull/4712





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-04-02 Thread marmbrus
Github user marmbrus commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-89095009
  
Sorry for the delay here.  Can you post a performance comparison?

/cc @yhuai





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-03-17 Thread viirya
Github user viirya commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-82218632
  
@marmbrus @liancheng Please take time to review this. Thanks!





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-27 Thread viirya
Github user viirya commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-76509265
  
@liancheng Description is updated. Please take a look when you have time. 
Thanks!





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-27 Thread liancheng
Github user liancheng commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-76498312
  
@viirya Could you please add a brief overview of your update to the PR 
description? It's not only for the sake of code review. PR descriptions are 
merged into the Git history together with the PRs, so they can be very helpful 
when tracing Git history. Thanks!





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-27 Thread marmbrus
Github user marmbrus commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-76423117
  
Not until after the 1.3 release.
On Feb 27, 2015 8:24 AM, "Liang-Chi Hsieh" wrote:

> @marmbrus do you have time to take a look at this?






[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-27 Thread viirya
Github user viirya commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-76422284
  
@marmbrus do you have time to take a look at this?





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-25 Thread viirya
Github user viirya commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-76102545
  
@marmbrus please take a look at this when you have time. Thanks!





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-24 Thread viirya
Github user viirya commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75758381
  
cc @marmbrus.





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-24 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75716105
  
[Test build #27884 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27884/consoleFull) for PR 4712 at commit [`723a8f3`](https://github.com/apache/spark/commit/723a8f3dcc17cc87e883ef7c83a6ff6f9aba8665).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75716111
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27884/





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-23 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75709018
  
[Test build #27884 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27884/consoleFull) for PR 4712 at commit [`723a8f3`](https://github.com/apache/spark/commit/723a8f3dcc17cc87e883ef7c83a6ff6f9aba8665).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75370232
  
Test PASSed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27811/





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75370229
  
[Test build #27811 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27811/consoleFull) for PR 4712 at commit [`3d775ed`](https://github.com/apache/spark/commit/3d775ed81975fe9738f205262f69737332ac66c4).
 * This patch **passes all tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75368060
  
[Test build #27811 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27811/consoleFull) for PR 4712 at commit [`3d775ed`](https://github.com/apache/spark/commit/3d775ed81975fe9738f205262f69737332ac66c4).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75367933
  
Test FAILed.
Refer to this link for build results (access rights to CI server needed): 
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27810/





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75367932
  
[Test build #27810 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27810/consoleFull) for PR 4712 at commit [`530ca35`](https://github.com/apache/spark/commit/530ca35e75f3b6e112fb3690c2fc40c574e63dd7).
 * This patch **fails Scala style tests**.
 * This patch merges cleanly.
 * This patch adds no public classes.





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-21 Thread SparkQA
Github user SparkQA commented on the pull request:

https://github.com/apache/spark/pull/4712#issuecomment-75367901
  
[Test build #27810 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/27810/consoleFull) for PR 4712 at commit [`530ca35`](https://github.com/apache/spark/commit/530ca35e75f3b6e112fb3690c2fc40c574e63dd7).
 * This patch merges cleanly.





[GitHub] spark pull request: [SPARK-5938][SQL] Generate Row from JSON strin...

2015-02-21 Thread viirya
GitHub user viirya opened a pull request:

https://github.com/apache/spark/pull/4712

[SPARK-5938][SQL] Generate Row from JSON string efficiently

Generate rows from JSON strings more efficiently in the `JsonRDD` object.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/viirya/spark-1 efficient_jsonrow

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/4712.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #4712


commit 530ca35e75f3b6e112fb3690c2fc40c574e63dd7
Author: Liang-Chi Hsieh 
Date:   2015-02-21T11:25:11Z

Generate row from json efficiently.



