Mailing lists matching spark.apache.org

commits spark.apache.org
dev spark.apache.org
issues spark.apache.org
reviews spark.apache.org
user spark.apache.org


[GitHub] [spark] dongjoon-hyun commented on pull request #33101: [SPARK-35907][CORE] Instead of File#mkdirs, Files#createDirectories is expected.

2021-06-26 Thread GitBox
dongjoon-hyun commented on pull request #33101: URL: https://github.com/apache/spark/pull/33101#issuecomment-869030227 Thank you for making a PR, @Shockang . Could you enable GitHub Action on your Spark fork? - https://spark.apache.org/developer-tools.html (Testing with GitHub Actions

[GitHub] [spark] HyukjinKwon commented on pull request #33345: [PYTHON] clarify documentation for dayofweek

2021-07-14 Thread GitBox
HyukjinKwon commented on pull request #33345: URL: https://github.com/apache/spark/pull/33345#issuecomment-880347647 @dominikgehl would you mind filing a JIRA, and link it to the PR title? see also https://spark.apache.org/contributing.html. Also Apache Spark uses the resources from

[GitHub] [spark] caican00 commented on pull request #37608: update

2022-08-22 Thread GitBox
caican00 commented on PR #37608: URL: https://github.com/apache/spark/pull/37608#issuecomment-1221948497 > @caican00 mind creating a JIRA, and fix the PR title? See also https://spark.apache.org/contributing.html. > > Also, we should probably fix it in the `master` branch i

[GitHub] [spark] itholic commented on pull request #37647: Fixed timestamp conversion on Windows

2022-08-26 Thread GitBox
[contribution guide for Apache Spark](https://spark.apache.org/contributing.html). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr

[GitHub] [spark] HyukjinKwon commented on pull request #37793: add sparksql wirte mysql support update ,the design from replace into…

2022-09-04 Thread GitBox
HyukjinKwon commented on PR #37793: URL: https://github.com/apache/spark/pull/37793#issuecomment-1236525006 @jyong-somnambulist please file a JIRA, and link it to the PR title. See also https://spark.apache.org/contributing.html. The codebase is written in English so let's stick t

[GitHub] [spark] MaxGekk commented on pull request #39719: [SPARK-42169] [SQL] Implement code generation for to_csv function (StructsToCsv)

2023-02-04 Thread via GitHub
MaxGekk commented on PR #39719: URL: https://github.com/apache/spark/pull/39719#issuecomment-1417051227 @NarekDW Could you regenerate benchmark results using GitHub actions, see https://spark.apache.org/developer-tools.html (Running benchmarks in your forked repository) and update

[GitHub] [spark] MaxGekk commented on pull request #40033: [SPARK-38324][SQL] The second range is not [0, 59] in the day time ANSI interval

2023-02-16 Thread via GitHub
see "Pull request" at https://spark.apache.org/contributing.html -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spa

[GitHub] [spark] HyukjinKwon commented on pull request #38167: fix problems that affect windows shell environments (cygwin/msys2/mingw)

2022-10-10 Thread GitBox
HyukjinKwon commented on PR #38167: URL: https://github.com/apache/spark/pull/38167#issuecomment-1273155534 Thanks for the contribution. Would you mind checking https://github.com/apache/spark/pull/38167/checks?check_run_id=8783733198 and https://spark.apache.org/contributing.html? e.g

[GitHub] [spark] chenminghua8 commented on pull request #38213: fix runtime filter do not execute when no stats

2022-10-12 Thread GitBox
chenminghua8 commented on PR #38213: URL: https://github.com/apache/spark/pull/38213#issuecomment-1276931035 > A JIRA ticket is needed, you can refer to https://spark.apache.org/contributing.html > > if this is a bug fix, it's better to have a UT for the fix.

[GitHub] [spark] chenminghua8 commented on pull request #38381: Fix the LogicalRelation computeStats for Row-level Runtime Filtering cannot be applied

2022-10-24 Thread GitBox
chenminghua8 commented on PR #38381: URL: https://github.com/apache/spark/pull/38381#issuecomment-1289909307 > @chenminghua8 mind linking the JIRA ticket into the PR title? See also https://spark.apache.org/contributing.html @HyukjinKwon Thank you! but I don't know how to get

[GitHub] [spark] dongjoon-hyun commented on pull request #38262: [SPARK-40801][BUILD] Upgrade `Apache commons-text` to 1.10

2022-11-17 Thread GitBox
dongjoon-hyun commented on PR #38262: URL: https://github.com/apache/spark/pull/38262#issuecomment-1319545728 Apache Spark has a pre-defined release cadence, @vitas and @bjornjorgensen . - https://spark.apache.org/versioning-policy.html ![Screenshot 2022-11-17 at 8 56 29 PM](https

Re: [PR] [MINOR][PYTHON][DOCS] Typo fixed yyy to yyyy at date_format function [spark]

2023-10-19 Thread via GitHub
metecanakar commented on PR #43442: URL: https://github.com/apache/spark/pull/43442#issuecomment-1770830518 > Mind taking a look at https://github.com/apache/spark/pull/43442/checks?check_run_id=17836826969? Let's also file a JIRA, see also https://spark.apache.org/contribut

Re: [PR] [MINOR][PYTHON][DOCS] Typo fixed yyy to yyyy at date_format function [spark]

2023-10-19 Thread via GitHub
metecanakar commented on PR #43442: URL: https://github.com/apache/spark/pull/43442#issuecomment-1771448937 > Mind taking a look at https://github.com/apache/spark/pull/43442/checks?check_run_id=17836826969? Let's also file a JIRA, see also https://spark.apache.org/contribut

Re: [PR] [SPARK-46111][DOCS][PYTHON] Add copyright to the PySpark official documentation. [spark]

2023-11-26 Thread via GitHub
/licenses/LICENSE-2.0";>Apache License, Version 2.0. Review Comment: FYI: I follow this copyright format from [Apache Spark official web page](https://spark.apache.org/). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [spark] dongjoon-hyun commented on pull request #41000: [SPARK-43327] Trigger `committer.setupJob` before plan execute in `FileFormatWriter#write`

2023-05-11 Thread via GitHub
`. - https://spark.apache.org/versioning-policy.html -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For queries

Re: [PR] Do not convert array type string retrieved from jdbc driver [spark]

2024-01-07 Thread via GitHub
phanhuyn commented on PR #44459: URL: https://github.com/apache/spark/pull/44459#issuecomment-1880206832 > Thanks for the PR. Mind creating a JIRA please? (see also https://spark.apache.org/contributing.html). Thanks for the reply @HyukjinKwon. I've applied for a JIRA ac

[GitHub] [spark] bersprockets commented on pull request #36871: [SPARK-39469][SQL] Infer date type for CSV schema inference

2022-06-16 Thread GitBox
bersprockets commented on PR #36871: URL: https://github.com/apache/spark/pull/36871#issuecomment-1158329998 @Jonathancui123 You probably want to turn on github actions so tests will run. From https://spark.apache.org/contributing.html: >Go to “Actions” tab on your for

[GitHub] [spark] srowen commented on pull request #37016: Driver cores mult be a positive number fix

2022-06-28 Thread GitBox
srowen commented on PR #37016: URL: https://github.com/apache/spark/pull/37016#issuecomment-1168984035 Please file a JIRA and update the PR per https://spark.apache.org/contributing.html Can we not just fix the Mesos component itself? this is hacky but not terrible. Mesos support

[GitHub] [spark] ArjunSharda commented on pull request #37056: (GitHub CI) Bump workflow versions

2022-07-03 Thread GitBox
ArjunSharda commented on PR #37056: URL: https://github.com/apache/spark/pull/37056#issuecomment-1173251977 > @ArjunSharda mind taking a look at https://spark.apache.org/contributing.html? We should file a JIRA, fix the title format, feel the PR description format, etc. Hey

[GitHub] [spark] dongjoon-hyun commented on pull request #36069: [SPARK-38767][SQL] Support `ignoreCorruptFiles` and `ignoreMissingFiles` in Data Source options

2022-07-20 Thread GitBox
dongjoon-hyun commented on PR #36069: URL: https://github.com/apache/spark/pull/36069#issuecomment-1190997582 FYI, Apache Spark 3.4 code freeze was March 15th, 2022. This patch simply arrived one month later after code freeze deadline. - https://spark.apache.org/versioning

[GitHub] [spark] pan3793 commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+

2023-08-15 Thread via GitHub
oved in Spark 4?](https://www.mail-archive.com/dev@spark.apache.org/msg30708.html). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: rev

[GitHub] [spark] dongjoon-hyun commented on pull request #42678: [SPARK-44963][PYTHON][ML][TESTS] Make PySpark (pyspark-ml module) tests passing without any optional dependency

2023-08-25 Thread via GitHub
dongjoon-hyun commented on PR #42678: URL: https://github.com/apache/spark/pull/42678#issuecomment-1693865446 I added `optional` into the PR title because `numpy` is required dependency for MLLib. - https://spark.apache.org/docs/latest/api/python/getting_started/install.html

[GitHub] [spark] HeartSaVioR commented on pull request #42822: [SPARK-45084][SS] ProgressReport to include accurate effective shuffle partition number

2023-09-06 Thread via GitHub
HeartSaVioR commented on PR #42822: URL: https://github.com/apache/spark/pull/42822#issuecomment-1709293644 https://spark.apache.org/developer-tools.html Could you please follow the section `Running benchmarks in your forked repository`? I'm not sure how I can enable the CI -

Re: [PR] [SPARK-45273][CORE][UI] Support for set the access host in http header [spark]

2023-10-09 Thread via GitHub
srowen commented on PR #43169: URL: https://github.com/apache/spark/pull/43169#issuecomment-1752893775 How does this arise? Use priv...@spark.apache.org if needed. I am not clear what attack you have in mind or whether it can affect spark, so, no this would not be useful unless there'

Re: [PR] [SPARK-49324] Add state transition e2e test for happy path [spark-kubernetes-operator]

2024-08-23 Thread via GitHub
: -apiVersion: v1 +apiVersion: spark.apache.org/v1alpha1 Review Comment: I believe this is an orthogonal PR which needs a new JIRA ID. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] spark issue #18611: Create _404_programming-guide.html

2017-07-12 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/18611 You're right that the page is no longer generated, but, this PR doesn't make sense as a fix. It should be closed. Instead the main spark.apache.org site needs to poin

[GitHub] HyukjinKwon commented on issue #23348: [SPARK-25857][core] Add developer documentation regarding delegation tokens.

2019-01-03 Thread GitBox
https://spark.apache.org/docs/latest/running-on-yarn.html after refining it? We have Kerberos chapter https://spark.apache.org/docs/latest/running-on-yarn.html#kerberos This is an automated message from the Apache Git Service. To respond to the me

[GitHub] spark pull request: SPARK-5390 [DOCS] Encourage users to post on S...

2015-03-02 Thread pwendell
Github user pwendell commented on a diff in the pull request: https://github.com/apache/spark/pull/4843#discussion_r25646936 --- Diff: docs/index.md --- @@ -115,6 +115,8 @@ options for deployment: * [Spark Homepage](http://spark.apache.org) * [Spark Wiki](https

Re: How to bind webui to localhost?

2016-01-14 Thread Zee Chen
;t allow the user to directly specify the ip addr to bind >> services to. >> >> - >> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org >> For additional commands, e-mail: user-h...@spark.apache.org >> > --

RE: Achieving 700 Spark SQL Queries Per Second

2016-03-10 Thread Silvio Fiorito
Very cool stuff Evan. Thanks for your work on this and sharing! From: Evan Chan<mailto:velvia.git...@gmail.com> Sent: Thursday, March 10, 2016 1:38 PM To: user@spark.apache.org<mailto:user@spark.apache.org> Subject: Achieving 700 Spark SQL Queries Per Second Hey folks,

DataFrame more efficient than RDD?

2015-07-15 Thread k0ala
Hi, I have been working a bit with RDD, and am now taking a look at DataFrames. The schema definition using case classes looks very attractive; https://spark.apache.org/docs/1.4.0/sql-programming-guide.html#inferring-the-schema-using-reflection <https://spark.apache.org/docs/1.4.0/

Re: it seem like the exactly once feature not work on spark1.4

2015-07-17 Thread JoneZhang
base on http://spark.apache.org/docs/latest/streaming-programming-guide.html <http://spark.apache.org/docs/latest/streaming-programming-guide.html> -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/it-seem-like-the-exactly-once-feature-not-work-o

Fwd: use S3-Compatible Storage with spark

2015-07-19 Thread Schmirr Wurst
amazon, is there >> a way I can specify the host somewhere ? >> >> - >> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org >> For additional commands, e-mail: user-h...@spark.apache.org >> > -

Re: Java 8 lambdas

2015-08-18 Thread Sean Owen
spark-easier-to-use-in-java-with-java-8 > > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > -

Re: Spark standalone hangup during shuffle flatMap or explode in cluster

2015-10-07 Thread Sean Owen
rt dropping. I am attching > the logs > Saif > > > > > > > ----- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org -----

Re: Multilabel Classification in spark

2015-05-05 Thread DB Tsai
e-spark-user-list.1001560.n3.nabble.com/Multilabel-Classification-in-spark-tp22775.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > ----- > To unsubscribe, e-mail: user-unsubscr

Re: SparkSQL: How to specify replication factor on the persisted parquet files?

2015-06-07 Thread Cheng Lian
ggestions are appreciated very much! - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org - To unsubscribe, e-mail: us

[ANNOUNCE] Announcing Spark 1.4

2015-06-11 Thread Patrick Wendell
lease! [1] http://spark.apache.org/releases/spark-release-1-4-0.html [2] http://spark.apache.org/downloads.html - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org

Re: MLlib + Streaming

2014-12-23 Thread Xiangrui Meng
We have streaming linear regression (since v1.1) and k-means (v1.2) in MLlib. You can check the user guide: http://spark.apache.org/docs/latest/mllib-linear-methods.html#streaming-linear-regression http://spark.apache.org/docs/latest/mllib-clustering.html#streaming-clustering -Xiangrui On Tue

Re: Elastic allocation(spark.dynamicAllocation.enabled) results in task never being executed.

2015-01-04 Thread Tsuyoshi Ozawa
ults-in-task-never-being-executed-tp18969p20957.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional comma

Re: spark with cdh 5.2.1

2015-01-30 Thread Sean Owen
k? > > Mohit. > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For addit

Re: Upgrade to Spark 1.2.1 using Guava

2015-02-27 Thread Pat Ferrel
rk. -- Marcelo - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional

Re: SparkSQL, executing an "OR"

2015-03-03 Thread Guillermo Ortiz
> val teenagers = people.where('age >= 10 'or 'age <= 4).where('age <= > 19).select('name) > > I have tried different ways and I didn't get it. > > -

Re: Reading a text file into RDD[Char] instead of RDD[String]

2015-03-19 Thread Sean Owen
----- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > ----- To unsubscrib

Re: Serialization error

2015-04-28 Thread madhvi
org.apache.accumulo.core.data.Key) This is due to the 'key' class of accumulo which does not implement serializable interface.How it can be solved and accumulo can be used with spark Thanks Madhvi --

Re: Spark Screencast doesn't show in Chrome on OS X

2014-08-25 Thread Michael Hausenblas
> https://spark.apache.org/screencasts/1-first-steps-with-spark.html > > The embedded YouTube video shows up in Safari on OS X but not in Chrome. I’m using Chrome 36.0.1985.143 on MacOS 10.9.4 and it it works like a charm for me. Cheers, Michael -- Michael Hausenbla

Re: transforming a Map object to RDD

2014-08-28 Thread Sean Owen
l > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > --

Re: Accuracy hit in classification with Spark

2014-09-15 Thread Xiangrui Meng
html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org >

Re: Stable spark streaming app

2014-09-17 Thread Soumitra Kumar
er of nodes, events per second, broad stream processing workflow, config highlights etc? Thanks, Tim - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-ma

Re: K-means faster on Mahout then on Spark

2014-09-25 Thread Xiangrui Meng
List mailing list archive at Nabble.com. > > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > - To unsu

Re: Spark LIBLINEAR

2014-10-24 Thread DB Tsai
> -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Spark-LIBLINEAR-tp5546p17240.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > - > To unsu

Re: Spark 1.1.0 on Hive 0.13.1

2014-10-29 Thread arthur.hk.c...@gmail.com
0.1.3.1 be available? >> >> Regards >> Arthur >> >> - >> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org >> For additional commands, e-mail: user-h...@spark.apache.org >> > -

Re: Matrix multiplication in spark

2014-11-05 Thread Xiangrui Meng
2562p18164.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional co

Re: using LogisticRegressionWithSGD.train in Python crashes with "Broken pipe"

2014-11-13 Thread Davies Liu
User List mailing list archive at Nabble.com. > > ----- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > ----- T

Re: Cores on Master

2014-11-18 Thread Pat Ferrel
be, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org

Re: How to insert complex types like map> in spark sql

2014-11-25 Thread Cheng Lian
archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org - To unsubscri

RE: A proposal for Spark 2.0

2015-11-12 Thread Ulanov, Alexander
regards, Alexander From: Nan Zhu [mailto:zhunanmcg...@gmail.com] Sent: Thursday, November 12, 2015 7:28 AM To: wi...@qq.com Cc: dev@spark.apache.org Subject: Re: A proposal for Spark 2.0 Being specific to Parameter Server, I think the current agreement is that PS shall exist as a third-party library

Re: [VOTE] Release Apache Spark 1.4.0 (RC4)

2015-06-05 Thread Kousuke Saruta
sing this package as Apache Spark 1.4.0! >>>> >>>> The vote is open until Saturday, June 06, at 05:00 UTC and passes >>>> if a majority of at least 3 +1 PMC votes are cast. >>>>

[beam-site] 05/11: Blog post updates based on @iemejia's feedback

2018-08-20 Thread mergebot-role
aming-connectors.md index aa19675..fded813 100644 --- a/src/_posts/2018-08-XX-review-input-streaming-connectors.md +++ b/src/_posts/2018-08-XX-review-input-streaming-connectors.md @@ -21,7 +21,7 @@ Spark is written in Scala and has a [Java API](https://spark.apache.org/docs/lat Spark offer

[spark] branch master updated: [MINOR][DOCS] Fix some links for python api doc

2020-03-25 Thread gurwls223
follows casting rules to :class:`pyspark.sql.types.DateType` if the format is omitted. Equivalent to ``col.cast("date")``. -.. _datetime pattern: https://spark.apache.org/docs/latest/sql-ref-datetime-pattern.html - >>> df = spark.createDataFrame([('1997

[spark] branch branch-3.0 updated: [MINOR][DOCS] Fix some links for python api doc

2020-03-25 Thread gurwls223
/functions.py +++ b/python/pyspark/sql/functions.py @@ -1143,8 +1143,6 @@ def to_date(col, format=None): By default, it follows casting rules to :class:`pyspark.sql.types.DateType` if the format is omitted. Equivalent to ``col.cast("date")``. -.. _datetime pattern: https://spark.

[spark] branch master updated: [SPARK-42446][DOCS][PYTHON] Updating PySpark documentation to enhance usability

2023-02-15 Thread gurwls223
source/user_guide/index.rst b/python/docs/source/user_guide/index.rst index 5cc8bc3d38e..67f8c8d4d0f 100644 --- a/python/docs/source/user_guide/index.rst +++ b/python/docs/source/user_guide/index.rst @@ -16,21 +16,12 @@ under the License. -== -User Guide -== - -There are basic gu

[spark] branch branch-3.4 updated: [SPARK-42446][DOCS][PYTHON] Updating PySpark documentation to enhance usability

2023-02-15 Thread gurwls223
e/index.rst @@ -16,21 +16,12 @@ under the License. -== -User Guide -== - -There are basic guides shared with other languages in Programming Guides -at `the Spark documentation <https://spark.apache.org/docs/latest/index.html#where-to-go-from-here>`_ as below: - -- `RDD

Re: HDFS small file generation problem

2015-10-03 Thread Jörn Franke
olas > > > - Mail original - > De: "Jörn Franke" > À: nib...@free.fr, "Brett Antonides" > Cc: user@spark.apache.org > Envoyé: Samedi 3 Octobre 2015 11:17:51 > Objet: Re: HDFS small file generation problem > > > > You can update data

Re: HDFS small file generation problem

2015-10-03 Thread Jörn Franke
re still updatable. > > Tks to confirm if it can be solution for my use case. Or any other idea.. > > Thanks a lot ! > Nicolas > > > - Mail original - > De: "Jörn Franke" > À: nib...@free.fr, "Brett Antonides" > Cc: user@spark.apache.org >

RE : Re: HDFS small file generation problem

2015-10-03 Thread nibiau
firm if it can be solution for my use case. Or any other idea.. Thanks a lot ! Nicolas - Mail original - De: "Jörn Franke" À: nib...@free.fr, "Brett Antonides" Cc: user@spark.apache.org Envoyé: Samedi 3 Octobre 2015 11:17:51 Objet: Re: HDFS small file generation pro

Re: Spark + Kinesis

2015-04-03 Thread Kelly, Jonathan
"uber jar". They all must be in there because they are not part of the Spark distribution in your cluster. However, as I mentioned before, I think making this change might cause you to run into the same problems I spoke of in the thread I linked below (https://www.mail-archive.com/u

RE: Announcing Spark 1.1.0!

2014-09-11 Thread Haopu Wang
Got it, thank you, Denny! From: Denny Lee [mailto:denny.g@gmail.com] Sent: Friday, September 12, 2014 11:04 AM To: user@spark.apache.org; Haopu Wang; d...@spark.apache.org; Patrick Wendell Subject: RE: Announcing Spark 1.1.0! Yes, atleast for my query

Re: Using CUDA within Spark / boosting linear algebra

2015-03-12 Thread Shivaram Venkataraman
s, however I am > not sure I understand in details how to build this and will appreciate any > help from you ☺ > > From: Sam Halliday [mailto:sam.halli...@gmail.com] > Sent: Monday, March 09, 2015 6:01 PM > To: Ulanov, Alexander > Cc: dev@spark.apache.org; Xiangrui Meng; Joseph B

RE: Using CUDA within Spark / boosting linear algebra

2015-03-24 Thread Ulanov, Alexander
100x100 to 12000x12000 Could you suggest might the LD_PRELOAD not affect Spark shell? Best regards, Alexander From: Sam Halliday [mailto:sam.halli...@gmail.com] Sent: Monday, March 09, 2015 6:01 PM To: Ulanov, Alexander Cc: dev@spark.apache.org; Xiangrui Meng; Joseph Bradley; Evan R. Sparks

Re: [VOTE] Release Apache Spark 1.1.1 (RC1)

2014-11-17 Thread Patrick Wendell
sure we can address them down the road. [1] https://spark.apache.org/releases/spark-release-1-1-0.html On Mon, Nov 17, 2014 at 2:04 PM, Kevin Markey wrote: > +0 (non-binding) > > Compiled Spark, recompiled and ran application with 1.1.1 RC1 with Yarn, > plain-vanilla Hadoop 2.3.

Re: Does Spark automatically run different stages concurrently when possible?

2015-01-20 Thread Mark Hamstra
age-task-td13083.html >> >> >>> From: so...@cloudera.com >>> Date: Tue, 20 Jan 2015 10:02:20 + >>> Subject: Re: Does Spark automatically run different stages concurrently >>> when possible? >>> To: paliwalash...@gmail.com >>> CC: davidkl

Re: acquire and give back resources dynamically

2014-08-16 Thread fireflyc
http://spark.apache.org/docs/latest/running-on-yarn.html Spark just a Yarn application > 在 2014年8月14日,11:12,牛兆捷 写道: > > Dear all: > > Does spark can acquire resources from and give back resources to > YARN dynamically ? > > >

[GitHub] spark issue #20254: [SPARK-23062][SQL] Improve EXCEPT documentation

2018-01-16 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/20254 @henryr Since Spark 2.3, Spark SQL documents all the behavior changes in [Migration Guides](https://spark.apache.org/docs/latest/sql-programming-guide.html#migration-guide). Hopefully, this can

[GitHub] spark issue #21207: SPARK-24136: Fix MemoryStreamDataReader.next to skip sle...

2018-05-03 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/21207 @arunmahadevan, not a big deal but mind if I ask to fix the PR title to `[SPARK-24136][SS] blabla`? It's actually encouraged in the guide - https://spark.apache.org/contributing

[GitHub] spark issue #21264: Branch 2.2

2018-05-07 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/21264 @yotingting, mind closing this and open an issue at JIRA or asking it to mailing list please? I think you can have a better answer there. Please check out https://spark.apache.org

[GitHub] spark issue #21162: shaded guava is not used anywhere, seems guava is not sh...

2018-05-13 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/21162 CC @vanzin but it's more complex than that as far as I know. It is still shaded. You need to read https://spark.apache.org/contributing

[GitHub] spark issue #21767: SPARK-24804 There are duplicate words in the test title ...

2018-07-18 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/21767 yeah, please avoid PRs that are this trivial, it's just not worth the overhead. But I merged it this time. Also please read https://spark.apache.org/contributing

[GitHub] spark issue #21828: Update regression.py

2018-07-22 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/21828 @woodthom2, if you have some plans to update this PR quite soon, please see https://spark.apache.org/contributing.html and proceed. Otherwise, I would suggest to leave this closed so that

[GitHub] spark issue #22116: Update configuration.md

2018-08-15 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/22116 @KraFusion, mind double checking if there's same instance and fixing the PR title to reflect the change? Also should be good to read https://spark.apache.org/contributing.html even though

Re: I want to unsubscribe

2016-04-05 Thread Jakob Odersky
to unsubscribe, send an email to user-unsubscr...@spark.apache.org On Tue, Apr 5, 2016 at 4:50 PM, Ranjana Rajendran wrote: > I get to see the threads in the public mailing list. I don;t want so many > messages in my inbox. I want to unsub

Re: udf StructField to JSON String

2016-03-11 Thread Tristan Nixon
Have you looked at DataFrame.write.json( path )? https://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameWriter > On Mar 11, 2016, at 7:15 AM, Caires Vinicius wrote: > > I have one DataFrame with nested StructField and I want to convert to JSON > S

RE: unsubscribe

2014-03-11 Thread Kapil Malik
Ohh ! I thought you're unsubscribing :) Kapil Malik | kma...@adobe.com | 33430 / 8800836581 -Original Message- From: Matei Zaharia [mailto:matei.zaha...@gmail.com] Sent: 12 March 2014 00:51 To: user@spark.apache.org Subject: Re: unsubscribe To unsubscribe from this list, p

Re: [VOTE] Release Apache Spark 1.1.1 (RC2)

2014-11-20 Thread Matei Zaharia
are signed with the following key: > >>>> https://people.apache.org/keys/committer/andrewor14.asc > >>>> <https://people.apache.org/keys/committer/andrewor14.asc> > >>>> > >>>> The staging repository for this release can be found at: >

FW: Websphere MQ as a data source for Apache Spark Streaming

2015-05-29 Thread Chaudhary, Umesh
running in local mode but not able to save it as text file. Is there any other way for saving streaming data? From: Chaudhary, Umesh Sent: Tuesday, May 26, 2015 2:39 AM To: 'Arush Kharbanda'; user@spark.apache.org Subject: RE: Websphere MQ as a data source for Apache Spark Streaming Than

RE: Announcing Spark 1.1.0!

2014-09-11 Thread Haopu Wang
From the web page (https://spark.apache.org/docs/latest/building-with-maven.html) which is pointed out by you, it’s saying “Because HDFS is not protocol-compatible across versions, if you want to read from HDFS, you’ll need to build Spark against the specific HDFS version in your environment

Re: [VOTE] Release Apache Spark 1.3.1

2015-04-07 Thread Josh Rosen
1080 >>> >>> The documentation corresponding to this release can be found at: >>> http://people.apache.org/~pwendell/spark-1.3.1-rc1-docs/ >>> >>> Please vote on releasing this package as Apache Spark 1.3.1! >>> >>> The vo

[jira] [Updated] (FLINK-27237) Partitioned table statement enhancement

2022-04-15 Thread dalongliu (Jira)
://spark.apache.org/docs/3.2.1/sql-ref-syntax-ddl-alter-table.html#add-partition|https://spark.apache.org/docs/3.0.0/sql-ref-syntax-ddl-alter-table.html#add-partition] [4]: [https://spark.apache.org/docs/3.2.1/sql-ref-syntax-aux-show-partitions.html] was: This is an umbrella issue which is used

Re: reduceByKey as Action or Transformation

2016-04-25 Thread Weiping Qu
zily executed or not. As far as I saw from my codes, the reduceByKey will be executed without any operations in the Action category. Please correct me if I am wrong. Thanks, Regards, Weiping On 25.04.2016 17 :20, Chadha Pooja wrote: Reduce By Key is a Transformati

RE: Problem with WINDOW functions?

2015-12-29 Thread Cheng, Hao
Which version are you using? Have you tried the 1.6? From: Vadim Tkachenko [mailto:apache...@gmail.com] Sent: Wednesday, December 30, 2015 10:17 AM To: Cheng, Hao Cc: user@spark.apache.org Subject: Re: Problem with WINDOW functions? When I allocate 200g to executor, it is able to make better

Re: Spark on Apache Ingnite?

2016-01-06 Thread Ravi Kora
...@gmail.com>> Date: Tuesday, January 5, 2016 at 11:47 PM To: "n...@reactor8.com<mailto:n...@reactor8.com>" mailto:n...@reactor8.com>> Cc: "user@spark.apache.org<mailto:user@spark.apache.org>" mailto:user@spark.apache.org>> Subject: RE: Spark on Apache Ingn

spark git commit: [MINOR][DOCS] Remove Apache Spark Wiki address

2016-12-10 Thread srowen
in `README.md` and `docs/index.md`, too. These two lines are the last occurrence of that links. ``` All current wiki content has been merged into pages at http://spark.apache.org as of November 2016. Each page links to the new location of its information on the Spark web site. Obsolete wiki content

spark git commit: [MINOR][DOCS] Remove Apache Spark Wiki address

2016-12-10 Thread srowen
in `README.md` and `docs/index.md`, too. These two lines are the last occurrence of that links. ``` All current wiki content has been merged into pages at http://spark.apache.org as of November 2016. Each page links to the new location of its information on the Spark web site. Obsolete wiki content

Re: Master build fails ?

2015-11-03 Thread Jacek Laskowski
4: >> not found: value HashCodes >> [error] val cookie = >> HashCodes.fromBytes(secret).toString() >> [error] ^ >> >> >> >> >> -- >> Best Regards >> >>

Re: [VOTE] Release Apache Spark 1.1.0 (RC2)

2014-08-28 Thread Timothy Chen
elease. > > - Original Message - > From: "Patrick Wendell" > To: dev@spark.apache.org > Sent: Thursday, August 28, 2014 8:32:11 PM > Subject: Re: [VOTE] Release Apache Spark 1.1.0 (RC2) > > I'll kick off the vote with a +1. > > On Thu, Aug 28, 2014 at 7:14 PM,

Re: [VOTE] Release Apache Spark 1.1.0 (RC2)

2014-08-28 Thread Cheng Lian
8, 2014 at 8:53 PM, Burak Yavuz wrote: > > +1. Tested MLlib algorithms on Amazon EC2, algorithms show speed-ups > between 1.5-5x compared to the 1.0.2 release. > > > > - Original Message - > > From: "Patrick Wendell" > > To: dev@spark.apache.org &

[jira] [Updated] (SPARK-35030) ANSI SQL compliance

2021-04-12 Thread Gengliang Wang (Jira)
://spark.apache.org/docs/latest/sql-ref-ansi-compliance.html|https://spark.apache.org/docs/latest/sql-ref-ansi-compliance.html]. Note that some ANSI dialect features maybe not from the ANSI SQL standard directly, but their behaviors align with ANSI SQL's style. was: Build an ANSI comp

Re: HDFS small file generation problem

2015-10-02 Thread nibiau
Ok thanks, but can I also update data instead of insert data ? - Mail original - De: "Brett Antonides" À: user@spark.apache.org Envoyé: Vendredi 2 Octobre 2015 18:18:18 Objet: Re: HDFS small file generation problem I had a very similar problem and solved it with Hi

Re: Any NLP library for sentiment analysis in Spark?

2017-04-11 Thread Jayant Shekhar
.ja...@heliase.com<mailto:gabriel.ja...@heliase.com>> >> Date: Tuesday, April 11, 2017 at 2:13 PM >> To: 'Kevin Wang' mailto:buz...@gmail.com>>, 'Alonso >> Isidoro Roman' mailto:alons...@gmail.com>> >> Cc: 'Gaurav1809' mailto:g

Re: Spark or Storm

2015-06-17 Thread Michael Segel
ail.com > <mailto:asoni.le...@gmail.com> wrote: > > Hi All, > > I am evaluating spark VS storm ( spark streaming ) and i am not able to see > what is equivalent of Bolt in storm inside spark. > > Any help will be appreciated on this ? > > Thanks , &g

<    10   11   12   13   14   15   16   17   18   19   >