The Guava issue could be fixed in one of two ways:

- Use Hadoop v3.
- Create an uber jar (a configuration sketch follows below); refer to https://gite.lirmm.fr/yagoubi/spark/commit/c9f743957fa963bc1dbed7a44a346ffce1a45cf2 and "Managing Java dependencies for Apache Spark applications on Cloud Dataproc" on the Google Cloud Blog, which explains how to set up imported Java packages for Apache Spark on Cloud Dataproc to avoid conflicts.
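For the uber-jar route, the usual technique is to shade and relocate Guava with the maven-shade-plugin so the application's Guava classes cannot collide with the copy Hadoop puts on the classpath. A minimal pom.xml sketch, not taken from the thread; the plugin version and the relocated package prefix are illustrative choices:

    <plugin>
      <groupId>org.apache.maven.plugins</groupId>
      <artifactId>maven-shade-plugin</artifactId>
      <version>3.2.1</version>
      <executions>
        <execution>
          <phase>package</phase>
          <goals>
            <goal>shade</goal>
          </goals>
          <configuration>
            <relocations>
              <!-- Rewrite com.google.common.* references inside the uber jar
                   so they resolve to the bundled Guava, not Hadoop's. -->
              <relocation>
                <pattern>com.google.common</pattern>
                <shadedPattern>shaded.com.google.common</shadedPattern>
              </relocation>
            </relocations>
          </configuration>
        </execution>
      </executions>
    </plugin>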
On Thursday, December 5, 2019, 11:49:47 PM UTC, Ping Liu <pingpinga...@gmail.com> wrote:

Hi Deepak,

For Spark, I am using the master branch and just updated the code yesterday. For Guava, I actually deleted my old versions from the local Maven repo; the Spark build process automatically downloaded a few versions. The oldest version is 14.0.1. But even in 14.0.1 (https://guava.dev/releases/14.0.1/api/docs/com/google/common/base/Preconditions.html), Preconditions already requires a boolean as the first parameter:

    static void checkArgument(boolean expression, String errorMessageTemplate, Object... errorMessageArgs)

In newer Guava versions, all checkArgument() overloads likewise require a boolean as the first parameter.

For Docker, using EC2 is a good idea. Is there a document or guidance for it?

Thanks.

Ping

On Thu, Dec 5, 2019 at 3:30 PM Deepak Vohra <dvohr...@yahoo.com> wrote:

Such an exception can occur if a dependency version (most likely Guava's) is not supported by the Spark version. Which Spark and Guava versions are you using? Use a more recent Guava dependency in the Maven pom.xml.

Regarding Docker, a cloud platform instance such as EC2 could be used with Hyper-V support.

On Thursday, December 5, 2019, 10:51:59 PM UTC, Ping Liu <pingpinga...@gmail.com> wrote:

Hi Deepak,

Yes, I did use Maven. I even got the build to pass successfully when setting the Hadoop version to 3.2. Please see my response to Sean's email.

Unfortunately, I only have Docker Toolbox, as my Windows machine doesn't have Microsoft Hyper-V. So I want to avoid using Docker for major work if possible.

Thanks!

Ping

On Thu, Dec 5, 2019 at 2:24 PM Deepak Vohra <dvohr...@yahoo.com> wrote:

Several alternatives are available:

- Use Maven to build Spark on Windows: http://spark.apache.org/docs/latest/building-spark.html#apache-maven
- Use a Docker image for CDH on Windows (see Docker Hub).

On Thursday, December 5, 2019, 09:33:43 p.m. UTC, Sean Owen <sro...@gmail.com> wrote:

What was the build error? You didn't say. Are you sure it succeeded? Try running from the Spark home dir, not bin. I know we do run Windows tests and it appears to pass tests, etc.

On Thu, Dec 5, 2019 at 3:28 PM Ping Liu <pingpinga...@gmail.com> wrote:
>
> Hello,
>
> I understand Spark is preferably built on Linux. But I have a Windows
> machine with a slow VirtualBox for Linux. So I hope to be able to build
> and run Spark code in a Windows environment.
>
> Unfortunately,
>
> # Apache Hadoop 2.6.X
> ./build/mvn -Pyarn -DskipTests clean package
>
> # Apache Hadoop 2.7.X and later
> ./build/mvn -Pyarn -Phadoop-2.7 -Dhadoop.version=2.7.3 -DskipTests clean package
>
> Both are listed on
> http://spark.apache.org/docs/latest/building-spark.html#specifying-the-hadoop-version-and-enabling-yarn
>
> but neither works for me. (I stay directly under the Spark root directory and run
> "mvn -Pyarn -Phadoop-2.7 -Dhadoop.version=2.7.3 -DskipTests clean package".)
>
> Then I tried "mvn -Pyarn -Phadoop-3.2 -Dhadoop.version=3.2.1 -DskipTests clean package".
>
> Now the build works. But when I run spark-shell, I got the following error.
>
> D:\apache\spark\bin>spark-shell
> Exception in thread "main" java.lang.NoSuchMethodError: com.google.common.base.Preconditions.checkArgument(ZLjava/lang/String;Ljava/lang/Object;)V
>         at org.apache.hadoop.conf.Configuration.set(Configuration.java:1357)
>         at org.apache.hadoop.conf.Configuration.set(Configuration.java:1338)
>         at org.apache.spark.deploy.SparkHadoopUtil$.org$apache$spark$deploy$SparkHadoopUtil$$appendS3AndSparkHadoopHiveConfigurations(SparkHadoopUtil.scala:456)
>         at org.apache.spark.deploy.SparkHadoopUtil$.newConfiguration(SparkHadoopUtil.scala:427)
>         at org.apache.spark.deploy.SparkSubmit.$anonfun$prepareSubmitEnvironment$2(SparkSubmit.scala:342)
>         at org.apache.spark.deploy.SparkSubmit$$Lambda$132/817978763.apply(Unknown Source)
>         at scala.Option.getOrElse(Option.scala:189)
>         at org.apache.spark.deploy.SparkSubmit.prepareSubmitEnvironment(SparkSubmit.scala:342)
>         at org.apache.spark.deploy.SparkSubmit.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:871)
>         at org.apache.spark.deploy.SparkSubmit.doRunMain$1(SparkSubmit.scala:180)
>         at org.apache.spark.deploy.SparkSubmit.submit(SparkSubmit.scala:203)
>         at org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:90)
>         at org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:1007)
>         at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:1016)
>         at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
>
> Has anyone experienced building and running Spark source code successfully on
> Windows? Could you please share your experience?
>
> Thanks a lot!
>
> Ping

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org
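To make the failure mode in the thread concrete: newer Guava releases (20 and later) added exact-arity overloads such as checkArgument(boolean, String, Object) alongside the original varargs form, and Hadoop 3.x bytecode references that exact descriptor. If an older Guava (e.g. 14.0.1) ends up on the runtime classpath, the JVM cannot link the call even though the code is source-compatible with both versions. A minimal sketch, not from the thread; the class name and message are invented for illustration:

    // Demonstrates the binary incompatibility behind the spark-shell error.
    import com.google.common.base.Preconditions;

    public class GuavaLinkageDemo {
        public static void main(String[] args) {
            // Compiled against Guava >= 20, javac binds this call to the
            // exact-arity overload checkArgument(boolean, String, Object),
            // descriptor (ZLjava/lang/String;Ljava/lang/Object;)V.
            // Run against Guava 14.0.1, which only has the varargs overload
            // checkArgument(boolean, String, Object...), the JVM throws
            // java.lang.NoSuchMethodError at this call site -- the same
            // failure Hadoop's Configuration.set hits above.
            Preconditions.checkArgument(args.length > 0, "expected %s", "at least one argument");
        }
    }

This is why both suggested fixes work: building against Hadoop 3 with a matching newer Guava keeps compile-time and runtime versions consistent, while an uber jar with a relocated Guava removes the classpath conflict entirely.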