Abhishek Reddy Chamakura created SPARK-24623:
------------------------------------------------

             Summary: Hadoop - Spark Cluster - Python XGBoost - Not working in distributed mode
                 Key: SPARK-24623
                 URL: https://issues.apache.org/jira/browse/SPARK-24623
             Project: Spark
          Issue Type: Bug
          Components: Deploy
    Affects Versions: 2.1.1
         Environment: Hadoop - Hortonworks Cluster
                      Total Nodes - 18
                      Worker Nodes - 13
            Reporter: Abhishek Reddy Chamakura


Hi


We recently installed Python on the Hadoop cluster along with a number of data 
science Python modules, including xgboost, scipy, scikit-learn, and pandas.
Using PySpark, the data scientists are able to test their scoring models in 
distributed mode on the Hadoop cluster. With Python xgboost, however, the 
PySpark job is not distributed: it tries to run on only one instance.
We are trying to achieve distributed execution when using Python xgboost via 
PySpark.
It would be a great help if you could direct me on how to achieve this.

Thanks,
Abhishek



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
