Thank you Bjorn Jorgensen and also thank to Sean Owen.
DataFrame and .format("jdbc") is good way to resolved it.
But in some reasons, i can't using DataFrame API, only can use RDD API in
PySpark.
...T_T...
thanks all you guys help. but still need new idea to resolve it. XD
[email protected]
发件人: Bjørn Jørgensen
发送时间: 2022-09-19 18:34
收件人: [email protected]
抄送: Xiao, Alton; [email protected]
主题: Re: 答复: [how to]RDD using JDBC data source in PySpark
https://www.projectpro.io/recipes/save-dataframe-mysql-pyspark
and
https://towardsdatascience.com/pyspark-mysql-tutorial-fa3f7c26dc7
man. 19. sep. 2022 kl. 12:29 skrev [email protected] <[email protected]>:
Thank you answer alton.
But i see that is use scala to implement it.
I know java/scala can get data from mysql using JDBCRDD farily well.
But i want to get same way in Python Spark.
Would you to give me more advice, very thanks to you.
[email protected]
发件人: Xiao, Alton
发送时间: 2022-09-19 18:04
收件人: [email protected]; [email protected]
主题: 答复: [how to]RDD using JDBC data source in PySpark
Hi javacaoyu:
https://hevodata.com/learn/spark-mysql/#Spark-MySQL-Integration
I think spark have already integrated mysql
发件人: [email protected] <[email protected]>
日期: 星期一, 2022年9月19日 17:53
收件人: [email protected] <[email protected]>
主题: [how to]RDD using JDBC data source in PySpark
你通常不会收到来自 [email protected] 的电子邮件。了解这一点为什么很重要
Hi guys:
Does have some way to let rdd can using jdbc data source in pyspark?
i want to get data from mysql, but in PySpark, there is not supported
JDBCRDD like java/scala.
and i search docs from web site, no answer.
So i need your guys help, Thank you very much.
[email protected]
--
Bjørn Jørgensen
Vestre Aspehaug 4, 6010 Ålesund
Norge
+47 480 94 297