9.spark连接mysql数据库
spark连接mysql数据库
1.安装启动检查Mysql服务。
netstat -tunlp (3306)

2.spark 连接mysql驱动程序。
–cp /usr/local/hive/lib/mysql-connector-java-5.1.40-bin.jar /usr/local/spark/jars

3.启动 Mysql shell,新建数据库spark,表student。
select * from student;
|
1
2
3
4
5
6
|
create database spark;use spark;create table student (id int(4), name char(20), gender char(4), age int(4));insert into student values(1,'Xueqian','F',23);insert into student values(2,'Weiliang','M',24);select * from student; |

4.spark读取MySQL数据库中的数据
spark.read.format("jdbc").option("url", "jdbc:mysql://localhost:3306/spark?useSSL=false") ... .load()
|
1
2
|
jdbcDF = spark.read.format("jdbc").option("url", "jdbc:mysql://localhost:3306/spark").option("driver","com.mysql.jdbc.Driver").option("dbtable", "student").option("user", "root").option("password", "a123").load()jdbcDF.show() |

|
1
|
spark.read.format("jdbc").option("url", "jdbc:mysql://localhost:3306/spark?useSSL=false").option("driver","com.mysql.jdbc.Driver").option("dbtable", "student").option("user", "root").option("password", "a123").load().show() |

5.spark向MySQL数据库写入数据
studentDF.write.format(‘jdbc’).option(…).mode(‘append’).save()
|
1
2
3
4
5
6
7
8
9
10
11
12
13
14
|
from pyspark.sql.types import Rowfrom pyspark.sql.types import StructTypefrom pyspark.sql.types import StructFieldfrom pyspark.sql.types import StringTypefrom pyspark.sql.types import IntegerTypestudentRDD = spark.sparkContext.parallelize(["3 Rongcheng M 26","4 Guanhua M 27"]).map(lambda line : line.split(" "))schema = StructType([StructField("name", StringType(), True),StructField("gender", StringType(), True),StructField("age",IntegerType(), True)])rowRDD = studentRDD.map(lambda p : Row(p[1].strip(), p[2].strip(),int(p[3])))studentDF = spark.createDataFrame(rowRDD, schema)prop = {}prop['user'] = 'root'prop['password'] = 'a123'prop['driver'] = "com.mysql.jdbc.Driver"studentDF.write.jdbc("jdbc:mysql://localhost:3306/spark",'student','append', prop) |



浙公网安备 33010602011771号