scala+spark把某列string类型转换dataFrame的日期,array<double>,array<string>格式

result = resdf.withColumn("Date", to_date(col("Date"), "yyyy-MM-dd")).\
withColumn("arrayDouble", regexp_replace(col("arrayDouble"), "\\]", "")).\
withColumn("arrayDouble", regexp_replace(col("arrayDouble"), "\\[", "")).\
withColumn("arrayDouble", functions.split(col("arrayDouble"), ",")).\
withColumn("arrayDouble", col("arrayDouble").cast("array<double>")). \
withColumn("arrayString", functions.split(col("arrayString"), ",")).\
withColumn("arrayString", col("arrayString").cast("array<string>"))
arrayDouble
posted @ 2023-01-29 15:14  ivyJ  阅读(348)  评论(0)    收藏  举报