使用decode无法将%23解析为#,代码如下:
val rdd = spark.sparkContext.parallelize(List(“1104319%231104317”,“1104319%231104318”,“1104317”))
import spark.implicits._
val rs = rdd.map(People(_)).toDF()
rs.select(decode($"name","utf-8")).show(false)
结果如下:
±------------------+
|decode(name, utf-8)|
±------------------+
|1104319%231104317 |
|1104319%231104318 |
|1104317 |
±------------------+