a.m*_*ssa 5 scala apache-spark
正如我在标题中所说,对于以下两种情况,我想知道如何删除 Spark 字符串列的第一个字符:
\n\nval myDF1 = Seq(("\xc2\xa314326"),("\xc2\xa31258634"),("\xc2\xa315626"),("\xc2\xa3163262")).toDF("A")\nval myDF2 = Seq(("a14326"),("c1258634"),("t15626"),("f163262")).toDF("A")\nmyDF1.show\nmyDF2.show\n\n+--------+\n| A|\n+--------+\n|\xc2\xa314326 |\n|\xc2\xa31258634|\n|\xc2\xa315626 |\n|\xc2\xa3163262 |\n+--------+\n\n+--------+\n| A |\n+--------+\n|a14326 |\n|c1258634|\n|t15626 |\n|f163262 |\n+--------+\nRun Code Online (Sandbox Code Playgroud)\n\n我想获得:
\n\n+--------+-------+\n| A| B|\n+--------+-------+\n|\xc2\xa314326 | 14326|\n|\xc2\xa31258634|1258634|\n|\xc2\xa315626 | 15626|\n|\xc2\xa3163262 | 163262|\n+--------+-------+\n\n+--------+-------+\n| A| B|\n+--------+-------+\n|a14326 |14326 |\n|c1258634|1258634|\n|t15626 |15626 |\n|f163262 |163262 |\n+--------+-------+\nRun Code Online (Sandbox Code Playgroud)\n\n你有什么主意吗?
\nPiy*_*tel 11
你可以做这样的事情。
\n\nmyDF1.show\n+------+\n| A|\n+------+\n|\xc2\xa314326|\n|\xc2\xa312586|\n|\xc2\xa315626|\n|\xc2\xa316326|\n+------+\n\nmyDF1.withColumn("B", expr("substring(A, 2, length(A))")).show\n+------+-----+\n| A| B|\n+------+-----+\n|\xc2\xa314326|14326|\n|\xc2\xa312586|12586|\n|\xc2\xa315626|15626|\n|\xc2\xa316326|16326|\n+------+-----+\nRun Code Online (Sandbox Code Playgroud)\n
| 归档时间: |
|
| 查看次数: |
18695 次 |
| 最近记录: |