tweetStream.foreachRDD((rdd, time) => {
  val count = rdd.count()
  if (count > 0) {
    val fileName = outputDirectory + "/tweets_" + time.milliseconds.toString
    val outputRDD = rdd.repartition(partitionsEachInterval)
    outputRDD.saveAsTextFile(fileName)
  }
})
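I am trying to do the same thing in Python: check the count, or test for an empty RDD, in streaming data. I am having a hard time finding a way to do it, and I have also tried the examples from the following link. http://spark.apache.org/docs/latest/streaming-programming-guide.html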
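One possible Python counterpart of the Scala snippet is sketched below. It assumes a PySpark DStream, where foreachRDD can call a two-argument function as (time, rdd) with the batch time given as a datetime. The names save_if_nonempty, tweet_stream, output_directory and partitions_each_interval are hypothetical and only mirror the Scala variables. The sketch uses rdd.isEmpty() in place of count() > 0 because it only needs to find a first element.

```python
def save_if_nonempty(time, rdd, output_directory, partitions):
    """Write one batch to its own directory, skipping empty batches.

    Returns the output path, or None when nothing was written.
    """
    # isEmpty() stops at the first element, so it avoids a full count().
    if rdd.isEmpty():
        return None
    # Assumption: `time` is a datetime (as PySpark passes it); convert to epoch ms.
    millis = round(time.timestamp() * 1000)
    file_name = "{}/tweets_{}".format(output_directory, millis)
    rdd.repartition(partitions).saveAsTextFile(file_name)
    return file_name


# Hypothetical wiring, assuming tweet_stream is an existing DStream:
# tweet_stream.foreachRDD(
#     lambda time, rdd: save_if_nonempty(
#         time, rdd, output_directory, partitions_each_interval))
```

If the exact number of records is needed as well, rdd.count() can still be called inside the function, at the cost of a full pass over the batch.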
I tried the following syntaxes; none of them helped convert the string-type column to a date:
select INVC_,APIDT,APDDT from APAPP100 limit 10
select current_date, APIDT,APDDT from APAPP100 limit 10
select date_format( b.APIDT, '%Y-%m-%d') from APAPP100 b
select CAST( b.APIDT AS date) from APAPP100 b
select date(b.APIDT) from APAPP100 b
select convert(datetime, b.APIDT) from APAPP100 b
select date_parse(b.APIDT, '%Y-%m-%d') from APAPP100 b
select str_to_date(b.APIDT) from APAPP100 b