zl程序教程


Big Data Spark "Mushroom Cloud" Initiative, Lesson 47: Spark 2.0 Dataset in Practice: collect_list, collect_set, avg, sum, countDistinct, and More

Date: 2023-09-27 14:26:47


 

Dataset API:
 
people.json

{"name":"Michael", "age":16}
{"name":"Andy", "age":30}
{"name":"Justin", "age":19}
{"name":"Justin", "age":29}
{"name":"Michael", "age":46}
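
The records above can be grouped by `name` and summarized with the aggregate functions named in the title. What follows is only a minimal sketch under stated assumptions, not the lesson's original listing: the `Person` case class, the `DatasetAggregations` object, and the `aggregateByName` helper are hypothetical names introduced for illustration, and the data is built inline rather than read from `people.json` so that the example is self-contained.

```scala
import org.apache.spark.sql.{DataFrame, Dataset, SparkSession}
import org.apache.spark.sql.functions.{avg, col, collect_list, collect_set, countDistinct, sum}

// Hypothetical case class mirroring the two fields in people.json.
case class Person(name: String, age: Long)

object DatasetAggregations {

  // Groups by name and applies the aggregates from the lesson title.
  // Default column names (e.g. "sum(age)") are kept, so no aliases are set.
  def aggregateByName(people: Dataset[Person]): DataFrame =
    people.groupBy(col("name")).agg(
      sum(col("age")),            // total age per name
      avg(col("age")),            // mean age per name
      countDistinct(col("age")),  // number of distinct ages per name
      collect_list(col("age")),   // all ages, duplicates kept
      collect_set(col("age"))     // distinct ages only
    )

  def main(args: Array[String]): Unit = {
    // Local master is an assumption made for running the sketch on one machine.
    val spark = SparkSession.builder()
      .appName("DatasetAggregations")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Same five records as people.json, constructed inline.
    val people = Seq(
      Person("Michael", 16),
      Person("Andy", 30),
      Person("Justin", 19),
      Person("Justin", 29),
      Person("Michael", 46)
    ).toDS()

    aggregateByName(people).show()

    spark.stop()
  }
}
```

For this data, the sums per name are 62 for Michael, 30 for Andy, and 48 for Justin, with averages of 31.0, 30.0, and 24.0. The row order printed by `show()` and the element order inside the `collect_list` and `collect_set` arrays are not guaranteed.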


Output

 

16/09/17 22:22:15 INFO CodeGenerator: Code generated in 20.317672 ms
+-------+--------+--------+--------+--------+-------------------+--------+--------------+
|   name|sum(age)