Classic Hive Window Function Examples (A Step-by-Step Walkthrough)

Published: 2023/12/14 by 豆豆

Create the table:

create table t_window(name string,orderdate date,cost int ) row format delimited fields terminated by ',';

The order.csv file (one record per line):

jack,2015-01-01,10
tony,2015-01-02,15
jack,2015-02-03,23
tony,2015-01-04,29
jack,2015-01-05,46
jack,2015-04-06,42
tony,2015-01-07,50
jack,2015-01-08,55
mart,2015-04-08,62
mart,2015-04-09,68
neil,2015-05-10,12
mart,2015-04-11,75
neil,2015-06-12,80
mart,2015-04-13,94

load data local inpath '/opt/tmp/order.csv' into table t_window;

Window function operations:

1. List the customers who made purchases in April 2015, together with the customer count
select distinct name,count(*) over() from t_window where substring(orderdate,1,7)='2015-04';


2. Show the total purchase amount
select name,orderdate,cost,sum(cost) over() from t_window;

3. Show the total amount for each month
select name,orderdate,cost,sum(cost) over(partition by month(orderdate)) from t_window;
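
To check what queries 2 and 3 should return for the sample data, the two sums can be reproduced outside Hive. The following is a minimal Python sketch rather than Hive code. It assumes the table holds exactly the rows of order.csv above, and the names ORDERS, grand_total and month_totals are made up for illustration.

```python
from collections import defaultdict

# Rows of order.csv as (name, orderdate, cost); assumed to match the table contents.
ORDERS = [
    ("jack", "2015-01-01", 10), ("tony", "2015-01-02", 15),
    ("jack", "2015-02-03", 23), ("tony", "2015-01-04", 29),
    ("jack", "2015-01-05", 46), ("jack", "2015-04-06", 42),
    ("tony", "2015-01-07", 50), ("jack", "2015-01-08", 55),
    ("mart", "2015-04-08", 62), ("mart", "2015-04-09", 68),
    ("neil", "2015-05-10", 12), ("mart", "2015-04-11", 75),
    ("neil", "2015-06-12", 80), ("mart", "2015-04-13", 94),
]

# Query 2: sum(cost) over() attaches the same grand total to every row.
grand_total = sum(cost for _, _, cost in ORDERS)

# Query 3: sum(cost) over(partition by month(orderdate)) attaches
# the total of the row's month to every row.
month_totals = defaultdict(int)
for _, orderdate, cost in ORDERS:
    month_totals[int(orderdate[5:7])] += cost

rows_with_month_total = [
    (name, orderdate, cost, month_totals[int(orderdate[5:7])])
    for name, orderdate, cost in ORDERS
]

print(grand_total)         # 661
print(dict(month_totals))  # {1: 205, 2: 23, 4: 341, 5: 12, 6: 80}
```

Unlike group by, neither query collapses rows: all 14 rows come back, each carrying the aggregate of its window.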

4. Combined exercise

select name, orderdate, cost,
  sum(cost) over() as sample1, -- sum over all rows
  sum(cost) over(partition by name) as sample2, -- partition by name, sum within each partition
  sum(cost) over(partition by name order by orderdate) as sample3, -- partition by name, running total within each partition
  sum(cost) over(partition by name order by orderdate rows between UNBOUNDED PRECEDING and current row) as sample4, -- same result as sample3: aggregate from the partition start to the current row
  sum(cost) over(partition by name order by orderdate rows between 1 PRECEDING and current row) as sample5, -- current row plus the row before it
  sum(cost) over(partition by name order by orderdate rows between 1 PRECEDING AND 1 FOLLOWING) as sample6, -- current row plus one row before and one row after
  sum(cost) over(partition by name order by orderdate rows between current row and UNBOUNDED FOLLOWING) as sample7 -- current row and all rows after it
from t_window;
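
The frame clauses are easier to follow with concrete numbers. Below is a small Python sketch that emulates the rows between ... frames for jack's partition only. The helper frame_sum is hypothetical and not part of Hive; it simply sums a slice of the ordered costs.

```python
def frame_sum(values, i, preceding, following):
    """Sum the frame around row i; None stands for UNBOUNDED."""
    start = 0 if preceding is None else max(0, i - preceding)
    end = len(values) if following is None else min(len(values), i + following + 1)
    return sum(values[start:end])

# jack's costs ordered by orderdate:
# 2015-01-01, 2015-01-05, 2015-01-08, 2015-02-03, 2015-04-06
jack_costs = [10, 46, 55, 23, 42]
idx = range(len(jack_costs))

sample4 = [frame_sum(jack_costs, i, None, 0) for i in idx]  # unbounded preceding .. current row
sample5 = [frame_sum(jack_costs, i, 1, 0) for i in idx]     # 1 preceding .. current row
sample6 = [frame_sum(jack_costs, i, 1, 1) for i in idx]     # 1 preceding .. 1 following
sample7 = [frame_sum(jack_costs, i, 0, None) for i in idx]  # current row .. unbounded following

print(sample4)  # [10, 56, 111, 134, 176]
print(sample5)  # [10, 56, 101, 78, 65]
print(sample6)  # [56, 111, 124, 120, 65]
print(sample7)  # [176, 166, 120, 65, 42]
```

For jack, sample2 is 176 on every row, and sample3 matches sample4 because no two of his rows share an orderdate.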


5. The ntile function (splits rows into buckets). When running this, it is best to try each expression on its own.

select name, orderdate, cost,
  ntile(3) over() as sample1,
  ntile(3) over(partition by name) as sample2,
  ntile(2) over(partition by month(orderdate)) as sample3,
  ntile(3) over(partition by name order by cost desc) as sample4
from t_window;
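
As a rough illustration of how ntile(n) spreads rows over buckets, here is a Python sketch. It assumes the usual SQL rule that bucket sizes differ by at most one and that the lower-numbered buckets take the extra rows. The function ntile below is an emulation written for this article, not a Hive API.

```python
def ntile(num_rows, buckets):
    """Return the bucket number for each of num_rows ordered rows."""
    base, extra = divmod(num_rows, buckets)
    result = []
    for bucket in range(1, buckets + 1):
        size = base + (1 if bucket <= extra else 0)  # the first `extra` buckets get one more row
        result.extend([bucket] * size)
    return result

# ntile(3) over(partition by name order by cost desc)
jack = ntile(5, 3)  # costs 55, 46, 42, 23, 10
mart = ntile(4, 3)  # costs 94, 75, 68, 62
tony = ntile(3, 3)  # costs 50, 29, 15
neil = ntile(2, 3)  # costs 80, 12

print(jack)  # [1, 1, 2, 2, 3]
print(mart)  # [1, 1, 2, 3]
print(tony)  # [1, 2, 3]
print(neil)  # [1, 2]
```

When the over() clause has no order by, the bucket sizes follow the same rule, but which row lands in which bucket is not guaranteed.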


6. Ranking functions (row_number, rank, dense_rank)

select name, orderdate, cost,
  row_number() over() as r1,
  row_number() over(order by name) as r2,
  rank() over(order by name) as r3,
  dense_rank() over(order by name) as r4
from t_window;
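
The difference between the three ranking functions shows up as soon as there are ties. The Python sketch below reproduces r2, r3 and r4 for the sample data, where order by name produces runs of equal names. It emulates the semantics and is not Hive code.

```python
# Names in the sample data, sorted as order by name would sort them.
names = sorted(["jack"] * 5 + ["mart"] * 4 + ["neil"] * 2 + ["tony"] * 3)

row_number, rank, dense_rank = [], [], []
for position, name in enumerate(names, start=1):
    row_number.append(position)  # always increases, ties are broken arbitrarily
    if position > 1 and name == names[position - 2]:
        # A tie with the previous row keeps both rank values.
        rank.append(rank[-1])
        dense_rank.append(dense_rank[-1])
    else:
        rank.append(position)  # rank jumps past the tied rows
        dense_rank.append(dense_rank[-1] + 1 if dense_rank else 1)  # dense_rank leaves no gaps

print(sorted(set(zip(names, rank, dense_rank))))
# [('jack', 1, 1), ('mart', 6, 2), ('neil', 10, 3), ('tony', 12, 4)]
```

So rank() jumps from 1 to 6 after jack's five rows, while dense_rank() moves from 1 to 2.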


7. The lag and lead functions

select name, orderdate, cost,
  lag(orderdate, 1) over(partition by name order by orderdate) as sample1,
  lag(orderdate, 1, '1999-10-02') over(partition by name order by orderdate) as sample2,
  lead(orderdate, 1, '1999-10-02') over(partition by name order by orderdate) as sample3
from t_window;
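
To make the role of the offset and the default value concrete, here is a Python sketch of lag and lead applied to jack's partition. The two helper functions are written for this illustration. In Hive a missing default yields NULL, which is represented here as None.

```python
def lag(values, offset=1, default=None):
    """Value `offset` rows before each row, or `default` when there is none."""
    return [values[i - offset] if i - offset >= 0 else default
            for i in range(len(values))]

def lead(values, offset=1, default=None):
    """Value `offset` rows after each row, or `default` when there is none."""
    return [values[i + offset] if i + offset < len(values) else default
            for i in range(len(values))]

# jack's order dates, sorted by orderdate
jack_dates = ["2015-01-01", "2015-01-05", "2015-01-08", "2015-02-03", "2015-04-06"]

sample1 = lag(jack_dates, 1)                 # first row has no predecessor -> None (NULL)
sample2 = lag(jack_dates, 1, "1999-10-02")   # first row falls back to the default
sample3 = lead(jack_dates, 1, "1999-10-02")  # last row falls back to the default

print(sample1)  # [None, '2015-01-01', '2015-01-05', '2015-01-08', '2015-02-03']
print(sample2)  # ['1999-10-02', '2015-01-01', '2015-01-05', '2015-01-08', '2015-02-03']
print(sample3)  # ['2015-01-05', '2015-01-08', '2015-02-03', '2015-04-06', '1999-10-02']
```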


8. The first_value and last_value functions

select name, orderdate, cost,
  first_value(orderdate) over(partition by name order by orderdate) as first_date,
  last_value(orderdate) over(partition by name order by orderdate) as last_date
from t_window;
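
One detail is easy to miss here. When over() contains order by but no explicit frame, the frame defaults to range between unbounded preceding and current row, so last_value sees only the rows up to the current one. The Python sketch below emulates this for jack's partition, which has no duplicate dates. It illustrates the semantics and is not Hive code.

```python
# jack's order dates, sorted by orderdate (no ties)
jack_dates = ["2015-01-01", "2015-01-05", "2015-01-08", "2015-02-03", "2015-04-06"]
idx = range(len(jack_dates))

# Default frame: from the partition start up to the current row.
first_default = [jack_dates[: i + 1][0] for i in idx]
last_default = [jack_dates[: i + 1][-1] for i in idx]

# Explicit frame covering the whole partition.
last_full = [jack_dates[-1] for _ in idx]

print(first_default)  # ['2015-01-01', '2015-01-01', '2015-01-01', '2015-01-01', '2015-01-01']
print(last_default == jack_dates)  # True: last_value just returns the current row's date
print(last_full)      # ['2015-04-06', '2015-04-06', '2015-04-06', '2015-04-06', '2015-04-06']
```

To get each customer's final order date on every row, the frame can be widened with rows between unbounded preceding and unbounded following inside the over() clause.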
