Running Python programs on Hadoop

Published: 2023/11/27 | python | 豆豆

Data source:

http://www.nber.org/patents/acite75_99.zip

First, upload the test data to HDFS and list the input directory to confirm it is there:

[root@localhost:/usr/local/hadoop/hadoop-0.19.2]# bin/hadoop fs -ls /user/root/test-in
Found 5 items
-rw-r--r--   1 root supergroup        101 2010-10-24 14:39 /user/root/test-in/NOTICE.txt
-rw-r--r--   1 root supergroup       1366 2010-10-24 14:39 /user/root/test-in/README.txt
-rw-r--r--   1 root supergroup  264075431 2010-10-24 19:23 /user/root/test-in/cite75_99.txt
-rw-r--r--   1 root supergroup         22 2010-10-24 14:39 /user/root/test-in/file1.txt
-rw-r--r--   1 root supergroup         28 2010-10-24 14:39 /user/root/test-in/file2.txt

2. Create the Python program (RandomSample.py)

#!/usr/bin/env python
# Mapper: emit roughly N percent of the input lines, where N is the first argument.
import random
import sys

percent = int(sys.argv[1])
for line in sys.stdin:
    if random.randint(1, 100) <= percent:
        print(line.strip())
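Before submitting the job, the sampling rule can be tried locally. The following is a minimal sketch, not part of the original post: the function name `sample_lines`, the seeded generator, and the fake records are all assumptions made for illustration. It applies the same `randint(1, 100) <= percent` test as the mapper to an in-memory list.

```python
import random


def sample_lines(lines, percent, rng=None):
    """Yield each stripped line with a probability of roughly percent/100."""
    rng = rng or random.Random()
    for line in lines:
        # Same test as the mapper: keep the line when a draw from 1..100
        # falls at or below the requested percentage.
        if rng.randint(1, 100) <= percent:
            yield line.strip()


if __name__ == "__main__":
    # Hypothetical input: 10,000 made-up "citing,cited" records.
    records = ["%d,%d\n" % (i, i + 1) for i in range(10000)]
    kept = list(sample_lines(records, 10, random.Random(42)))
    print("kept %d of %d lines" % (len(kept), len(records)))
```

Because the selection is random, the number of kept lines only approximates the requested percentage and changes from run to run unless the generator is seeded.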

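Run the script as the mapper of a Hadoop Streaming job. The argument 10 asks for roughly 10% of the input lines:

[root@localhost:/usr/local/hadoop/hadoop-0.19.2]# bin/hadoop jar contrib/streaming/hadoop-0.19.2-streaming.jar -input test-in/cite75_99.txt -output testoutput -mapper 'RandomSample.py 10' -file RandomSample.py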
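When the job has to be launched from a script, the same command line can be assembled in Python. This is a sketch under stated assumptions: the helper `build_streaming_command` is hypothetical, and it uses only the options that appear in the command above (`-input`, `-output`, `-mapper`, `-file`), with the Hadoop home directory taken from the shell prompt.

```python
def build_streaming_command(hadoop_home, input_path, output_path, script, percent):
    """Return the argument list for the streaming job (hypothetical helper)."""
    streaming_jar = "contrib/streaming/hadoop-0.19.2-streaming.jar"
    return [
        hadoop_home + "/bin/hadoop", "jar", hadoop_home + "/" + streaming_jar,
        "-input", input_path,
        "-output", output_path,
        # The mapper string carries the script name and its percentage argument.
        "-mapper", "%s %d" % (script, percent),
        # -file ships the local script along with the job.
        "-file", script,
    ]


if __name__ == "__main__":
    cmd = build_streaming_command(
        "/usr/local/hadoop/hadoop-0.19.2",
        "test-in/cite75_99.txt", "testoutput", "RandomSample.py", 10)
    print(" ".join(cmd))
```

On a machine with Hadoop installed, the returned list could be passed to `subprocess.call`; the sketch only prints it so that it runs anywhere.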

[root@localhost:/usr/local/hadoop/hadoop-0.19.2]# bin/hadoop fs -ls /user/root/
Found 4 items
drwxr-xr-x   - root supergroup          0 2010-10-24 19:25 /user/root/output
drwxr-xr-x   - root supergroup          0 2010-10-24 19:23 /user/root/test-in
drwxr-xr-x   - root supergroup          0 2010-10-24 14:41 /user/root/test-out
drwxr-xr-x   - root supergroup          0 2010-10-24 22:12 /user/root/testoutput

[root@localhost:/usr/local/hadoop/hadoop-0.19.2]# bin/hadoop fs -ls /user/root/testoutput
Found 2 items
drwxr-xr-x   - root supergroup          0 2010-10-24 22:08 /user/root/testoutput/_logs
-rw-r--r--   1 root supergroup   28075087 2010-10-24 22:12 /user/root/testoutput/part-00000

[root@localhost:/usr/local/hadoop/hadoop-0.19.2]# bin/hadoop fs -cat /user/root/testoutput/part-00000 | head
3858242,3319261
3858243,3156927
3858243,3681785
3858243,3684611
3858248,3641592
3858253,2331472
3858254,2869143
3858256,3413665
3858262,3557750
3858264,3530488
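A quick sanity check on the result is to compare the file sizes reported by the two listings. The sketch below uses only the byte counts shown above. Note that byte size is just a proxy for line count here, so the ratio is expected to be near, not exactly at, the requested 10%.

```python
input_bytes = 264075431   # size of cite75_99.txt in the input listing
output_bytes = 28075087   # size of part-00000 in the output listing

# Fraction of the input that survived sampling, measured in bytes.
ratio = float(output_bytes) / input_bytes
print("output is %.2f%% of the input by size" % (ratio * 100))
```

The ratio comes out at roughly 10.6%, which is in line with sampling about one line in ten.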

