
Hadoop Distributed Cache


The purpose of Hadoop's distributed cache is to let every MapReduce task share a single common configuration (lookup) file. The cache file is first placed in HDFS; during job execution the framework then copies it to each task node's local disk. The setup in the driver looks like this:

public static void main(String[] args) throws IOException, ClassNotFoundException, InterruptedException {

    Configuration conf = new Configuration();
    conf.set("fs.default.name", "hdfs://192.168.1.45:9000");
    FileSystem fs = FileSystem.get(conf);
    // clean up the output directory of an earlier run (recursive delete)
    fs.delete(new Path("CASICJNJP/gongda/Test_gd20140104"), true);

    conf.set("mapred.job.tracker", "192.168.1.45:9001");
    conf.set("mapred.jar", "/home/hadoop/workspace/jar/OBDDataSelectWithImeiTxt.jar");
    Job job = new Job(conf, "myTaxiAnalyze");

    // create symlinks to the cached files in each task's working directory
    DistributedCache.createSymlink(job.getConfiguration());
    try {
        // register the HDFS file as a distributed-cache file
        DistributedCache.addCacheFile(new URI("/user/hadoop/CASICJNJP/DistributeFiles/imei.txt"),
                job.getConfiguration());
    } catch (URISyntaxException e1) {
        e1.printStackTrace();
    }

    job.setMapperClass(OBDDataSelectMaper.class);
    job.setReducerClass(OBDDataSelectReducer.class);
    //job.setNumReduceTasks(10);
    //job.setCombinerClass(IntSumReducer.class);
    job.setMapOutputKeyClass(Text.class);
    job.setMapOutputValueClass(Text.class);

    FileInputFormat.addInputPath(job, new Path("/user/hadoop/CASICJNJP/SortedData/20140104"));
    FileOutputFormat.setOutputPath(job, new Path("CASICJNJP/gongda/SelectedData"));

    System.exit(job.waitForCompletion(true) ? 0 : 1);
}

In the code above, the DistributedCache.addCacheFile call registers the HDFS file /user/hadoop/CASICJNJP/DistributeFiles/imei.txt as a distributed-cache file.
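
Note that in Hadoop 2.x and later the DistributedCache class is deprecated and the same functionality is exposed directly on Job. The following is a minimal sketch, not part of the original article, of what the driver could look like with that newer API. The class name CacheDriverSketch and the "#imei" fragment (which names the symlink each task sees in its working directory) are illustrative assumptions; the paths and the mapper/reducer classes are taken from the code above.

import java.net.URI;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class CacheDriverSketch {  // hypothetical class name, not from the original article

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "myTaxiAnalyze");

        // Register the HDFS file as a cache file. The "#imei" fragment is an added
        // assumption: it names the symlink created in each task's working directory.
        job.addCacheFile(new URI("/user/hadoop/CASICJNJP/DistributeFiles/imei.txt#imei"));

        job.setMapperClass(OBDDataSelectMaper.class);
        job.setReducerClass(OBDDataSelectReducer.class);
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(Text.class);

        FileInputFormat.addInputPath(job, new Path("/user/hadoop/CASICJNJP/SortedData/20140104"));
        FileOutputFormat.setOutputPath(job, new Path("CASICJNJP/gongda/SelectedData"));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}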


import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.hadoop.filecache.DistributedCache;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public class OBDDataSelectMaper extends Mapper<Object, Text, Text, Text> {

    // IMEIs loaded from the distributed-cache file; only records whose IMEI is in this list are kept
    private List<Integer> imeiList = new ArrayList<Integer>();

    @Override
    protected void setup(Context context) throws IOException, InterruptedException {
        try {
            // locate the local copies of the distributed-cache files on this task node
            Path[] cacheFiles = DistributedCache.getLocalCacheFiles(context.getConfiguration());
            if (cacheFiles != null && cacheFiles.length > 0) {
                BufferedReader br = new BufferedReader(new FileReader(cacheFiles[0].toString()));
                try {
                    String line;
                    while ((line = br.readLine()) != null) {
                        imeiList.add(Integer.parseInt(line));
                    }
                } finally {
                    br.close();
                }
            }
        } catch (IOException e) {
            System.err.println("Exception reading DistributedCache: " + e);
        }
    }

    @Override
    public void map(Object key, Text value, Context context) throws IOException, InterruptedException {
        try {
            String[] strs = value.toString().split("\t");
            String[] imeiTimes = strs[0].split("_");
            String imei = imeiTimes[0];
            // keep only records whose IMEI appears in the cached list
            if (imeiList.contains(Integer.parseInt(imei))) {
                context.write(new Text(strs[0]), value);
            }
        } catch (Exception ex) {
            // skip malformed records
        }
    }
}

In the mapper above, the distributed-cache file is loaded in the setup method, so the IMEI list is read once per task rather than once per record.
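
With the newer API the mapper side can skip DistributedCache entirely. Below is a minimal sketch under the same assumptions as the driver sketch above (Hadoop 2.x+, cache file registered with a "#imei" fragment); the class name CacheMapperSketch is hypothetical, error handling is omitted for brevity, and the filtering logic mirrors OBDDataSelectMaper.

import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Hypothetical Hadoop 2.x+ counterpart of OBDDataSelectMaper: same IMEI filter,
// but the cache file is opened through the "imei" symlink registered by the
// driver sketch above (an added assumption, not the original article's code).
public class CacheMapperSketch extends Mapper<Object, Text, Text, Text> {

    private final List<Integer> imeiList = new ArrayList<Integer>();

    @Override
    protected void setup(Context context) throws IOException, InterruptedException {
        // The registered URIs are also available via context.getCacheFiles();
        // here the localized file is simply opened by its symlink name.
        BufferedReader br = new BufferedReader(new FileReader("imei"));
        try {
            String line;
            while ((line = br.readLine()) != null) {
                imeiList.add(Integer.parseInt(line.trim()));
            }
        } finally {
            br.close();
        }
    }

    @Override
    protected void map(Object key, Text value, Context context)
            throws IOException, InterruptedException {
        String[] fields = value.toString().split("\t");
        String imei = fields[0].split("_")[0];
        if (imeiList.contains(Integer.parseInt(imei))) {
            context.write(new Text(fields[0]), value);
        }
    }
}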

Reposted from: https://www.cnblogs.com/mfryf/p/5360306.html
