Проблемы с входными данными Hadoop

Question

Проблемы с входными данными Hadoop

У меня проблемы с функциями карты: исходные данные хранятся в файле tsv: я просто хочу сохранить последние два столбца: первый - исходный узел (383), второй - цель (4575), третий - вес (1)

383 4575 1

383 4764 1

383 5458 1

383 5491 1

         public void map(LongWritable key, Text value,OutputCollector output, Reporter reporter) throws IOException {
        String line = value.toString();
        String[] tokens = line.split("t");

        int weight = Integer.parseInt(tokens[2]);
        int target = Integer.parseInt(tokens[0]);

    }

Вот мой код:

public void map(LongWritable key, Text value, Context context) throws IOException  InterruptedException
  {
  String line = value.toString();
  //split  the tsv file 
  String[] tokens = line.split("/t");
  //save the weight and target    
  private Text target = Integer.parsetxt(tokens[0]);
  int weight = Integer.parseInt(tokens[2]);
  context.write(new Text(target), new Intwritable(weight) );
  }
}


public class Reduce extends Reducer<Text, IntWritable, Text, IntWritable> 
{
@Override
public void reduce(Text key, Iterable<IntWritable> values,Context context)
throws IOException, InterruptedException 
{
 //initialize the  count variable
    int weightsum = 0;
       for (IntWritable value : values) {
        weightsum += value.get();
 }
         context.write(key, new IntWritable(weightsum));
  }

}

0

hadoop hadoop-streaming

Источник

user4406733 23 мар '15 в 04:17

2 ответа

Другие вопросы по тегам hadoop hadoop-streaming

user4581745 23 мар '15 в 04:27 2015-03-23 04:27 · Answer 1 · 2015-03-23 04:27

String[] tokens = line.split("t");

должно быть

String[] tokens = line.split("\t");

0

Источник

user4581745 23 мар '15 в 04:27

user3840088 23 мар '15 в 05:48 2015-03-23 05:48 · Answer 2 · 2015-03-23 05:48

Разделить пробелами.

    String[] tokens = line.split("\\s+");

    private Text target = Integer.parsetxt(tokens[1]);
    int weight = Integer.parseInt(tokens[2]);

0

Источник

user3840088 23 мар '15 в 05:48