Map the records below:
lopez,charlie,2002,11,21
parker,ward,1995,04,08
henderson,russell,2007,10,01
to
lopez,charlie,20021121
parker,ward,19950408
henderson,russell,20071001
DATA
#vi sample-data.txt
Add the lines below:
lopez,charlie,2002,11,21
parker,ward,1995,04,08
henderson,russell,2007,10,01
Then copy the file into HDFS, since the next step reads it from there:
#hadoop fs -put sample-data.txt sample-data.txt
EXECUTE MAP-REDUCE
#hadoop fs -cat sample-data.txt | awk -F"," '{ print $1","$2","$3$4$5 }' | hadoop fs -put - sample-data-coalesced.txt
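To check the awk step before touching HDFS, the same awk program can be run locally. This is only a sketch: it feeds the three sample records through printf instead of `hadoop fs -cat`, and the variable name `result` is just for illustration.

```shell
# Local dry run: same awk program as in the command above, no HDFS involved.
# -F"," splits each line on commas; $3$4$5 joins year, month and day
# without a separator, keeping leading zeros such as "04".
result=$(printf '%s\n' \
  'lopez,charlie,2002,11,21' \
  'parker,ward,1995,04,08' \
  'henderson,russell,2007,10,01' |
  awk -F"," '{ print $1","$2","$3$4$5 }')

# Print the transformed records for inspection.
echo "$result"
```

If the printed lines match the target format at the top of this post, the same awk program should behave the same way inside the HDFS pipeline.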
CHECK FILE IN HDFS
#hadoop fs -ls
DONE!
FROM : Anja Skrba