apache pig - Reading a CSV File in Pig -


i using cloudera cdh3 pseudo mode cluster. in cdh3 pig version 0.8

i read csv or excel file using pig script

i downloaded piggybank-0.11.0.jar , kept inside /home/cloudera/ directory

my csv file this..

id    name       city 100   surrender  chennai 101   raja       chennai 

my pig script below

register '/home/cloudera/piggybank-0.11.0.jar';  = load '/user/cloudera/inputfiles/sample_rec.csv' using csvexcelstorage(',') (id:int,name:chararray,city:chararray); b = dump a; 

but getting below error

error org.apache.pig.tools.grunt.grunt - error 1070: not resolve csvexcelstorage using imports: [, org.apache.pig.builtin., org.apache.pig.impl.builtin. 

do need download piggbank jar pig 0.8 version?

what wrong here? possible read csv file in pig 0.8 version?

specify complete package name while using csvexcelstorage() :

using org.apache.pig.piggybank.storage.csvexcelstorage() ... 

other checks :

  1. unjar , see if having csvexcelstorage class.

  2. "," default delimiter csvexcelstorage, need not specify same.

other alternative make use of csvloader

 = load 'a.csv' using org.apache.pig.piggybank.storage.csvloader() (f1,f2,f3); 

ref : http://pig.apache.org/docs/r0.8.1/api/org/apache/pig/piggybank/storage/csvloader.html


Comments