I am using a Cloudera CDH3 cluster in pseudo-distributed mode. The Pig version in CDH3 is 0.8.
I want to read a CSV or Excel file using a Pig script.
I downloaded piggybank-0.11.0.jar and placed it in the /home/cloudera/ directory.
My CSV file looks like this:

    id,name,city
    100,surrender,chennai
    101,raja,chennai

My Pig script is below:

    register '/home/cloudera/piggybank-0.11.0.jar';
    a = load '/user/cloudera/inputfiles/sample_rec.csv' using CSVExcelStorage(',') as (id:int,name:chararray,city:chararray);
    b = dump a;

But I am getting the error below:

    ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1070: Could not resolve CSVExcelStorage using imports: [, org.apache.pig.builtin., org.apache.pig.impl.builtin.]

Do I need to download the piggybank jar for Pig version 0.8?
What is wrong here? Is it possible to read a CSV file in Pig 0.8?
Specify the complete package name when using CSVExcelStorage():

    using org.apache.pig.piggybank.storage.CSVExcelStorage() ...

Other checks:
- Unjar the piggybank jar and check whether it actually contains the CSVExcelStorage class.
- "," is the default delimiter for CSVExcelStorage, so you need not specify it.
Another alternative is to use CSVLoader:

    a = load 'a.csv' using org.apache.pig.piggybank.storage.CSVLoader() as (f1,f2,f3);

Ref: http://pig.apache.org/docs/r0.8.1/api/org/apache/pig/piggybank/storage/csvloader.html
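
Applying the first suggestion to the script from the question might look like the sketch below. The jar path, input path, and schema are copied from the question. Whether the 0.11.0 piggybank jar works cleanly with Pig 0.8 is not established here, so treat this as something to try rather than a confirmed fix.

```pig
-- Sketch only: paths and schema are taken from the question, not verified.
-- To check the jar first, run this from a terminal (not inside Grunt):
--   jar tf /home/cloudera/piggybank-0.11.0.jar | grep CSVExcelStorage
REGISTER '/home/cloudera/piggybank-0.11.0.jar';

-- Use the fully qualified class name so Pig does not have to resolve it
-- through its default imports. "," is the default delimiter, so no
-- argument is passed.
a = LOAD '/user/cloudera/inputfiles/sample_rec.csv'
    USING org.apache.pig.piggybank.storage.CSVExcelStorage()
    AS (id:int, name:chararray, city:chararray);

-- DUMP is a standalone statement; it is not assigned to an alias.
DUMP a;
```

If the file really starts with an id,name,city header line, that line will presumably be read as an ordinary record, and its id field would likely come out null because "id" cannot be cast to int.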