Hadoop setup tutorial, part 5 (installing Pig)
1. Extract and install
Extract pig-0.10.0.tar.gz into the /opt/ directory, rename the result to /opt/pig, and hand it over to the hadoop user:
tar -zxvf pig-0.10.0.tar.gz -C /opt/
mv /opt/pig-0.10.0 /opt/pig
chown -R hadoop:hadoop /opt/pig
su hadoop
2. Configure the /opt/pig/bin/pig file
Open the pig file. Its header comment lists the environment variables the script reads:
#
# The Pig command script
#
# Environment Variables
#
#     JAVA_HOME                  The java implementation to use. Overrides JAVA_HOME.
#
#     PIG_CLASSPATH              Extra Java CLASSPATH entries.
#
#     HADOOP_HOME/HADOOP_PREFIX  Environment HADOOP_HOME/HADOOP_PREFIX(0.20.205)
#
#     HADOOP_CONF_DIR            Hadoop conf dir
#
#     PIG_HEAPSIZE               The maximum amount of heap to use, in MB.
#                                Default is 1000.
#
#     PIG_OPTS                   Extra Java runtime options.
#
#     PIG_CONF_DIR               Alternate conf dir. Default is ${PIG_HOME}/conf.
#
#     HBASE_CONF_DIR - Optionally, the HBase configuration to run against
#
Add the following settings directly below that comment block:
export JAVA_HOME=/usr/java/jdk/
export PIG_INSTALL=/opt/pig
export HADOOP_INSTALL=/opt/hadoop
export PATH=$PIG_INSTALL/bin:$HADOOP_INSTALL/bin:$PATH
export PIG_CLASSPATH=$HADOOP_INSTALL/conf
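Before moving on, it can help to confirm that the directories named in these exports actually exist. The following is a minimal sketch, not part of Pig itself: `check_pig_env` is a hypothetical helper name, and it only inspects the three variables set above.

```shell
# Hypothetical helper (not shipped with Pig): report which of the variables
# exported above are unset or point to a directory that does not exist.
check_pig_env() {
    rc=0
    for name in JAVA_HOME PIG_INSTALL HADOOP_INSTALL; do
        # Indirect lookup: read the value of the variable whose name is in $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "$name is not set"
            rc=1
        elif [ ! -d "$value" ]; then
            echo "$name points to a missing directory: $value"
            rc=1
        fi
    done
    return $rc
}
```

Calling `check_pig_env` in the shell where the exports were made prints one line per problem and returns a non-zero status if any were found.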
3. Run
Copy the test data into HDFS, then start Pig with the following commands:
cd /opt/hadoop/bin
./hadoop fs -copyFromLocal /opt/data/test.txt /opt/data/test.txt
cd /opt/pig/bin
./pig
Example: extract the user names and store them in dist.txt (entered at the Grunt prompt that ./pig opens):
A = LOAD '/opt/data/test.txt' USING PigStorage('\t') AS (id,name);
dump A;
B = FOREACH A GENERATE name;
STORE B INTO '/opt/data/dist.txt' USING PigStorage();
cd /opt/hadoop/bin
./hadoop fs -ls /opt/data
./hadoop fs -ls /opt/data/dist.txt
./hadoop fs -cat /opt/data/dist.txt/part-m-00000
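To sanity-check what the stored output should look like, the same projection can be reproduced locally without Hadoop. This is only a rough sketch: the sample rows are made up for illustration, and `project_names` is a hypothetical helper that stands in for `B = FOREACH A GENERATE name;` by keeping the second tab-separated column.

```shell
# Build a small tab-separated sample (id<TAB>name), the layout the LOAD
# statement above assumes. The rows are made-up illustration data.
sample=$(mktemp)
printf '1\talice\n2\tbob\n3\tcarol\n' > "$sample"

# Local stand-in for "B = FOREACH A GENERATE name;": keep only column 2.
# cut splits on tabs by default, matching PigStorage('\t').
project_names() {
    cut -f 2 "$1"
}

project_names "$sample"
```

With this sample the function prints alice, bob, and carol on separate lines, which is the shape the part-m-00000 file should have for the real data.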
Common Pig Latin commands:
LOAD ...... USING PigStorage('') ...... AS ......;
FOREACH ...... GENERATE ......;
FILTER ...... BY ......;
DUMP ......;
STORE ...... INTO ......;
GROUP ...... BY ......;
AND, OR (logical operators for combining conditions, for example in FILTER)
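As an illustration of how these commands fit together, the sketch below writes a small Pig Latin script to a temporary file from the shell. The field types (`id:int`, `name:chararray`), the filter condition, and the relation names are assumptions chosen for the example, not taken from the original data set.

```shell
# Write an example Pig Latin script to a temporary file.
# The quoted 'EOF' keeps \t and other characters literal.
script=$(mktemp)
cat > "$script" <<'EOF'
A = LOAD '/opt/data/test.txt' USING PigStorage('\t') AS (id:int, name:chararray);
B = FILTER A BY id > 1 AND name IS NOT NULL;
C = GROUP B BY name;
D = FOREACH C GENERATE group, COUNT(B);
DUMP D;
EOF
echo "Pig script written to $script"

# Once Pig is on PATH and the input file is in HDFS, the script can be run with:
#   pig "$script"
```

The script loads the same file as in step 3, keeps rows whose id is greater than 1 and whose name is present, groups them by name, and prints how many rows share each name.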