Hadoop: Sqoop untuk Import Export Hive ke MySQL

From OnnoWiki
Jump to navigation Jump to search

Sumber: http://doctuts.readthedocs.org/en/latest/sqoop.html


SQOOP INSTALLATION

This section refers to the installation settings of Sqoop. INTRODUCTION

   Sqoop is a tool designed to transfer data between Hadoop and relational databases.
   You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce, and then export the data back into an RDBMS. Sqoop automates most of this process, relying on the database to describe the schema for the data to be imported. Sqoop uses MapReduce to import and export the data, which provides parallel operation as well as fault tolerance. This document describes how to get started using Sqoop to move data between databases and Hadoop and provides reference information for the operation of the Sqoop command-line tool suite.
Sqoop.png


Stable release and Download

Sqoop is an open source software product of the Apache Software Foundation. Sqoop source code is held in the Apache Git repository. Prerequisites

Before we can use Sqoop, a release of Hadoop must be installed and con?gured. Sqoop is currently supporting 4 major Hadoop releases - 0.20, 0.23, 1.0 and 2.0. We have installed Hadoop 2.2.0 and it is compatible with sqoop 1.4.4.We are using a Linux environment Ubuntu 12.04 to install and run sqoop. The basic familiarity with the purpose and operation of Hadoop is required to use this product. Installation

To install the sqoop 1.4.4 we followed the given sequence of steps :

   Download the sqoop-1.4.4.bin_hadoop-1.0.0.tar.gz file from www.apache.org/dyn/closer.cgl/sqoop/1.4.4
   Unzip the tar file: sudo tar -zxvf sqoop-1.4.4.bin hadoop1.0.0.tar.gz
   Move sqoop-1.4.4.bin hadoop1.0.0 to sqoop using command
   user@ubuntu:~$ sudo mv sqoop 1.4.4.bin hadoop1.0.0 /usr/lib/sqoop
   Create a directory sqoop in usr/lib using command
   user@ubuntu:~$ sudo mkdir /usr/lib/sqoop
   Go to the zipped folder sqoop-1.4.4.bin_hadoop-1.0.0 and run the command
   user@ubuntu:~sudo mv ./* /usr/lib/sqoop
   Go to root directory using cd command
   user@ubuntu:~$ cd
   Open .bashrc file using
   user@ubuntu:~$ sudo gedit ~/.bashrc
   Add the following lines
   export SQOOP_HOME=¡usr/lib/sqoop
   export PATH=$PATH:$SQOOP_HOME/bin
   To check if the sqoop has been installed successfully type the command
   sqoop version



Referensi