Thursday, March 17, 2016

Coudera CDH installation using remote repo

Hadoop.CDH.Installation.RemoteRepository
1.     Overview
This topic describes how to create a remote RPM packages/parcels repository and direct hosts in your Cloudera Manager deployment to use that repository.

Once you have created a parcels repository, go to Configuring the Cloudera Manager Server to Use the Parcel URL. After completing these steps, you have established the environment required to install a previous version of Cloudera Manager or install Cloudera Manager to hosts that are not connected to the Internet. Proceed with the installation process, being sure to target the newly created repository.

2.     Creating a Permanent Remote Repository
The repository is typically hosted using HTTP on a host inside your network. If you already have a web server in your organization, you can reuse it and put the parcel files into it.

Below are the detailed steps to setup a permanent remote repository:

1.      Logon the server you want to setup the web server and run below commands to install Apache httpd web server
yum install httpd
systemctl start httpd
systemctl enable httpd
·    RPM Packages
 Download the RPM packages for your OS distribution from:
 Move the RPM packages files to the web server directory, and modify file permissions
mkdir -p /var/www/html/cdh5/packages
tar -xvf cm5.*-centos7.tar.gz -C /var/www/html/cdh5/parcels
chmod -R ugo+rX /var/www/html/cdh5  (might not require)
3.      After moving the files and changing permissions, visit http://hostname:80/cdh5/parcels to verify that you can access the RPM packages. Apache may have been configured to not show indexes, which is also acceptable.

·    Parcels
a.      Download the parcel and manifest.json files for your OS distribution from:
·         CDH 5 - Impala, Spark, and Search are included in the CDH parcel
·         Accumulo - - https://archive.cloudera.com/accumulo-c5/parcels/
·         GPL Extras - https://archive.cloudera.com/gplextras5/parcels/
b.     Move the .parcel and manifest.json files to the web server directory, and modify file permissions
mkdir -p /var/www/html/cdh5/parcels
mv CDH-5.*-el7.parcel /var/www/html/cdh5/parcels
mv manifest.json /var/www/html/cdh5/parcels
chmod -R ugo+rX /var/www/html/cdh5  (might not require)
·         After moving the files and changing permissions, visit http://hostname:80/cdh5/parcels to verify that you can access the parcel. Apache may have been configured to not show indexes, which is also acceptable.

3.     Configuring the Cloudera Manager Server to Use the Parcel URL
1.      Use one of the following methods to open the parcel settings page:
·         Navigation bar
·   
     
·         Click the Configuration button.
·         Menu
·         Select Administration > Settings
·         Select Category > Parcels
2.      In the Remote Parcel Repository URLs list, click  to open an additional row.
3.      Enter the path to the parcel. For example, http://hostname:port/cdh5/parcels/.
4.      Click Save Changes to commit the changes.



4.     Reference
Creating and Using a Remote Parcel Repository for Cloudera Manager