What is Ambari ?
The Apache’s Ambari is a Hadoop management tool aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters.
Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its restful APIs (Definition given by Apache Ambari)
How it works?
- Ambari server will be running on one machine in the cluster.
- Ambari agent will be running on all machines in the cluster.
- Ambari agent sends data collected from all machines such as resource utilization (Memory, storage, CPU etc.)
- Data will be stored in a relational database which will act as repository
- Ambari server consolidates this information and provides capabilities such ASN alerts, dashboards etc.
How to set it up?
1- Install Ambari server on one of the masters
2- Set up database if you want it to be in different host than where Amabri server is running.
3- Setup Amabari server
4- Start Amabari server
5- Enable Ambari server to start on reboots
6- Validate by going to URL (http://hostname>:8080)
Ambari have two components:
- Ambari Server
- Ambari Agent
Steps for Ambari Installation
Here we are going to cover Ambari cluster setup. We are assuming it to have 3 nodes – Node1, Node2, and Node3. We will make Node1 as our Ambari server and rest will be ambari-agent.
Installation steps for the RHEL based system (CentOS, Redhat, Fedora):
Step1: Download and Install Ambari repository –
Come to Ambari-Server Node (Node 1 as we have decided earlier)
Download Ambari public repository –
1 |
sudo wget http://public-repo-1.hortonworks.com/ambari/centos7/2.x/updates/2.2.1.0/ambari.repo |
Copy this repository to yum repository-
1 |
cp ambari.repo /etc/yum.repos.d/ |
Install the epel repository –
1 |
yum install epel-release |
Step 2: Next Install Ambari-server and PostgreSQL –
1 |
yum install ambari-server -y |
Step 3: Configure Ambari Server-
Run the Ambari- Server setup
1 |
ambari-server setup |
Configure Ambari-server according to your need.
If you have not disabled SELinux, you may get a warning → Enter y to continue. If you have not temporarily disabled iptables, the setup will do it for you.
PostgreSQL is configured by the process. When you are prompted to enter Advanced Database Configuration, enter n to set up the default username and password: ambari-server/bigdata.
To use your own username and password, enter y (prefer default option for first time).
Step 4: Start Ambari – Server
1 |
ambari -server start |
Now you have started the Ambari-server to see the User Interface (UI) of Ambari-server open web browser and type the IP address:port number(192.168.10.69:8080) or hostname of Linux Server.
Default username and password is admin.
Now you can Set-up your Hadoop cluster
Login to Ambari User Interface using Ambari-server credentials. You will get a screen like below –
In the next blog (Part II), We will learn how to set-up and deploy a Hadoop Cluster using Ambari.
Hope this blog was informative. For queries, you can contact us at support@acadgild.com
Leave a Reply