
In this post, we discuss how to configure the replication factor and block size for the entire cluster, as well as for individual directories and files, in HDFS. The Hadoop Distributed File System (HDFS) stores files as blocks and distributes them across the entire cluster. As HDFS was designed to be fault-tolerant...

If you need scalability and high availability without compromising performance, then Apache Cassandra is the right choice for you. Cassandra is a distributed database; initially developed by Facebook, it later became an Apache project. In this post, we will cover basic topics as an...

What is Ambari? Apache Ambari is a Hadoop management tool aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its RESTful APIs (definition given by Apache Ambari). How...
In our previous video blog, we learned the difference between vertical and horizontal scaling architectures and why a horizontal scaling architecture is suited to storing and processing the large datasets of Big Data. In this video blog, we continue by discussing how the Hadoop framework works and...
Big Data has taken the IT world by storm. If you’re thinking about switching to the Big Data domain or taking the first step in your career with Big Data, then we believe that you have made the best decision of your career. Here’s a guide to Big Data...
This video is part of a short discussion of a few customer case studies where Hadoop can be used as a solution. These case studies show how Hadoop is involved in various sectors. Case Study 1: Children’s Hospital Los Angeles Industry...
This video is part of a short discussion of the problems faced with Big Data. Common challenges include real-time analysis, traditional storage, processing, and computation. Hadoop is an open-source framework for processing Big Data. These days, there are many Hadoop distributions...