
If you need scalability and high availability without compromising performance, then the Apache Cassandra Database is the right choice for you. Cassandra is a distributed database; initially developed by Facebook, it later on came under the Apache forum. In this post, we will be cover basic topics as an...

In this blog we will discuss about most widely used file format in Hadoop ecosystem Parquet Parquet, an open source file format for Hadoop. Parquet stores nested data structures in a flat columnar format .Compared to a traditional approach where data is stored in row oriented approach, parquet is...

What is Ambari ? The Apache’s Ambari is a Hadoop management tool aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its restful APIs (Definition given by Apache Ambari) How...