hadoop – AcadGild https://acadgild.com/blog Learn. Do. Earn. Thu, 25 Aug 2016 12:01:42 +0000 en-US hourly 1 https://wordpress.org/?v=4.5.3 103159356 Frequently Asked Hadoop Interview Questions – Part 1 https://acadgild.com/blog/frequently-asked-hadoop-interview-questions-part-1/ https://acadgild.com/blog/frequently-asked-hadoop-interview-questions-part-1/#respond Wed, 17 Aug 2016 14:40:43 +0000 https://acadgild.com/blog/?p=15242 In this first Part of Hadoop interview Questions, we would be discussing various questions related to Big Data Hadoop Ecosystem. We have given relevant posts with most of the questions which you can refer for practical implementation. What are the different types of File formats in hive? Ans. Different file formats which Hive can handle […]

The post Frequently Asked Hadoop Interview Questions – Part 1 appeared first on AcadGild.

]]>
https://acadgild.com/blog/frequently-asked-hadoop-interview-questions-part-1/feed/ 0 15242
6 Undeniable Reasons to Learn Hadoop https://acadgild.com/blog/6-undeniable-reasons-learn-hadoop/ https://acadgild.com/blog/6-undeniable-reasons-learn-hadoop/#respond Fri, 29 Jul 2016 12:32:00 +0000 https://acadgild.com/blog/?p=13145 The Hadoop market is undergoing phenomenal growth and continues to show steep growth rate. Looking at the current growth, it wouldn’t be an exaggeration to say that Hadoop is the best option that is cost-effective as well as scalable, open source substitute for commercially available Big Data management suites. Hadoop has also become a vital Big […]

The post 6 Undeniable Reasons to Learn Hadoop appeared first on AcadGild.

]]>
https://acadgild.com/blog/6-undeniable-reasons-learn-hadoop/feed/ 0 13145
Ambari Installation Guide – (part I) https://acadgild.com/blog/ambari-installation-guide-part/ https://acadgild.com/blog/ambari-installation-guide-part/#respond Thu, 16 Jun 2016 06:23:41 +0000 https://acadgild.com/blog/?p=6287 What is Ambari ? The Apache’s Ambari is a Hadoop management tool aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its restful APIs (Definition given by Apache Ambari) How it works? Ambari server will be […]

The post Ambari Installation Guide – (part I) appeared first on AcadGild.

]]>
https://acadgild.com/blog/ambari-installation-guide-part/feed/ 0 6287
Introduction to Spark RDD and Basic Operations in RDD https://acadgild.com/blog/introduction-spark-rdd-basic-operations-rdd/ https://acadgild.com/blog/introduction-spark-rdd-basic-operations-rdd/#respond Fri, 25 Mar 2016 13:09:16 +0000 https://acadgild.com/blog/?p=2322 In this post, we will be discussing RDD in Spark and will be performing some basic operations like creation of RDD, Transformation, and Actions. Let’s start our discussion with MapReduce and the challenges associated with it which led to the innovation of Spark. MapReduce greatly simplified “Big Data” analysis on large clusters, but its inefficiency to […]

The post Introduction to Spark RDD and Basic Operations in RDD appeared first on AcadGild.

]]>
https://acadgild.com/blog/introduction-spark-rdd-basic-operations-rdd/feed/ 0 2322
Configuring Trash in HDFS https://acadgild.com/blog/configuring-trash-hdfs/ https://acadgild.com/blog/configuring-trash-hdfs/#respond Tue, 22 Mar 2016 06:23:49 +0000 https://acadgild.com/blog/?p=2268 Apache Hadoop Provides a trash feature. Trash feature is very helpful for Hadoop Administrators in case of accidental deletion of files and directories. If trash is enabled and a file or directory is deleted, the file is moved to the .Trash directory in the user’s home directory instead of being deleted. Deleted files are initially […]

The post Configuring Trash in HDFS appeared first on AcadGild.

]]>
https://acadgild.com/blog/configuring-trash-hdfs/feed/ 0 2268
Configuring Rack Awareness on Hadoop Using Centos https://acadgild.com/blog/configuring-rack-awareness-hadoop-using-centos/ https://acadgild.com/blog/configuring-rack-awareness-hadoop-using-centos/#respond Wed, 16 Mar 2016 07:29:42 +0000 https://acadgild.com/blog/?p=2199 This blog gives you information about Rack Awareness in Apache Hadoop. HDFS block placement uses rack awareness for fault tolerance by placing one block replica on a different rack. This ensures data availability during network switch failure or a partition within the cluster. Rack awareness is very helpful in making an appropriate replication factor. Configuring […]

The post Configuring Rack Awareness on Hadoop Using Centos appeared first on AcadGild.

]]>
https://acadgild.com/blog/configuring-rack-awareness-hadoop-using-centos/feed/ 0 2199
Commissioning and Decommissioning of Datanode in Hadoop https://acadgild.com/blog/commissioning-and-decommissioning-of-datanode-in-hadoop/ https://acadgild.com/blog/commissioning-and-decommissioning-of-datanode-in-hadoop/#comments Fri, 11 Mar 2016 12:31:33 +0000 https://acadgild.com/blog/?p=1947 Commissioning of nodes stand for adding new nodes in current cluster which operates your Hadoop framework. In contrast, Decommissioning of nodes stands for removing nodes from your cluster. This is very useful utility to handle node failure during the operation of Hadoop cluster without stopping entire Hadoop nodes in your cluster. Why do we need […]

The post Commissioning and Decommissioning of Datanode in Hadoop appeared first on AcadGild.

]]>
https://acadgild.com/blog/commissioning-and-decommissioning-of-datanode-in-hadoop/feed/ 2 1947
Map reduce Use case – Titanic Data Analysis https://acadgild.com/blog/analyzing-titanic-data-with-hadoop-mapreduce/ https://acadgild.com/blog/analyzing-titanic-data-with-hadoop-mapreduce/#respond Fri, 06 Nov 2015 07:16:01 +0000 http://acadgild.com/blog/?p=317 There have been huge disasters in history, but the magnitude of the Titanic’s disaster ranks as high as the depth it sank to. So much so that subsequent disasters have always been described as “titanic in proportion” – implying huge losses. Anyone who has read about the Titanic, know that a perfect combination of natural […]

The post Map reduce Use case – Titanic Data Analysis appeared first on AcadGild.

]]>
https://acadgild.com/blog/analyzing-titanic-data-with-hadoop-mapreduce/feed/ 0 317
10 Big Differences between Hadoop 1 & Hadoop 2 https://acadgild.com/blog/10-big-differences-between-hadoop1-and-hadoop2/ Fri, 03 Jul 2015 09:21:07 +0000 http://blog.acadgild.com/?p=193 Hadoop – the solution for deciphering the avalanche of Big Data – has come a long way from the time Google published its paper on Google File System in 2003 and MapReduce in 2004. It created waves with its scale-out and not scale-up strategy. Inroads from Doug Cutting and team at Yahoo and Apache Hadoop […]

The post 10 Big Differences between Hadoop 1 & Hadoop 2 appeared first on AcadGild.

]]>
193