
Hadoop is a tool designed for larger files. But how do you handle small files? This blog gives you a brief overview on solving this problem. There are two primary reasons why small files are problematic in Hadoop: NameNode memory management and MapReduce performance. NameNode Memory Management A common...