Basic Hadoop tutorial
Apache Hadoop is an open-source software framework for distributed storage and distributed processing of large data sets on computer clusters built from commodity hardware.
In this course we’ll discuss several important parts of the Hadoop ecosystem, such as HDFS (Hadoop Distributed File System), MapReduce, Hive, HBase, and Pig.
First, we’ll give an overview of Big Data: what Big Data is, facts about Big Data, scenarios, and Hadoop cluster architecture. Then we’ll move on to HDFS: its components and architecture, including the NameNode, Secondary NameNode, and DataNode.
The next module is about MapReduce. In it, we’ll talk about the Map phase and the Reduce phase, the architecture of MapReduce, Combiners, and Reducers.
The next module is about Pig. In it, we’ll see what Apache Pig is, why it is important, the Pig Latin language, and where Pig should be avoided.
Then we’ll talk about HBase: its use cases, general commands, DDL and DML in HBase, how to create, delete, and integrate tables in HBase, and a lot more.
So start learning Hadoop today.