The Hadoop Administration Certification Training is designed for individuals who want to gain the expertise to operate and manage a Hadoop cluster. The course uses practical examples to build an understanding of cluster setup, installation, configuration and Kerberos. It is hands-on training that prepares learners to deal with real-world tasks.
The Hadoop Administration Certification Training provides in-depth training on how to follow up on service requests and support issues raised by customers, and how to handle those requests efficiently. You will gain the proficiency to navigate the challenges Hadoop administrators often face.
The Hadoop Administration Certification Training is a step-by-step guide that will help you master advanced Hadoop admin activities. Qualifying in this course will set you in the right direction, opening the door to new and exciting job roles in leading organisations.
Global Edulink is a leading online provider for several accrediting bodies, and provides learners with the opportunity to take this exclusive course awarded by CPD. At Global Edulink, we give our fullest attention to our learners’ needs and ensure they have the information required to proceed with the course. Learners who register will be given excellent support and discounts on future purchases, and will be eligible for a TOTUM Discount card and Student ID card with amazing offers and access to retail stores, the library, cinemas, gym memberships and their favourite restaurants.
1: Understanding Big Data and Hadoop
Introduction to big data
Common big data domain scenarios
Limitations of traditional solutions
What is Hadoop?
Hadoop 1.0 ecosystem and its Core Components
Hadoop 2.x ecosystem and its Core Components
Application submission in YARN
2: Hadoop Cluster and its Architecture
Distributed File System
Hadoop Cluster Architecture
Replication rules
Hadoop Cluster Modes
Rack awareness theory
Hadoop cluster administrator responsibilities
Understand working of HDFS
NTP server
Initial configuration required before installing Hadoop
Deploying Hadoop in a pseudo-distributed mode
3: Hadoop Cluster Setup and Working
OS Tuning for Hadoop Performance
Pre-requisite for installing Hadoop
Hadoop Configuration Files
Stale Configuration
RPC and HTTP Server Properties
Properties of Namenode, Datanode and Secondary Namenode
Log Files in Hadoop
Deploying a multi-node Hadoop cluster
4: Hadoop Cluster Administration and Maintenance
Commissioning and Decommissioning of Node
HDFS Balancer
Namenode Federation in Hadoop
High Availability in Hadoop
Trash Functionality
Checkpointing in Hadoop
Distcp
Disk balancer
5: Computational Frameworks, Managing Resources and Scheduling
Different Processing Frameworks
Different phases in Mapreduce
Spark and its Features
Application Workflow in YARN
YARN Metrics
YARN Capacity Scheduler and Fair Scheduler
Service Level Authorization (SLA)
6: Hadoop 2.x Cluster: Planning and Management
Planning a Hadoop 2.x cluster
Cluster sizing
Hardware, Network and Software considerations
Popular Hadoop distributions
Workload and usage patterns
Industry recommendations
7: Hadoop Security and Cluster Monitoring
Monitoring Hadoop Clusters
Hadoop Security System Concepts
Securing a Hadoop Cluster With Kerberos
Common Misconfigurations
Overview on Kerberos
Checking log files to understand Hadoop clusters for troubleshooting
8: Cloudera Hadoop 2.x and its Features
Visualize Cloudera Manager
Features of Cloudera Manager
Build Cloudera Hadoop cluster using CDH
Installation choices in Cloudera
Cloudera Manager Vocabulary
Cloudera terminologies
Different tabs in Cloudera Manager
What is HUE?
Hue Architecture
Hue Interface
Hue Features
9: Pig, Hive Installation and Working (Self-paced)
Explain Hive
Hive Setup
Hive Configuration
Working with Hive
Setting Hive in local and remote metastore mode
Pig setup
Working with Pig
10: HBase, Zookeeper Installation and Working (Self-paced)
What is NoSQL Database
HBase Data Model
HBase Architecture
MemStore, WAL, BlockCache
HBase Hfile
Compactions
HBase Read and Write
HBase balancer and hbck
HBase setup
Working with HBase
Installing Zookeeper
11: Understanding Oozie (Self-paced)
Oozie overview
Oozie Features
Oozie workflow, coordinator and bundle
Start, End and Error Node
Action Node
Join and Fork
Decision Node
Oozie CLI
Install Oozie
12: Data Ingestion using Sqoop and Flume (Self-paced)
Types of Data Ingestion
HDFS data loading commands
Purpose and features of Sqoop
Perform operations like Sqoop Import, Export and Hive Import
Sqoop 2
Install Sqoop
Import data from RDBMS into HDFS
Flume features and architecture
Types of flow
Install Flume
Ingest Data From External Sources With Flume
Best Practices for Importing Data
Vic Thomas
I fully recommend this course to anyone who wants to learn about the complex Hadoop cluster. I gained the proficiency required to sustain a Hadoop cluster.
Phoenix Brown
The course was very satisfactory! Great job to the tutor!