Hadoop for System Administrators

Popis kurzu

This course covers the essentials of deploying and managing an Apache™ Hadoop® cluster. The course is lab intensive with each participant creating their own Hadoop cluster using either the CDH (Cloudera's Dis­tribution, including Apache Hadoop) or Hortonworks Data Platform stacks. Core Hadoop services are explored in depth with emphasis on troubleshooting and recovering from common cluster failures. The fundamentals of related services such as Ambari, Zookeeper, Pig, Hive, HBase, Sqoop, Flume, and Oozie are also covered.

Obsah kurzu

After completing this course students will be able to:

  • Data Analysis
  • Big Data
  • Hadoop Core Architecture
  • Hadoop Ecosystem
  • Hadoop Ecosystem continued
  • Running Commands on Multiple Systems
  • Design Goals
  • Design
  • Blocks
  • Block Replication
  • Namenode Daemon
  • Secondary Namenode Daemon
  • Datanode Daemon
  • Accessing HDFS
  • Permissions and Users
  • Adding and Removing Datanodes
  • Balancing
  • MapReduce
  • Terminology and Data Flow
  • MapReduce Daemons
  • YARN
  • MapReduce Essential Configuration
  • Failure and Recovery
  • Working with Jobs
  • Scheduling Concepts
  • FIFO Scheduler
  • Fair Scheduler
  • Fair Scheduler – Configuration
  • For more info about this course please open datasheet

Cieľová skupina

Systems Administrators who will be responsible for managing and administering Hadoop clusters

Poznámka k cene

1530 EUR

Kontaktná osoba

Helena Kazárová
+420 261 307 495
education.czsk@hpe.com