Hadoop

 

  • Cloudera 5.5 and Hortonworks HDP 2.2
  • IBM BigInsights 4.1
  • Cloudera Administrator Training for Apache Hadoop

  • The Case for Apache Hadoop
    § Hadoop Cluster Installation
    § The Hadoop Distributed File System (HDFS)
    § MapReduce and Spark on YARN
    § Hadoop Configuration and Daemon Logs
    § Getting Data Into HDFS
    § Planning Your Hadoop Cluster
    § Installing and Configuring Hive, Impala, and Pig
    § Hadoop Clients Including Hue
    § Advanced Cluster Configuration
    § Hadoop Security
    § Managing Resources
    § Cluster Maintenance
    § Cluster Monitoring and Troubleshooting

  • The internals of YARN, MapReduce, and HDFS
  • Determining the correct hardware and infrastructure for your cluster
  • Proper cluster configuration and deployment to integrate with the data center
  • How to load data into the cluster from dynamically generated files using Flume and from an RDBMS using Sqoop
  • Configuring the FairScheduler to provide service-level agreements for multiple users of a cluster
  • Best practices for preparing and maintaining Apache Hadoop in production
  • Troubleshooting, diagnosing, tuning, and solving Hadoop issues

 
