Training

HDP Analyst: Data Science

Overview: This condensed course Provides instruction on the processes and practice of data science, including machine learning and natural language processing. Included are: tools and programming languages (Python, IPython, Mahout, Pig, NumPy, pandas, SciPy, Scikit-learn), the Natural Language Toolkit (NLTK), and Spark MLib

Duration: Two Days - Sunday, June 26 - Monday, June 27

Target Audience: Architects, software developers, analysts and data scientists who need to apply data science and machine learning on Hadoop.

Pre-requisites: Students must have experience with at least one programming or scripting language, knowledge in statistics and/or mathematics, and a basic understanding of big data and Hadoop principles. Students new to Hadoop are encouraged to attend the HDP Overview: Apache Hadoop Essentials course. Students are required to bring their own laptop.

HDP Developer: Apache Pig and Hive

Overview: This condensed course is designed for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Pig and Hive. Topics include: Hadoop YARN, HDFS, MapReduce, data ingestion, workflow definition and using Pig and Hive to perform data analytics on Big Data. Labs are executed on a 7-node HDP cluster

Duration: Two Days - Sunday, June 26 - Monday, June 27

Target Audience: Software developers who need to understand and develop applications for Hadoop

Pre-requisites: Students should be familiar with programming principles and have experience in software development. SQL knowledge is also helpful. No prior Hadoop knowledge is required. Students are required to bring their own laptop.

HDP Operations: Hadoop Administration

Overview: This condensed course is designed for administrators who will be managing the Hortonworks Data Platform (HDP) 2.3 with Ambari. It covers installation, configuration, and other typical cluster maintenance tasks.

Duration: Two Days - Sunday, June 26 - Monday, June 27

Target Audience: IT administrators and operators responsible for installing, configuring and supporting an HDP 2.3 deployment in a Linux environment using Ambari.

Pre-requisites: Attendees should be familiar with Hadoop and Linux environments. Students are required to bring their own laptop.

HDP Developer: Apache Spark using Python

Overview: This condensed course is designed for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Spark. Topics include: Hadoop, YARN, HDFS, using Spark for interactive data exploration, building and deploying Spark applications, optimization of applications, creating Spark pipelines with multiple libraries, working with different filet types, building data frames, exploring the Spark SQL API, using Spark Streaming and an introduction to Spark MLlib.

Duration: Two Days - Sunday, June 26 - Monday, June 27

Target Audience: Software engineers that are looking to develop time sensitive applications for Hadoop.

Pre-requisites: Students should be familiar with programming principles and have previous experience in software development. SQL knowledge is helpful. No prior Hadoop experience required, but is very helpful. Students are required to bring their own laptop.

HDP Operations: Security

Overview: This condensed course is designed for experienced administrators who will be implementing secure Hadoop clusters using authentication, authorization, auditing and data protection strategies and tools.

Duration: Two Days - Sunday, June 26 - Monday, June 27

Target Audience: IT administrators and operators responsible for installing, configuring and supporting an Apache Hadoop 2.3 deployment in a Linux environment.

Pre-requisites: Students should be experienced in the management of Hadoop using Ambari and Linux environments. Completion of the Hadoop Administration I course is highly recommended. Students are required to bring their own laptop.

HDP Developer: Apache Spark using Scala

Overview: This condensed course is designed for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Spark. Topics include: Hadoop, YARN, HDFS, using Spark for interactive data exploration, building and deploying Spark applications, optimization of applications, creating Spark pipelines with multiple libraries, working with different file types, building data frames, exploring the Spark SQL API, using Spark Streaming and an introduction to Spark MLlib

Duration: Two Days - Sunday, June 26 - Monday, June 27

Target Audience: Software engineers that are looking to develop time sensitive applications for Hadoop

Pre-requisites: Students should be familiar with programming principles and have previous experience in software development. SQL knowledge is helpful. No prior Hadoop experience required, but is very helpful. Students are required to bring their own laptop.

HDP Operations: Hortonworks DataFlow 

Overview: This condensed course is designed for ‘Data Stewards’ or ‘Data Flow Managers’ who are looking forward to automate the flow of data between systems. Topics Include Introduction to NiFi, Installing and Configuring NiFi, Detail explanation of NiFi User Interface, Explanation of its components and Elements associated with each. How to Build a dataflow, NiFi Expression Language, Understanding NiFi Clustering, Data Provenance, Security around NiFi, Monitoring Tools and HDF Best practices.

Duration: Two Days - Sunday, June 26 - Monday, June 27

Target Audience: Data Engineers, Integration Engineers and Architects who are looking to automate Data flow between systems

Pre-requisites: Students should be familiar with programming principles and have previous experience in software development. Experience with Linux and a basic understanding of DataFlow tools would be helpful. No prior Hadoop experience required, but is very helpful. Students are required to bring their own laptop.

HDP Overview: Apache Hadoop Essentials

Overview: This course provides a technical understanding for Business users and Decision makers and an overview of Apache Hadoop. It includes high-level information about concepts, architecture, operation, and uses of the Hortonworks Data Platform (HDP) and the Hadoop ecosystem. The course provides an optional primer for those who plan to attend a hands-on, instructor-led course. 

Course Objectives:

  • Describe what makes data "Big Data"
  • List data types stored and analyzed in Hadoop
  • Describe how Big Data and Hadoop fit into your current infrastructure and environment
  • Describe fundamentals of: the Hadoop Distributed File System (HDFS)
  • YARN
  • MapReduce
  • Hadoop frameworks: (Pig, Hive, HCatalog, Storm, Solr, Spark, HBase, Oozie, Ambari, ZooKeeper, Sqoop, Flume, and Falcon)
  • Recognize use cases for Hadoop
  • Describe the business value of Hadoop
  • Describe new technologies like Tez and the KnoxGateway


Duration: One Day - Sunday, June 26 OR Monday, June 27

Target Audience: Data architects, data integration architects, managers, C-level executives, decision makers, technical infrastructure team, and Hadoop administrators or developers who want to understand the fundamentals of Big Data and the Hadoop ecosystem

Pre-requisites: No previous Hadoop or programming knowledge is required. Students will need browser access to the Internet. Students are suggested to bring their own laptop but it is not required.

HDP Certified Developer

Register now at a special price of $199 and take one of our Certification exams at the Summit Pre-training & Certification event.

Hortonwork's certification program is now offering hands-on, performance-based exams. This new approach to Hadoop certification is designed to allow individuals an opportunity to prove their Hadoop skills in a way that is recognized in the industry as meaningful and relevant to on-the-job performance.

As a special offer for attendees of Hadoop Summit, you can take “any” of our Hortonworks certification exams for $199. In addition, you have the unique opportunity to take an exam with a live proctor in the room. This special is only available for candidates who take the exam in person at Summit on June 27.

 Please visit our website at http://hortonworks.com/training/certification/ for a list of available exams.

Certification candidates MUST bring their own laptops

Earn Digital Badges: Hortonworks Certified Professionals receive a digital badge for each certification earned. Display your badges proudly on your resume, LinkedIn profile, email signature, etc. Each badge you earn is issued and verified by BadgeCert, a third-party digital badge authentication provider.

sponsor purchase