Big Data Technology Fundamentals provides baseline general knowledge of the technologies used in big data solutions. It covers the development of big data solutions using the Hadoop ecosystem, including MapReduce, HDFS, and the Pig and Hive [...]
  • AWSBDF
  • Duration 1 day
  • 0 ITK points
  • 0 terms
  • Praha (on request)

    Brno (on request)

    Bratislava (on request)

Big Data Technology Fundamentals provides baseline general knowledge of the technologies used in big data solutions. It covers the development of big data solutions using the Hadoop ecosystem, including MapReduce, HDFS, and the Pig and Hive programming frameworks. This web-based course helps you build a foundation for working with AWS services for big data solutions. This course is offered at no charge, and can be used on its own or to help you prepare for the Big Data on AWS instructor-led course.

»

Individuals who are new to big data concepts, including Enterprise Solutions Architects, Big Data Solutions Architects, Data Scientists, and Data Analysts.

This course teaches you how to:

  • Identify common tools and technologies that can be used to create big data solutions
  • Understand the MapReduce programming framework, including the map, shuffle and sort, and reduce components
  • Distinguish options available for creating a big data solution using the Hive programming framework

Please register for free here.

We recommend that attendees of this course have:

  • Working knowledge of basic programming in a language such as Java or C#

Module 1 – Introduction to Big Data

  • The Business Importance of Big Data
  • The Hadoop Ecosystem
  • Characteristics of Big Data
  • Processing Big Data
  • Tools and Techniques for Analyzing Big Data
  • Implementing Big Data Solutions
  • Case Study – Social Media Analytics

Module 2 – Introduction to MapReduce and Hadoop

  • Hadoop Architecture
  • MapReduce Framework
  • MapReduce Programming
  • MapReduce and HDFS/S3
  • Use Case – Recommendation Engine

Module 3 – Data Analysis Using Pig Programming

  • Introduction to Pig
  • Pig Data Types
  • Representing Data in Pig
  • Running Pig
  • User-Defined Functions
  • Pig vs Traditional RDBMSs
  • Advanced Techniques in Pig

Module 4 – Big Data Querying with Hive

  • Introduction to Hive
  • Representing Data in Hive
  • Hive Data Types
  • Probing Data with Hive Queries
  • Hive and AWS
  • Use Case – Ad Hoc Analysis and Product Feedback
 
Current offer
Training location
Course language

The prices are without VAT.