Fundamentals of Apache Flink: Learn data processing using Apache Flink

Fundamentals of Apache Flink: Learn data processing using Apache Flink

English | MP4 | AVC 1920×1080 | AAC 48KHz 2ch | 3h 14m | 475 MB

Use Apache Flink and its ecosystem to process real-time big data

Have you heard of Apache Flink, but don’t know how to use it to get on top of big data? Have you used Flink, but want to learn how to set it up and use it properly? Either way, this course is for you.

This course first introduces Flink concepts and terminology, and then moves on to building a Flink instance, collecting data, and using that data to generate output that can be used as processed data input into other systems. You will also use the Flink APIs to process data in batch and streaming modes.

By the end of the course, you will be capable of using the Apache Flink ecosystem to achieve complex tasks such as event processing and machine learning.

Learn

  • Build your own Flink development environment on a Linux server
  • Monitor your stream processing in real-time using the Flink UI
  • Organize your data comprehensively using data processing pipelines
  • Build end-to-end, real-time analytics projects
  • Design a distributed Flink environment to efficiently process, transform, and aggregate your data
Table of Contents

Introduction to Flink
1 The Course Overview
2 What Is Apache Flink
3 What Is Big Data and Why Does It Matter
4 Installing Flink in Your Local System
5 Setting Up the AWS Environment

Using a Flink Cluster UI and Data Onboarding
6 Using Cluster UIs
7 Programming Model
8 Batch Processing
9 Stream Processing

Batch Analytics with Apache Flink – Transformations
10 Loading Data
11 Transformations
12 More transformations
13 Partitioning

Batch Analytics with Apache Flink – Aggregations and Joins
14 Aggregations
15 Joins

Stream Processing with Apache Flink – Transformations
16 Data Sources
17 Data Stream Transformations
18 Window Operations

Advanced Stream Processing with Apache Flink
19 Time Characteristics
20 State and Checkpointing
21 Other Data Stream Operations