
The Complete Hands-On Introduction to Apache Airflow
Apache Airflow is an open-source platform to programmatically author, schedule and monitor workflows. If you have many ETL(s) to manage, Airflow is a must-have.
In the Complete Hands-On Introduction to Apache Airflow, you are going to learn everything you need in order to start using Airflow and discover how powerful it is. Through theory and practical videos, you will see basic as well as advanced concepts.
Materials (required for the course)
Download the materials by clicking on the link below
Curriculum
Section 1: Course Introduction
- Important Prerequisites
- Course Objectives
- Course Structure
- Who I am
- Development Environment
Section 2: First Approach to Airflow
- Introduction
- Why use Airflow?
- What is Airflow?
- How Airflow works?
- [Practice] Installing Airflow
- [Practice] Quick Tour of Airflow UI
- [Practice] Quick Tour of Airflow CLI
- Quiz Time!
- Recap
Section 3: Coding Your First Data Pipeline with Airflow
- Introduction
- What is a DAG?
- [Practice] Time to code your first DAG
- What is an Operator?
- [Practice] Let’s use Operators
- Operator Relationships and Bitshift Composition
- [Practice] Adding dependencies
- How the Scheduler Works?
- [Practice] A Quick Play With Backfill and Catchup
- What is a Workflow?
- Quiz Time!
- Recap
Section 4: Databases and Executors
- Introduction
- Sequential Executor with SQLite
- Local Executor with PostgreSQL
- [Practice] Configure a DAG with Local Executor and PostgreSQL
- Celery Executor with PostgreSQL and RabbitMQ
- [Practice] Configure a DAG with Celery Executor, PostgreSQL and RabbitMQ
- Quiz Time!
- Recap
Section 5: Advanced Concepts
- Introduction
- Minimising Repetitive Patterns With SubDAGs
- [Practice] Minimising a DAG with SubDAGs
- How to Interact With External Sources Using Hooks?
- [Practice] Get Results From PostgreSQL Using Hooks
- How to Share Data Between Your Tasks With XCOMs?
- [Practice] Share Your First Messages Using XCOMs
- How to Execute Tasks According To Criteria Using Branching
- [Practice] Make Your First Conditional Tasks
- Control Your Tasks With SLAs
- [Practice] Defining a SLA in a DAG
- Quiz Time!
- Recap
Section 6: Customising Your Apache Airflow Installation With Plugins
- Introduction
- Adding Functionalities to Apache Airflow
- [Practice] Creating a Hook to Interact With Elasticsearch
- [Practice] Creating a Transfer Operator PostgresqlToElasticsearch
- [Practice] Adding a View to Apache Airflow UI
- Quiz Time!
- Recap
Section 7: Using Apache Airflow With Docker
- Introduction
- Quick Reminder About Docker
- [Practice] Building And Running Your First Airflow Docker Image
- [Practice] Running Airflow Using Sequential Executor With Docker
- [Practice] Running Airflow Using Local Executor With Docker
- [Practice] Scaling Airflow Using Celery Executor With Docker
- Quiz Time!
- Recap