A Complete Introduction to Apache Airflow


1. Basic concepts

DAGS

DAG Runs

Operators


2. Installation

pipenv install --python=3.7 Flask==1.0.3 apache-airflow==1.10.3
echo "AIRFLOW_HOME=${PWD}/airflow" >> .env
pipenv shell
airflow initdb
mkdir -p ${AIRFLOW_HOME}/dags/

3. Running Airflow

airflow webserver -p 8081
airflow scheduler

4. Basic CLI commands

airflow run tutorial sleep 2020-05-31
bash-3.2$ airflow list_tasks tutorial
airflow pause tutorial
airflow unpause tutorial
airflow backfill tutorial -s 2020-05-28 -e 2020-05-30

5. The Web UI


6. A first DAG

BigQuery set up

pipenv install google-cloud-storage httplib2 google-api-python-client google-cloud-bigquery pandas_gbq

Creating the DAG


Comments

Popular posts from this blog

Flutter for Single-Page Scrollable Websites with Navigator 2.0

A Data Science Portfolio is More Valuable than a Resume

Better File Storage in Oracle Cloud