ETL Process Using Airflow and Docker
Introduction In this article, I will create an ETL process for extracting Forex data and wrap the whole thing as a data pipeline using Airflow and Docker. The ETL process will extract data from fixer.io API, transform it, and load it to a PostgreSQL database. This project aims to have an automated process that constantly feeds the PostgreSQL database with data. Every 2 minutes, the ETL process will load an updated batch of Forex data. Note that this article assumes some knowledge of Airflow, Docker, Python, and SQL. I won't go into too many details to keep this article short. Project Steps 1. Setting up Airflow Architecture 2. 1st Dag - Check if API is available 3. 2nd Dag - Create a table 4. 3rd Dag - Extract 5. 4th Dag - Transform 6. 5th Dag - Load 7. Query data in the pgAdmin UI Step #1 - Setting up Airflow Architecture The first thing we should do is set up the basic ...