Hierarchical Issue Tracker - Data Modelling at Scale
Hierarchical build issue tracker built on PostgreSQL. Features a closure table for instant hierarchy rollups, a snapshot model for point-in-time queries, and pre-computed presence intervals. Full synthetic data pipeline with a Streamlit dashboard.
View Details
Scalable Workflow Orchestration: Advanced Data Pipelines with Apache Airflow
The project titled "Scalable Workflow Orchestration: Advanced Data Pipelines with Apache Airflow" serves as a comprehensive laboratory for mastering complex workflow orchestration, task scheduling, and DAG creation using Apache Airflow.
View Details
Dynamic Schema-Agnostic Dataflow Pipeline for BigQuery
This Apache Beam and Dataflow pipeline provides a metadata-driven approach for automating data ingestion into BigQuery, highlighting architectural flexibility by enabling schema handling without code changes.