Portfolio

Designing reliable data pipelines for analytics, products, and decision-making.

I am Atul Bhardwaj, a DataForge focused on

ETL Pipeline

About Me

I am a B.Tech student and aspiring Data Engineer who enjoys building systems that collect, clean, transform, and deliver high-quality data for reporting and machine learning.

My work focuses on creating dependable pipelines, improving data quality, and modeling data in a way that is simple for analysts and product teams to consume. I care about automation, observability, and long-term maintainability.

I actively learn by building end-to-end data projects: ingesting data from APIs and files, orchestrating workflows, storing it in warehouse layers, and exposing it through dashboards and query-ready marts.

Pipeline-first mindset with strong ETL/ELT fundamentals
SQL-focused problem solving for performance and data quality
Experience designing fact-dimension models for analytics
Automation through schedulers and workflow orchestration
Fast learner with focus on production-like data projects

Quick Snapshot

3+
Data Projects
6+
Data Tools
1M+
Rows Processed

Skills

Core data engineering capabilities used in practical pipeline and analytics projects.

Data Engineering Core

Python SQL ETL / ELT Data Modeling Data Validation

Cloud & Processing

Apache Spark Kafka Airflow AWS / GCP

Storage & BI

PostgreSQL MySQL BigQuery / Snowflake Power BI / Tableau

Current Level Map

Pipeline Design
SQL Optimization
Data Warehousing
Orchestration & Monitoring

Projects

End-to-end data engineering projects — from raw ingestion to analytics-ready insights.

01 In Progress

Identity Lakehouse

Building a Medallion-architecture lakehouse for Aadhaar and government schemes; Spark + Delta handle Bronze–Silver–Gold refinement, while Airflow orchestrates containerized ETL for analytics-ready demographic insights.

Apache Spark Delta Lake Airflow Docker SQL
02 Completed

Revenue Intelligence Engine — SQL Data Warehouse

Unified fragmented ERP and CRM sales data into a Medallion-modeled warehouse, landing clean Bronze/Silver layers and a Star Schema Gold mart for reliable revenue and customer insights.

SQL ETL Data Modeling Star Schema
03 Completed

EcoSentry AI

Deployed an AI-driven web platform that predicts wildfire risk in real time using Google Earth Engine data, ML models, and interactive geospatial visualization for global locations.

ML Google Earth Engine GCP FastAPI Scikit-Learn

Activity

Day-by-day commits for @atulbhardwaj-io over the past year.

GitHub contribution chart for atulbhardwaj-io

Experience & Achievements

Smart Bin: AI-Based Waste Classification

Research Project — Jan 2025 – Apr 2025

Developed an image-recognition system that distinguishes recyclable vs. non-recyclable waste, trained and evaluated deep-learning detectors, and delivered a prototype that promotes smarter waste management.

AI Developer — Smart India Hackathon (Internal Round)

Top College Team | Safety Prediction Platform

Built an anomaly-detection platform that analyzes crime trends in tourist areas using PyTorch and ML models, surfacing risk scores and alerts for safer travel planning.

B.Tech CSE (IoT) — SRM Institute

Aug 2024 – May 2028

Strengthening data engineering and ML foundations through projects that mirror production workloads, focusing on pipelines, warehousing, and model deployment.

Get In Touch

Have a project in mind or just want to connect? Drop me a message — I typically reply within 24 hours.