Allen
Work. Words. Wonder.

Hi, I'm Allen Joy

I’m a data professional focused on building reliable data pipelines, scalable cloud infrastructure, and meaningful insights from complex datasets. My work lies at the intersection of data engineering, analytics, and cloud technologies. Alongside my technical pursuits, I write about technology, literature, and the curiosity that drives innovation and human understanding.

AWS Python Apache Spark Kubernetes Terraform PostgreSQL dbt Airflow Docker CI/CD Snowflake Kafka

Core Expertise

Data Analytics

Transforming raw data into actionable insights. Expert in statistical analysis, data visualization, and BI tools.

Python SQL Tableau Power BI

Data Engineering

Designing and building robust data pipelines, warehouses, and ETL/ELT systems at scale.

Spark Kafka Airflow dbt

DevOps

Bridging development and operations through automation, CI/CD pipelines, and infrastructure as code.

Docker K8s Jenkins GitHub Actions

Cloud (AWS)

Architecting scalable, cost-efficient cloud solutions. Proficient in core AWS services and cloud-native patterns.

EC2 S3 Lambda Redshift

Featured Books

Author Page

Through the Prism of Life

A Collection of Poems

"Through the Prism of Life" is a poetic anthology that explores the multifaceted nature of existence. From the highs of joy and love to the lows of pain and loss, each poem offers a unique perspective on the human experience.

2023 142 pages Poetry

Bouquet

A Collection of Poems

A bouquet of flowers, a gift so bright — a symbol of love, a heartfelt delight. A rainbow of colors, a fragrant surprise, a beauty to behold for our eager eyes. A timeless treasure — a symbol of love that knows no measure.

2023 142 pages Poetry

Let's Build Something Remarkable

Open to consulting, full-time roles, collaborations, and speaking engagements.

Get in Touch

Technical Portfolio

A showcase of real-world projects spanning data engineering, cloud architecture, and DevOps automation.

Data Analytics 2025

EatIn Ireland Sales Analysis

A real-time data analysis project that analyses food delivery sales data across Ireland. It covers the full analysis workflow (importing and cleaning an Excel dataset, performing EDA, calculating key business KPIs, and building interactive charts), all in Python using Jupyter Notebook.

Problem: Food delivery businesses in Ireland had no clear visibility into their sales trends and revenue performance across regions. Without proper data analysis, it was impossible to identify which counties and cities were generating the most revenue or track sales fluctuations over time.
Solution: Built a complete end-to-end data analysis project in Python based on a client Business Requirements Document (BRD). Loaded and cleaned a 139,428-row Excel dataset, calculated 5 key KPIs and designed 7 interactive charts covering Monthly Trends, Daily Trends, Weekly Analysis, Veg vs Non-Veg Revenue, Sales by County, Quarterly Performance and Top 5 Cities by Sales.
Results: Identified total revenue of €417,972.64 across 139,428 orders. Dublin ranked as the highest-revenue county. Average customer satisfaction was 4.35/5.0. Weekly revenue stabilised at ~€12,000/week. Identified the top 5 cities to guide regional business strategy.
Python Pandas Plotly Matplotlib Jupyter Notebook Excel
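
As a rough illustration of the KPI step described above, the sketch below computes a few headline figures from order records. It uses only the Python standard library so it runs anywhere, whereas the project itself used Pandas. The sample rows, the field names (`order_date`, `county`, `amount`, `rating`), and the choice of KPIs are assumptions for illustration, not the project's actual schema or metric list.

```python
from collections import defaultdict
from datetime import date

# Hypothetical order records; field names and values are illustrative only.
orders = [
    {"order_date": date(2025, 1, 6), "county": "Dublin", "amount": 24.50, "rating": 4.5},
    {"order_date": date(2025, 1, 7), "county": "Cork", "amount": 18.00, "rating": 4.0},
    {"order_date": date(2025, 2, 3), "county": "Dublin", "amount": 31.50, "rating": 5.0},
    {"order_date": date(2025, 2, 4), "county": "Galway", "amount": 12.00, "rating": 3.5},
]


def compute_kpis(rows):
    """Return headline KPIs plus revenue grouped by county and by month."""
    total_revenue = sum(r["amount"] for r in rows)
    by_county = defaultdict(float)
    by_month = defaultdict(float)
    for r in rows:
        by_county[r["county"]] += r["amount"]
        # Bucket each order by calendar month, e.g. "2025-01".
        by_month[r["order_date"].strftime("%Y-%m")] += r["amount"]
    return {
        "total_orders": len(rows),
        "total_revenue": round(total_revenue, 2),
        "avg_order_value": round(total_revenue / len(rows), 2),
        "avg_rating": round(sum(r["rating"] for r in rows) / len(rows), 2),
        "top_county": max(by_county, key=by_county.get),
        "revenue_by_month": dict(by_month),
    }


kpis = compute_kpis(orders)
print(kpis)
```

The grouped totals (`revenue_by_month`, and the per-county sums behind `top_county`) are the kind of aggregates that would feed charts such as Monthly Trends or Sales by County.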
DevOps · Cloud 2025

CostumeZone — Cloud-Based Costume Rental App

A cloud-based web application for browsing and renting costumes online. Demonstrates full AWS architecture, CI/CD pipeline automation, security implementation, and performance optimization using Elastic Beanstalk and Cloud9.

Problem: Costume rental businesses relied on manual, in-person processes with no online presence, making it impossible to scale operations, manage inventory efficiently, or reach customers digitally.
Solution: Built a cloud-native web application on AWS Elastic Beanstalk with a fully automated CI/CD pipeline. Used AWS Cloud9 as the development environment, implemented IAM security controls, and optimized performance for scalability and reliability.
Results: Fully automated deployment pipeline with zero manual intervention. Auto-scaling infrastructure on AWS Elastic Beanstalk. Secure IAM access control implemented. Application live in production with optimized performance.
Python HTML AWS Elastic Beanstalk AWS Cloud9 CI/CD IAM
Cloud · DevOps · Data 2025

TC Tyre Works — Cloud-Based Inventory & Billing App

A cloud-based web application built to manage inventory, billing and bill history for tyre repair shops. Built on a three-tier architecture using React (S3), Django (EC2), and MySQL (RDS) for scalability, accessibility and reliability.

Problem: Tyre repair shops relied entirely on manual paper-based systems for managing inventory and billing. This led to frequent stock errors, slow billing processes, no bill history tracking, and zero visibility into business operations.
Solution: Built a full-stack cloud-based web application on AWS using three-tier architecture. React frontend deployed on S3 for static hosting, Django REST API backend on EC2 for business logic, and MySQL on RDS for reliable data storage. Implemented full CRUD operations across inventory, billing, and admin authentication modules.
Results: Fully digitised inventory and billing operations. Real-time stock tracking and bill history accessible from anywhere. Scalable AWS infrastructure with high availability. Secure admin authentication system protecting business data.
React Django MySQL AWS S3 AWS EC2 AWS RDS
Data Engineering 2025

End-to-End ETL Pipeline with Automated Reporting

Automated weather intelligence pipeline that collects real-time data from 5 cities worldwide, processes it through validation and feature engineering stages, stores it in SQLite with duplicate prevention, and produces a self-contained interactive dashboard with Chart.js visualizations.

Problem: Organizations need reliable, automated systems to collect data from external sources, ensure its quality, store it efficiently, and present actionable insights — all without manual intervention. Manual data collection is error-prone, inconsistent, and doesn't scale.
Solution: Built a production-style ETL pipeline in Python that extracts real-time weather data from OpenWeatherMap API with retry logic, transforms it through validation and feature engineering (comfort index, temperature/humidity/wind categories), loads it into SQLite using idempotent upsert logic, and auto-generates an interactive HTML dashboard. The pipeline logs every run with metadata for auditability and is scheduler-ready.
Results: Processes 5 cities in under 2 seconds per run. Handles API failures gracefully with exponential backoff retries. Prevents duplicate records across multiple runs. Generates a self-contained dashboard with 4 interactive charts and pipeline run history. Includes 16 unit tests with full coverage across all pipeline stages.
Python pandas SQLite Chart.js Jinja2 pytest PyYAML GitHub Pages
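
Two of the techniques named above, exponential backoff retries and idempotent upserts into SQLite, can be sketched in a few lines. This is a minimal sketch under stated assumptions, not the pipeline's actual code: the table layout, the helper names (`with_retries`, `upsert_reading`), and the simulated `flaky_fetch` stand-in for the API call are all hypothetical. The `ON CONFLICT ... DO UPDATE` clause assumes SQLite 3.24 or newer.

```python
import sqlite3
import time


def with_retries(fetch, attempts=3, base_delay=0.5, sleep=time.sleep):
    """Call fetch(); after each failure wait base_delay * 2**attempt, then retry."""
    for attempt in range(attempts):
        try:
            return fetch()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of retries: surface the last error
            sleep(base_delay * 2 ** attempt)


def init_db(conn):
    # The composite primary key is what makes repeated loads idempotent.
    conn.execute(
        """CREATE TABLE IF NOT EXISTS weather (
               city TEXT NOT NULL,
               observed_at TEXT NOT NULL,
               temp_c REAL,
               PRIMARY KEY (city, observed_at)
           )"""
    )


def upsert_reading(conn, city, observed_at, temp_c):
    """Insert a reading, or update it if (city, observed_at) already exists."""
    conn.execute(
        """INSERT INTO weather (city, observed_at, temp_c)
           VALUES (?, ?, ?)
           ON CONFLICT(city, observed_at) DO UPDATE SET temp_c = excluded.temp_c""",
        (city, observed_at, temp_c),
    )
    conn.commit()


# Hypothetical stand-in for the real API call: fails twice, then succeeds.
calls = {"n": 0}


def flaky_fetch():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("simulated API failure")
    return {"city": "Dublin", "observed_at": "2025-01-01T12:00:00Z", "temp_c": 7.5}


delays = []  # record the waits instead of actually sleeping
reading = with_retries(flaky_fetch, attempts=4, base_delay=0.5, sleep=delays.append)

conn = sqlite3.connect(":memory:")
init_db(conn)
upsert_reading(conn, **reading)
upsert_reading(conn, **reading)  # simulate a second run: no duplicate row
row_count = conn.execute("SELECT COUNT(*) FROM weather").fetchone()[0]
print(delays, row_count)
```

Passing `sleep` as a parameter keeps the retry helper easy to unit test, since a test can capture the delays instead of waiting for them.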
Cloud · DevOps · Data 2025

Exploring Lightweight Monitoring Tools for Microservices in Cloud-Based Architectures

A mixed-methods research project evaluating lightweight monitoring tools (Prometheus, Grafana, Telegraf, Jaeger) on AWS. Assessed performance overhead, real-time data processing, and scalability using containerised microservices.

Problem: Traditional monitoring tools fail to address the unique challenges of microservices — service discovery, data consistency, and inter-service communication — leading to poor observability in cloud-native architectures.
Solution: Evaluated four lightweight monitoring tools (Prometheus, Grafana, Telegraf, Jaeger) on AWS using a mixed-methods approach, assessing performance overhead, real-time data processing, and scalability against containerised microservices.
Results: Efficient observability with low overhead. Real-time analytics across 5 monitoring dimensions. Scalable cloud-native architecture on AWS. Research-backed evaluation of 4 monitoring tools.
Prometheus Grafana Telegraf Jaeger AWS TypeScript Docker

GitHub Activity

View Full Profile

The Blog

Thoughts on data engineering, cloud architecture, DevOps, books, productivity, and life between keyboards.

Technology
30 March 2026 5 min read Allen Joy

Agentic AI in the Enterprise: What Tech Leaders Need to Know in 2026

The shift has already happened. Gartner says 40% of enterprise applications will embed task-specific AI agents by end of 2026 — up from less than 5% just a year ago. That's not a gradual adoption curve. That's a step change.

Table of Contents:
  1. What "agentic AI" actually means
  2. Where it's delivering results right now
  3. The three things blocking scale
  4. Governance is your competitive advantage
  5. A 5-step path from pilot to production
#AgenticAI #Enterprise #Technology #AIStrategy
Technology
31 March 2026 5 min read Allen Joy

AI-Ready Data: Why Your Data Strategy Is the Real Bottleneck in 2026

Enterprises have poured billions into AI — yet only 7% say their data is completely ready for it. The models aren't the problem. The data underneath them is. Here's what tech leaders need to fix first.

Table of Contents:
  1. The Pilot-to-Production Gap Is a Data Problem
  2. Why Traditional Data Management Falls Short
  3. What "AI-Ready Data" Actually Looks Like
  4. The Governance Imperative
  5. A Practical Roadmap: Where to Start
#DataStrategy #AIReadiness #DataEngineering #Enterprise
Read Full Article

Discussion

Sarah K. · 2 days ago

Great article! The section on data contracts was particularly eye-opening. Would love to see a follow-up on testing strategies.

Author

Allen Joy

Engineer by day, writer by night — always a reader.

My Story in Literature

Before I wrote code, I wrote stories. My journey into writing began long before I knew what a data pipeline was — a childhood spent in libraries, a degree that introduced me to the power of narrative, and a quiet, persistent belief that words matter as much as algorithms.

I started the Born to Read YouTube channel as a way to bring together my two worlds: the technical rigor of engineering and the contemplative depth of literature. What began as book reviews evolved into something larger — a community of curious minds who believe that reading makes you a better engineer, and engineering makes you a more precise writer.

My books sit at the intersection of technology and lived experience. I write for engineers who read novels, and for readers who are curious about the systems shaping the world.

2 Books Published
3K+ YouTube Subscribers
200+ Books Read (tracked)
12+ Writing Genres Explored

My Books

Through the Prism of Life

A Collection of Poems

"Through the Prism of Life" is a poetic anthology that explores the multifaceted nature of existence. From the highs of joy and love to the lows of pain and loss, each poem offers a unique perspective on the human experience. The collection invites readers to view life through a prism, where each emotion refracts and reflects into a spectrum of colors and shades. With lyrical language and vivid imagery, it offers a journey of self-discovery and reflection, reminding us that every moment of life is a precious gift.

2023 142 pages Poetry

Bouquet

A Collection of Poems

A bouquet of flowers, a gift so bright — a symbol of love, a heartfelt delight. A rainbow of colors, a fragrant surprise, a beauty to behold for our eager eyes. Each petal a work of art, a masterpiece that's close to heart. A bouquet of flowers is a timeless treasure — a symbol of love that knows no measure. A reminder that every gesture of kindness warms the soul, and love is what makes us whole.

2023 142 pages Poetry

Unboxing Book | Fire and Blood (Game of Thrones) | George R.R Martin

The Hidden Hindu | Book 1 | Short Review by Author Akshat Gupta

Why We Read Books | Importance of Reading | Benefits of Reading

Unboxing Bookset | The Witcher | English

Born to Read on YouTube

Subscribe

Media Appearances

The Person Behind the Code

Engineer, author, perpetual learner. Here's the longer version.

Allen Joy
"The best way to understand a system is to build it — and then break it."

Professional Background

I'm Allen Joy, a data engineer and cloud architect with 2+ years of experience building data infrastructure for companies ranging from fast-growing startups to large enterprises. My work lives at the intersection of data engineering, cloud architecture, and DevOps.

I specialize in designing systems that are reliable, observable, and maintainable — not just systems that work. I care deeply about data quality, engineering best practices, and the kind of documentation that actually helps people.

Outside of engineering, I'm an author and the host of the Born to Read YouTube channel, where I explore the relationship between reading, thinking, and building.

Career Journey

2024 – 2025

Graduate — Masters in Cloud Computing

National College of Ireland

Specialization in Cloud Management. Dissertation: Exploring Lightweight Monitoring Tools for Microservices in Cloud-Based Architectures.

2022 – 2023

System Analyst

ELA Sustainable Solution PVT. LTD

Delivered 24/7 client support and responsive customer service, addressing user and staff needs promptly and efficiently. Implemented system improvements that enhanced customer service response times, improving user satisfaction by 15%. Optimized internal workflows, resulting in a 20% increase in operational efficiency.

2019 – 2022

Graduate — Bachelor in Computer Application

Jain Deemed-to-be University

First Class Honours. Specialized in Mobile Application and Cloud Computing.

2019

Graduate — Diploma in Computer Application

Dr. C.V. Raman University, Bilaspur (C.G.), India

First Class Honours. Foundational knowledge in computer programming, automation, database management, and essential software tools.

Certifications & Credentials

AWS Solutions Architect

Professional Level

AWS Data Analytics

Specialty

dbt Analytics Engineer

Certification

Google Cloud Professional

Data Engineer

Technical Skills

Python: 95%
SQL: 90%
AWS: 88%
Apache Spark: 85%
Terraform: 88%
Kubernetes: 82%
dbt: 90%
Kafka: 80%

Career Goals

Build at Scale

Continue architecting data systems that process billions of events reliably and efficiently.

Lead & Mentor

Grow into a Principal/Staff engineer role where I can shape data strategy and mentor the next generation.

Write & Speak

Publish more technical content, write my next book, and speak at major data and cloud conferences.

Build Products

Launch a SaaS tool that solves a real data engineering pain point — currently in stealth.

Let's Connect

Open to new opportunities, collaborations, speaking invites, and good conversations.

Say Hello

Whether you're a recruiter, a fellow engineer, a reader, or a conference organiser — I'd love to hear from you. I typically respond within 24 hours.

Available for hire

Open to full-time, contract, and consulting roles in Data Engineering & Cloud.
