Below is a list with links to some of my past writing. Happy 2024 and beyond! đĽ
Data Engineering
Workflow Orchestration vs. Data Orchestration â Are Those Different?
TaskFlow API in Apache Airflow 2.0âââShould You Use It?
7 Reasons Why You Should Consider a Data Lake (and Event-Driven ETL)
âDonât Repeat Yourselfâ is beneficial â Not Only in Software Engineering
Is Real-Time Processing Worth It For Your Analytical Use Cases?
Event-driven Data Pipelines with AWS Lambda and GitHub Actions
DuckDB vs. MotherDuck â should you switch to the Cloud version?
Apache Iceberg Crash Course for AWS users: Amazon S3, Athena & AWS Glue â¤ď¸ Iceberg
Polars, DuckDB, Pandas, Modin, Ponder, Fugue, Daft â which one is the best dataframe and SQL tool?
Serverless, Cloud & AI
How To Securely Parse GitHub Actions JSON Secrets for Azure CI/CD
AI Tools and Autonomous Agents: Auto-GPT, BabyAGI, LangChain, AgentGPT, HeyGPT, and more
How I Use OpenAIâs GPT-4 To Stay In Touch With My Mum More Consistently
When GitHub Actions Get Painful to Troubleshoot, Try This Instead
AWS
10 Simple Hacks That Will Make You More Productive When Using AWS
AWS Kinesis vs. SNS vs. SQS â A Comparison With Python Examples
How I Manage Credentials in Python Using AWS Secrets Manager
How I Built CI/CD For Data Pipelines in Apache Airflow on AWS
Deep Dive Into Amazon Timestream â Data Ingestion in Python
Deep Dive Into Amazon Timestream â Building a Real-Time Dashboard
How to Deploy a ChatGPT-powered Python FastAPI Microservice to AWS App Runner