Future Blog Post
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

I help organization turn Data into Gold.
less than 1 minute read
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.
4 minute read
Published:
Data pipelines are at the heart of modern data engineering. They allow you to automate data ingestion, cleaning, transformation, and feature generation—making your datasets ready for analysis or machine learning tasks.
1 minute read
Published:
Scheduling tasks on Linux is essential for automating system maintenance, backups, or repetitive jobs. Two popular tools for this are Cron and Anacron. While they might seem similar, they serve different purposes and are optimized for different environments. Let’s break it down.
13 minute read
Published:
Docker is simple. Production is not.Most Docker problems don’t come from Docker itself—they come from small shortcuts that compound over time. In this post, we’ll cover practical Docker best practices that keep images small, builds fast, and deployments predictable.
3 minute read
Published:
Running REDCap locally on your PC shouldn’t be fragile or painful. In this post, I walk through how I Dockerised REDCap to create a portable, reproducible, and easy-to-maintain research data platform on my laptop.