Skip to main content

Daily data contracts tips

I’m no longer publishing daily data contract tips, but I am still writing! Check out my new weekly newsletter.

2023


The cost of handoffs in data

·1 min

Every handoff has a cost.

As data engineers we see this most often in the cost of handing off data. We move it from one system to another, paying the cost in the compute needed to do that, paying the cost in the duplication of storage, paying the cost in building and maintaining the pipelines doing it.

Being reactive with data quality

·2 mins

Most of the time we’re reacting to data quality issues.

Maybe someone has made a change to their database schema, and since we’re pulling that into our data warehouse directly from their database that breaks everything we’ve built. Or maybe the business logic has changed upstream, and we had our own version of that logic built on the data warehouse that has fallen out of sync.

How important is data to your organisation?

·1 min

If you’re working in one of the data teams then it’s useful to consider how important data is to your organisation. Does it (or something driven by it, such as ML/AI) appear in your company strategy? How easy is it to get investment for people and tooling? Where in the org chart does your data team sit?

The IKEA Effect in Data Engineering

·2 mins

The IKEA effect is where one places greater value on something they built, or partially built themselves. It’s named after the Swedish firm who famously provide their furniture as flat-pack, requiring assembly by the customer.

Writing daily

·1 min

I’ve written before how much I love writing.

Often, I’m writing for myself. By writing things down I feel forced to think critically about the subject. What do I know? What don’t I know? Am I able to explain it clearly? It’s a forcing function to help clarify my ideas.