Skip to main content

Data Engineering

2023


Agility vs stability

·1 min

How do you like your data?

Do you want it to be agile? So it can change at any moment, depending on the needs or wants of those producing the data? If a team decides it wants to model an object differently, with different IDs, they can do so. They are moving fast and breaking things.

The holiday code freeze

·1 min

It’s that time of year again where teams everywhere are considering a code freeze for the holiday season. Should we have one? How long for? What will we do while our code is frozen? Will we have lots to merge in January, and how will we manage that?

The importance of interfaces

·2 mins

Interfaces are powerful. That’s why we see them everywhere in software engineering. In fact, I’d say they are essential when you want to depend on something provided by someone else.

Answer with docs

·1 min

It’s a reality that when working in a team that supports or enables others you’ll have a steady stream of people asking you questions. Sometimes this stream can turn into a flood, and you spend more of your time answering questions than you do delivering projects.

The cost of handoffs in data

·1 min

Every handoff has a cost.

As data engineers we see this most often in the cost of handing off data. We move it from one system to another, paying the cost in the compute needed to do that, paying the cost in the duplication of storage, paying the cost in building and maintaining the pipelines doing it.

Being reactive with data quality

·2 mins

Most of the time we’re reacting to data quality issues.

Maybe someone has made a change to their database schema, and since we’re pulling that into our data warehouse directly from their database that breaks everything we’ve built. Or maybe the business logic has changed upstream, and we had our own version of that logic built on the data warehouse that has fallen out of sync.

The IKEA Effect in Data Engineering

·2 mins

The IKEA effect is where one places greater value on something they built, or partially built themselves. It’s named after the Swedish firm who famously provide their furniture as flat-pack, requiring assembly by the customer.


Want great, practical advice on implementing data mesh, data products and data contracts?

In my weekly newsletter I share with you an original post and links to what's new and cool in the world of data mesh, data products, and data contracts.

I also include a little pun, because why not? 😅

    Newsletter

    (Don’t worry—I hate spam, too, and I’ll NEVER share your email address with anyone!)