Data Engineering

2024


Building defensively

·2 mins

I talk a lot about data quality, and how it can be improved.

And that’s because generally, it’s garbage!

The costs of storing data

·1 min

Since Hadoop came along in 2006 and significantly reduced the cost of storing “big data”, we’ve often focused on how much data we can bring in centrally, on the assumption that we’ll use it to create value later.

Every data transform is technical debt

·3 mins

I enjoyed this post by Paul McMahon on how all code is technical debt. Paul argues that the more code an application has, the slower the development, due to the assumptions that exist in that code and the features they support. As such, he views all code as technical debt.

Trust starts at the source

·1 min

As I wrote yesterday, many data professionals don’t trust the data they are building on. And many users of data and data applications don’t trust the data they’re being provided.

Do you trust your data?

·1 min

At most of my recent talks I’ve asked the audience, which is made up of data professionals, a simple question: Do you trust your data?

We are not unique

·1 min

Most of the problems we’re solving in our organisations are not unique to us:

  • We need a way to store data
  • We need a way to discover data
  • We need a way to transform data
  • We need a way to build and present dashboards
  • We need a way to train and deploy machine learning models

And so on.

2023


3 data assumptions worth challenging

·1 min

3 common data assumptions I believe are worth challenging:

  1. No one else cares about data quality
  2. You have to bring all data centrally before you can work on it
  3. Your problems are unique to you, and therefore you need a unique solution
