Skip to main content

Daily data contracts tips

I’m no longer publishing daily data contract tips, but I am still writing! Check out my new weekly newsletter.

2024


The costs of storing data

·1 min

Since Hadoop came along in 2006 and significantly reduced the cost of storing “big data” we’ve often been focused on how much data we can bring in centrally, with the assumption that we’ll use it to create value later.

Every data transform is technical debt

·3 mins

I enjoyed this post by Paul McMahon on how all code is technical debt. Paul argues that the more code an application has, the slower the development, due to the assumptions that exist in that code and the features they support. As such, he views all code as technical debt.

Trust starts at the source

·1 min

As I wrote yesterday, many data professionals don’t trust the data they are building on. And many users of data and data applications don’t trust the data they’re being provided.

Do you trust your data?

·1 min

At most of my recent talks I’ve asked the audience - who are made up of data professionals - a simple question: Do you trust your data?

We are not unique

·1 min

Most of the problems we’re solving in our organisations are not unique to us:

  • We need a way to store data
  • We need a way to discover data
  • We need a way to transform data
  • We need a way to build and present dashboards
  • We need a way to train and deploy machine learning models

And so on.

2023


Challenge assumptions

·1 min

Often things are as they are because they always have been.

So challenge the assumption that they have to be that way.

Work on your written communication

·1 min

Written communication is such an important skill for your career.

Most of us do most of our professional communication on something like Slack or Teams, particularly as we spend more time working remotely. The quality of that communication affects how well you’ll work with others and the relationships you build with them.