Skip to main content

Daily data contracts tips

I’m no longer publishing daily data contract tips, but I am still writing! Check out my new weekly newsletter.

2023


The problem with defaults

·2 mins

Let’s say 20 years ago you ran your code in an environment you configured simply as python. The obvious default would have been Python 2. But today the obvious default is Python 3. If you deployed that same code with the same configuration, what Python environment would you expect? What would you expect that to be in 20 years time?

Agility vs stability

·1 min

How do you like your data?

Do you want it to be agile? So it can change at any moment, depending on the needs or wants of those producing the data? If a team decides it wants to model an object differently, with different IDs, they can do so. They are moving fast and breaking things.

The holiday code freeze

·1 min

It’s that time of year again where teams everywhere are considering a code freeze for the holiday season. Should we have one? How long for? What will we do while our code is frozen? Will we have lots to merge in January, and how will we manage that?

Data governance is changing

·1 min

Data governance is changing.

With the move towards data mesh we’re seeing a move away from centralised governance teams defining policies and assigning roles like “data steward” (which no one wanted to be!) to people in the organisation and instead delegating those responsibilities to those teams and domains who produce the data and have the most context on the data.

Prioritising data quality projects

·1 min

It’s likely going to be difficult to get a project prioritised if the goal is defined only as “improving data quality”. So what? What’s the business value that will provide or enable? And does that align with the organisations wider goals?

Know your audience

·2 mins

After mentioning the title of my talk for apidays in Paris was “Data Contracts: The API for Data”, I had a couple of responses saying that data contracts are not APIs, and that they’re for communication and facilitating an agreement between the generators and consumers of data (or something along those lines).

The similarities between software and data

·1 min

I’m writing this on the train back from Paris, having spent the last few days at the apidays conference, where I gave a talk titled “Data contracts: The API for Data”. I have a lot of digest and plenty of notes to go through! But one think that struck me is the similarities between software and data.

The importance of interfaces

·2 mins

Interfaces are powerful. That’s why we see them everywhere in software engineering. In fact, I’d say they are essential when you want to depend on something provided by someone else.