Skip to main content

Blogs

2022


Answer with docs

·1 min

As someone who works in an enablement team, I’m trying to instil a best practice of answering with docs. This has a number of benefits to both us and our users:

2021


Data contracts

·4 mins

Almost all data platforms start with a change data capture (CDC) service to extract data from an organisations transactional databases - the source of truth for their most valuable data. That data is then transformed, joined, and aggregated to drive analysis, modelling, and other downstream services.

We had an incident, and it was great

·4 mins

We recently had an incident with our data pipeline, resulting in data being lost on route to our data platform. Of course, you never want an incident, but failures are a fact of life. What’s important is how you prepare for them and respond to them, and in that sense this was a great incident.

2020


The democratisation of Data Science

·2 mins

There’s a trend in the industry to make data science and machine learning more accessible, allowing engineers to build and deploy standard models without needing to have a strong data science background. Examples include BigQueryML (standard models in SQL), Ubers Ludwig and h2o.ai.

What does a Tech Lead do?

·2 mins

I’ve been a Tech Lead for a few years now, though I’d say I’ve only been a good Tech Lead for about a year. So what, exactly, does a good Tech Lead do?

2019


Lambda Architecture in 2020

·2 mins

As I start to think about some of the upcoming projects we’ll be working on over the next year and how we might go about building them, I wanted to consider where lambda architecture fits in our toolbox for building data services.

The benefits of postmortems

·1 min

Postmortems are a well established process followed in the aftermath of an incident. Often the most visible output from the process is a structured document, with some or all of the following components: