Data engineering salon. News and interesting reads about the world of data.
Collaborating as a data team to produce excellent datasets -- some parts are bullshit, but it's an interesting read.
Two words: Java functions.
Don't use Spark for tasks that require complex logic.
Deploying is a ritual. It’s a sacred place, a quiet place, and a dangerous place, where anything can happen. In deployment, the system is in a fragile state, and you are in a fragile state.
A gentle introduction to Apache Kafka.
Help with solving Kafka-esque data problems
Cloudera was once one of the hottest Hadoop startups, but over time the shine has come off that market, and today it went private.
"Meltano aims to bring the entire data lifecycle into the DataOps Era." Wut?