Scio 0.7: a deep dive
Introduction: Large-scale data processing is a critical component of Spotify’s business model. It drives music recommendations, artist payouts based on [...]
Spotify’s Event Delivery system is responsible for delivering hundreds of billions of events every day. Most of the events are generated as a response [...]
In this part we’ll take a closer look at Scio, including basic concepts, its unique features, and concrete use cases here at Spotify [...]
Changing an engineering culture is one of the biggest challenges for any organization. It requires challenging an existing way of working, and introducing compelling improvements [...]
This is the first part of a two-part blog series. In this series we will talk about Scio, a Scala API for Apache Beam and Google Cloud Dataflow, and [...]
Five years ago, music personalization at Spotify was a tiny team. The team read papers, developed models, wrote data pipelines [...]
At Spotify we have over 60 million active users who have access to a vast music catalog of over 30 million [...]
Spotify has built several real-time pipelines using Apache Storm for use cases like ad targeting, music recommendation, and data visualization. Each of these [...]
All of our lovely Spotify users generate many terabytes of data every day. All the songs that are listened to, [...]