Solving MapReduce Performance Problems With Sharded Joins
Sometimes the answer to a sluggish data pipeline isn’t more power in the Hadoop cluster, but a shift in technique. [...]
Published by Noel CodySometimes the answer to a sluggish data pipeline isn’t more power in the Hadoop cluster, but a shift in technique. [...]
Published by Noel CodyFor my master’s thesis, I developed and benchmarked an Apache Cassandra compaction strategy optimized for time series. The result, the [...]
Published by Björn HegerforsAll of our lovely Spotify users generate many terabytes of data every day. All the songs that are listened to, [...]
Published by davidawhitingSpotify is a very frequent user of Puppet. We have given many talks and presentations on the subject but these [...]
Published by jhaalsA few weeks ago Spotify had one of the biggest incidents in the last few years. It caused a major [...]
Published by David Poblador i GarciaWe recently hosted the seventh sthlm.js meetup at our office and Paul Lewis of Google Chrome, Robert Nyman of Mozilla and our [...]
Published by Gabriel BonanderThe most frequent question we heard at PyCon this weekend, was how do we use Python at Spotify. Hopefully this post answers [...]
Published by Geoff van der Meerbackend infrastructure at Spotify. Our backend infrastructure is very much work in progress – in some areas we have come [...]
Published by Spotify EngineeringPowering the Spotify service is a backend of dozens of different, specialized service implementations. For example, we have a playlist system that [...]
Published by Björn EdströmIn this article I will explain how Spotify uses different mature and proven technologies in our backend service eco-system and [...]
Published by Björn EdströmIt was the middle of the night, 10th of May 2011, and were at step twentysomething of the rollout plan, [...]
Published by Björn EdströmSpotify makes a significant investment in writing code to automate the provisioning of servers in our data centers. Our goal is [...]
Published by Noa Resare