URL: https://github.com/twitter/scalding

Tech: Scala, Hadoop, Cascading

Scalding is a Scala library that makes it easy to specify Hadoop MapReduce jobs using a Scala API inspired by Cascading. Scalding was the primary batch data processing framework at Twitter, used to process petabytes of data daily. I was a core contributor, focusing on the algebird-powered aggregation framework and typed API.