Scalding is a Scala library that makes it easy to specify Hadoop MapReduce jobs using a Scala API inspired by Cascading. Scalding was the primary batch data processing framework at Twitter, used to process petabytes of data daily. I was a core contributor, focusing on the algebird-powered aggregation framework and typed API.
Scalding
Scala MapReduce library built on Cascading