Apache Spark @Scale: A 60 TB+ production use case

Facebook often uses analytics for data-driven decision making. Over the past few years, user and product growth has pushed our analytics engines to operate on data sets in the tens of terabytes for a single query. Some of our batch analytics is executed through the venerable Hive platform (contributed to Apache Hive by Facebook in … Continue reading Apache Spark @Scale: A 60 TB+ production use case […]