Getting Started with Apache Spark: the Definitive Guide

By Tim King , Executive Editor at Solutions Review
Best Practices,

If you work in Data Science or IT, you’re probably already familiar with Apache Spark. In practice, Spark has grown exponentially in 2015, and in some use cases it has matched or even surpassed Hadoop as the open source Big Data framework of choice. Vendors are beginning to hop on board as well, as Talend, Altiscale and Pentaho have all enhanced their integration platforms with Spark in recent months.

With all of the highly technical chatter out there, it can be hard to understand what Spark can help your organization do. Thankfully there’s LinkedIn’s Slideshare, a resource where users and companies can host webinars and presentations for public access. We combed through thousands of presentations on the site using the Spark keyword to find a series of eight created by Databricks, a company who revolutionizes data processing through the Spark platform.

The slideshows, which were all presented by Databricks at Spark Summit EU 2015 in late October, outline various topics on Spark, as you’ll see below:

The evolution of Spark: where is it being used, for what purpose, and by whom?

Spark Summit EU 2015: Matei Zaharia keynote from Databricks

A technical overview of Spark’s DataFrame API: Implementation and more:

Spark Summit EU 2015: Spark DataFrames: Simple and Fast Analysis of Structured Data from Databricks

An inside look at Spark’s development, both frontend and backend:

Spark Summit EU 2015: Reynold Xin Keynote from Databricks

Databricks outlines emerging trends, common issues, and solutions:

Spark Summit EU 2015: Lessons from 300+ production users from Databricks

How do users integrate common data science tools like Python, with Spark?

Spark Summit EU 2015: Combining the Strengths of MLlib, scikit-learn, and R from Databricks

What have users learned in migrating from Data Warehouses to Spark?

Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predictive Modeling from Databricks

Databricks’ CEO discusses the impact Spark has had in the enterprise:

Spark Summit EU 2015: Revolutionizing Big Data in the Enterprise with Spark from Databricks

How do Spark clusters and R facilitate analysis of Big Data?

Enabling exploratory data science with Spark and R from Databricks

There you have it! A nice selection of Spark presentations to help you cut through all of the other information out there on the web. For more on Spark, stay tuned into Solutions Review.

Widget not in any sidebars

This article was written by Tim King on November 19, 2015

Tim King

Executive Editor

Tim is Solutions Review's Executive Editor and leads coverage on data management and analytics. A 2017 and 2018 Most Influential Business Journalist and 2021 "Who's Who" in Data Management, Tim is a recognized industry thought leader and changemaker. Story? Reach him via email at tking@solutionsreview dot com.

What the AI Impact on Data Engineering Jobs Looks Like Right Now - April 24, 2025
The 17 Best AI Agents for Data Integration to Consider in 2025 - April 22, 2025
What to Expect at Safe Software’s The Peak of Data and AI 2025 May 6-8 - April 17, 2025

Best Practices

Getting Started with Apache Spark: the Definitive Guide

Tim King

Executive Editor

Expert Insights

Latest Posts

Categories

Important Links

Useful Pages

Getting Started with Apache Spark: the Definitive Guide

Share This

Tags

Tim King

Executive Editor

Related Posts

The Holy Grail of Data Integration Is AI-Driven, Seamless & Secure

Outmaneuvering Tariffs: Navigating Disruption with Data-Driven Resilience

The Great Debate: Will AI Help or Hinder Data Engineering Roles?

Expert Insights

Latest Posts

Follow Solutions Review