Apache Spark Serving: Unifying Batch, Streaming, and RESTful Serving | Spark Summit Europe 2018

We present Spark Serving, a new Spark computing mode that enables users to deploy any Spark computation as a web service with sub-millisecond latency, backed by any Spark cluster. Attendees will explore the architecture of Spark Serving and discover how to deploy services on a variety of cluster types, such as Azure Databricks, Kubernetes, and Spark Standalone. We will also demonstrate its simple yet powerful API for deploying SparkSQL, SparkML, and deep networks as RESTful services, which is the same API used for batch and streaming workloads. In addition, we will explore the “dual architecture”: HTTP on Spark. This architecture converts any Spark cluster into a distributed web client with the familiar, pipelinable SparkML API. Together, these two contributions provide the fundamental Spark communication primitives needed to integrate and deploy any computation framework into the Spark ecosystem. We will explore how Microsoft has used this work to leverage Spark as a fault-tolerant microservice orchestration engine in addition to an ETL and ML platform. We will also walk through two examples drawn from Microsoft’s ongoing work: Cognitive Service composition and unsupervised object detection for snow leopard recognition.

Date: May 6, 2019

- Mark Hamilton, Software Engineer
Research Area
- Artificial intelligence
Project
- SynapseML

Watch Next

- Large Scale Intelligent Microservices - IEEE Big Data 2020 Paper Presentation
  December 10, 2020
  Speakers: Mark Hamilton
- The Azure Cognitive Services on Spark: Clusters with Embedded Intelligent Services | Spark Summit Europe 2018
  May 6, 2019
  Speakers: Mark Hamilton
- Unsupervised Object Detection Using the Azure Cognitive Services on Spark | Spark Summit Europe 2018
  October 10, 2018
  Speakers: Mark Hamilton, Anand Raman
- Deep Reality Simulation for Automated Poacher Detection | Spark Summit Europe 2018
  October 10, 2018
  Speakers: Mark Hamilton, Anand Raman
