Introduction to pysparkling

11:45 AM - 12:10 PM on August 16, 2015, Room 704

Sven Kreiss

Audience level: intermediate
Watch: http://youtu.be/KWxu5xuRtwo

Description

A native Python implementation of Spark's RDD interface.

Abstract

This talk introduces pysparkling, a native Python implementation of Spark's RDD interface. It discusses the use cases, which differ from those of PySpark. As an example, it shows a Flask-based API endpoint that uses pysparkling to process documents for classification with scikit-learn.
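
To give a rough idea of what an RDD-style interface in plain Python can look like, here is a minimal sketch. The class MiniRDD and the sample documents are hypothetical and written only for illustration; this is not pysparkling's code, and the real library covers far more of the interface. The method names (map, filter, flatMap, reduce, collect) follow Spark's RDD API.

```python
from functools import reduce


class MiniRDD:
    """Hypothetical, in-memory stand-in for an RDD-like object (illustration only)."""

    def __init__(self, items):
        # Materialize the input so the data can be traversed more than once.
        self._items = list(items)

    def map(self, f):
        # Apply f to every element and wrap the result in a new MiniRDD.
        return MiniRDD(f(x) for x in self._items)

    def filter(self, f):
        # Keep only the elements for which f returns a truthy value.
        return MiniRDD(x for x in self._items if f(x))

    def flatMap(self, f):
        # Apply f to every element and flatten the resulting iterables.
        return MiniRDD(y for x in self._items for y in f(x))

    def reduce(self, f):
        # Combine all elements pairwise with f.
        return reduce(f, self._items)

    def collect(self):
        # Return the elements as a plain Python list.
        return list(self._items)


# Made-up sample documents, split into words and processed with chained calls.
docs = ["spark in python", "python on one machine"]
words = MiniRDD(docs).flatMap(str.split)
long_words = words.filter(lambda w: len(w) > 3).map(str.upper).collect()
total_chars = words.map(len).reduce(lambda a, b: a + b)
```

Because every transformation returns a new object with the same methods, calls chain in the same way as on a Spark RDD, while everything runs in the local Python process.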