Introduction to pysparkling
11:45 AM - 12:10 PM on August 16, 2015, Room 704Sven Kreiss
- Audience level:
- intermediate
- Watch:
- http://youtu.be/KWxu5xuRtwo
Description
A native Python implementation of Spark's RDD interface.
Abstract
This talk introduces pysparkling, a native Python implementation of Spark's RDD interface. The use cases, which are different from PySpark, are discussed. As an example, a Flask-based API endpoint using pysparkling to process documents for a scikit-learn classification are shown.