Introduction to pysparkling
11:45 AM - 12:10 PM on August 16, 2015, Room 704
A native Python implementation of Spark's RDD interface.
This talk introduces pysparkling, a native Python implementation of Spark's RDD interface. Its use cases, which differ from those of PySpark, are discussed. As an example, a Flask-based API endpoint that uses pysparkling to process documents for scikit-learn classification is shown.
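To give a rough sense of what an RDD-style interface in plain Python can look like, here is a minimal sketch. It is a toy illustration only: the class name `TinyRDD` and the sample documents are hypothetical, and this is not pysparkling's implementation. It also evaluates eagerly on a single list, whereas Spark's RDDs are lazy and partitioned.

```python
# Toy illustration only: TinyRDD is a hypothetical name, not part of pysparkling.
class TinyRDD:
    """A list-backed stand-in for an RDD with a few chainable operations."""

    def __init__(self, items):
        # Materialize the input once; a real RDD would stay lazy and partitioned.
        self._items = list(items)

    def map(self, f):
        # Apply f to every element and return a new TinyRDD.
        return TinyRDD(f(x) for x in self._items)

    def filter(self, pred):
        # Keep only the elements for which pred returns a truthy value.
        return TinyRDD(x for x in self._items if pred(x))

    def flatMap(self, f):
        # Apply f to every element and flatten the resulting iterables.
        return TinyRDD(y for x in self._items for y in f(x))

    def collect(self):
        # Return the elements as a plain Python list.
        return list(self._items)


# Hypothetical sample documents, tokenized and filtered with chained calls.
docs = ["spark in python", "pure python rdd"]
words = TinyRDD(docs).flatMap(str.split).filter(lambda w: len(w) > 2)
lengths = words.map(len).collect()
```

The chained `flatMap`, `filter`, and `map` calls mirror the method names of Spark's RDD interface, which is the style of document preprocessing the abstract describes feeding into a classifier.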