Using Python for Sarcasm Detection in Speech

01:45 PM - 02:10 PM on August 16, 2015, Room 701

Rachel Rakov

Audience level:
novice
Watch:
http://youtu.be/xJlTDy61L8s

Description

In this talk, I discuss my work using Python to create a system for sarcasm detection in speech. My goal in this task is to determine whether human intonation alone can be modeled to predict sarcastic speech. I first extracted speech samples from the titular character of MTV’s late ‘90s hit TV show “Daria”. Using crowdsourcing techniques to get the speech labeled for sarcasm, I created a corpus of speech that is annotated for sarcasm and sincerity. I used several Python toolkits to extract a number of acoustic features from this speech that are indicative of sarcasm. The first tool I used was Snack Sound Toolkit, a library for Python that does basic sound handling and analysis. I used tools in Snack to extract a baseline of basic acoustic features that have been found to be helpful in human sarcasm identification. I then used NumPy, SciPy, and NLTK to model prosodic contours, and applied these contours to the task of automatic sarcasm detection. This approach applies sequential modeling to representations of pitch and intensity curves obtained via k-means clustering. Using machine learning (specifically Weka’s SimpleLogistic (LogitBoost) classifier), this system is able to predict sarcasm with 81.57% accuracy.

Abstract

In this talk, I discuss my work using Python to create a system for sarcasm detection in speech. My goal in this task is to determine whether human intonation alone can be modeled to predict sarcastic speech. I first extracted speech samples from the titular character of MTV’s late ‘90s hit TV show “Daria”. Using crowdsourcing techniques to get the speech labeled for sarcasm, I created a corpus of speech that is annotated for sarcasm and sincerity. I used several Python toolkits to extract a number of acoustic features from this speech that are indicative of sarcasm. The first tool I used was Snack Sound Toolkit, a library for Python that does basic sound handling and analysis. I used tools in Snack to extract a baseline of basic acoustic features that have been found to be helpful in human sarcasm identification. I then used NumPy, SciPy, and NLTK to model prosodic contours, and applied these contours to the task of automatic sarcasm detection. This approach applies sequential modeling to representations of pitch and intensity curves obtained via k-means clustering. Using machine learning (specifically Weka’s SimpleLogistic (LogitBoost) classifier), this system is able to predict sarcasm with 81.57% accuracy.