Natural language processing of new jokes from 2015

This is a brief update to a previous post: “Python, natural language processing and predicting funny”. In that post I carried out some basic natural language processing with Python to predict whether or not a joke is funny. In this post I just update that with some more data from this year’s Edinburgh Fringe festival.

Take a look at the ipython notebook which shows graphics and outputs of all the jokes. Interestingly this year’s winning joke is not deemed funny by the basic model :) but overall was 60% right this year (which is pretty good compared to last year).

Here is a summary plot of the classifiers for different thresholds of ‘funny’:

The corresponding plot this year (with the new data):

Take a look at the notebook file and by all means grab the csv file to play (but do let me know how you get on :)).