A few months ago, I discovered that IMDb users can download data about the films they have rated.
I’m a great lover of all things film related, but someone who’s even more of a film geek than me is my other half who’s had an IMDb account set up for several years now. Whether he’s watched a film at the cinema or at home, he’ll always rate them on IMDb afterwards so I was excited to look at the data to see what I could discover.
As there’s a lot of data to get through, this post will be the first in a series looking at what interesting analysis can be done with user ratings data from IMDb.
First of all, I wanted to create an overview of the number of films rated.
2013 was a busy year on the ratings front, but from 2014 onwards the number of ratings settles down to a relatively consistent level.
Let’s take a look at 2014 onwards in more detail to see how the number of ratings over the course of 2016 compare to 2014 and 2015…
There are two things that caught my attention:
- 2016 got off to a good start, but it looks like the number of ratings tailed off in the second half of the year
- 2015 had a late surge in December, overtaking the total number of films rated in 2014 and 2016
Finally, I wanted to look at the number of films rated by day to see whether we tend to watch films on some days more than others.
Sunday is the most popular day for rating ( / watching) films, with Wednesday coming a close second (perhaps as a result of the Orange Wednesdays offer which has now finished).
On the other hand, we aren’t as likely to watch films on a Saturday or Tuesday (who knew!).
That’s a wrap for today folks, but in the next post in the series I’ll be looking at the distribution of ratings (Are the ratings skewed? What’s the average rating? How varied are the ratings?).
Click on the link below if you’d like to take a look at the code behind the charts in this post.
Thanks for reading this post!