Buckets of Rain

Back to job at hand...

It is never perfect

UMMA opened recently. I'm not a big fan of museums, my legs tire before my mind; but art museums excite me as an exception. I don't want to explain my experience inside the museum (someone else is already doing that). Instead I want to point out this painting - "The Dead Soldier by Joseph Wright" that simply stood out from everything. The painting has an extremely evocative story that was conveyed from the moment you see the painting. I don't want to be rude to other UMMA exhibits, but this one was simply amazing. Definitely check it out.

Once I hit the Processing land, I was daunted by task of collecting and displaying the movie posters. I was also having problems with the Netflix API. Since a Netflix users data is not public, apps have to authenticate (using oAuth mechanism) to retrieve user's data. I was making slow progress so I put shelved the idea to use movie posters and use just the Netflix dataset. So to refresh - the Netflix dataset has user ratings with the date of rating, rating given and movie name; no other data is available. But the sheer magnitude of the data makes it very interesting.

I overlooking was an salient variable in the Netflix dataset - how different is a user's rating from other users' ratings. Example, bob might rate Taxi Driver at 3. But the mean rating of all users might be 4.6. That is a signal of bob's uniqueness. (I don't know how someone can rate Taxi Driver less than 5).

So I went back to the drawing board and tried to come up with a more feasible visualization. I came up with the visualization below, where each movie is represented by a flower (sort of). The flower is ordered from left to right based on time of rating. I encoded the difference between the user's rating of a movie and the mean rating for the movie using a curve. The width of the curve indicates the difference in rating, the direction of the curve (left or right) indicates if the user rated higher or lower than the mean rating. So if all your flowers are leaning to the right then you rate higher than other users.

Now, if someone asks your movie taste, you can simply say "The answer my friend is blowing in the wind"