ryochiji's blog
Brought to you fresh from the depths of Ryo Chijiiwa

Mon, August 29, 2005

PubCrawl: Arm Chair Sociology With Flickr

So Yitz and I were hanging out, trying to decide whether we should go to my apartment and code, or go to the pub. After a while, we somehow ended up wondering which day of the week pubs were busiest, and being the geeks that we are, we decided to look for an algorithmic solution. And where would you go to collect data on pub patronage by day of week? Why, Flickr, of course!

The premise of our theory was very simple. We guessed that people are bound to take pictures when they go to the pub, then upload and tag those photos on Flickr. Sure enough, there are over 14,000 images tagged "pub" on Flickr. We decided that would give us enough data points to get at least some info.

Since we couldn't find API calls to extract the data we wanted, and we didn't have API keys anyway, we decided to do it the quick and dirty way. Yitz hacked together a Perl script to collect the IDs of all photos tagged "pub", pull the date taken from each photo's details page, and compile all the data into a single file (well, technically, we had up to 10 of these processes running in parallel). I then scratched together a PHP script to collate that data, and this is what we got:

Exhibit A: # of pictures tagged "pub," by day of week

Sun: 1857 (13.38%)
Mon: 1771 (12.76%)
Tue: 1286 (9.26%)
Wed: 1507 (10.85%)
Thu: 1735 (12.50%)
Fri: 2944 (21.21%)
Sat: 2778 (20.02%)

The list shows the total number of pictures taken, by day of week, with percentages. As it turns out, Friday seems to be the busiest day at the pub. (We decided that for this experiment, we should define a day as starting and ending at 7am. It just intuitively seemed like a good time for a trip to the pub to end; more on this later.)
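
For the curious, the 7am cutoff can be sketched in a few lines of Python. This is not the original Perl/PHP; the names `pub_day` and `tally_by_day` and the sample timestamps are made up for illustration. The idea is to shift each timestamp back seven hours before asking which weekday it falls on, so a photo taken at 2am on Saturday counts toward Friday night.

```python
from collections import Counter
from datetime import datetime, timedelta

DAY_START_HOUR = 7  # a "pub day" runs from 7am to 7am the next morning
DAY_NAMES = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]  # matches datetime.weekday()


def pub_day(taken: datetime) -> str:
    """Return the day of week a photo counts toward, using the 7am cutoff."""
    shifted = taken - timedelta(hours=DAY_START_HOUR)
    return DAY_NAMES[shifted.weekday()]


def tally_by_day(timestamps):
    """Count photos per pub day and return {day: (count, percent_of_total)}."""
    counts = Counter(pub_day(t) for t in timestamps)
    total = sum(counts.values())
    return {day: (n, 100.0 * n / total) for day, n in counts.items()}


# Hypothetical "date taken" values, not real Flickr data
sample = [
    datetime(2005, 8, 26, 23, 30),  # Friday 11:30pm
    datetime(2005, 8, 27, 2, 15),   # Saturday 2:15am, still Friday night
    datetime(2005, 8, 27, 7, 0),    # Saturday 7:00am, the new day starts
    datetime(2005, 8, 28, 1, 0),    # Sunday 1:00am, still Saturday night
]

for day, (count, percent) in tally_by_day(sample).items():
    print(f"{day}: {count} ({percent:.2f}%)")
```

With this sample, two photos land on Friday and two on Saturday, even though only one of them was taken on a calendar Friday.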

Encouraged by these results, we decided to do the same with photos tagged "bar":

Exhibit B: # of pictures tagged "bar," by day of week

Sun: 2917 (11.14%) [896]
Mon: 2803 (10.70%) [795]
Tue: 2949 (11.26%) [780]
Wed: 2690 (10.27%) [840]
Thu: 4196 (16.02%) [941]
Fri: 5101 (19.47%) [1185]
Sat: 5540 (21.15%) [1301]

(Square brackets show number of unique users who posted images.)
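
Here is a rough Python sketch of how the bracketed numbers could be computed, assuming each photo has been reduced to a (day, user) pair. The function name `photos_and_users_by_day` and the sample records are hypothetical, and this is not the script we actually ran.

```python
from collections import defaultdict


def photos_and_users_by_day(records):
    """Given (day, user_id) pairs, one per photo, return
    {day: (photo_count, unique_user_count)}."""
    photos = defaultdict(int)
    users = defaultdict(set)
    for day, user in records:
        photos[day] += 1
        users[day].add(user)  # a set keeps each user once per day
    return {day: (photos[day], len(users[day])) for day in photos}


# Hypothetical records, not real Flickr data
records = [
    ("Fri", "alice"),
    ("Fri", "alice"),
    ("Fri", "bob"),
    ("Sat", "alice"),
]

for day, (n_photos, n_users) in photos_and_users_by_day(records).items():
    print(f"{day}: {n_photos} [{n_users}]")
```

Counted this way, a user who posts on several different days shows up in each of those days' brackets, so the bracketed numbers would not add up to the total number of distinct users.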

Since no experiment can be considered legitimate without pretty graphs, we passed our data to GNUPlot:

Exhibit C: # of photos tagged "bar" and "pub" by hour

Here are some observations we made:

  • The "pub" and "bar" graphs are strikingly similar
  • Friday and Saturday around 11pm-midnight seem busiest
  • Everybody goes home at 7am (recall that we defined a day as starting and ending at 7am; the ticks are shown at 7am, which also happen to be the lowest points)
  • Have a case of the Mondays? Hit the pubs and bars like everybody else!
  • People don't seem to need extra booze to get over "hump days"
  • Tuesday might as well be called "Sober Day".
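
The hourly binning behind a graph like Exhibit C can be sketched as follows. This is a minimal Python illustration, not the code that fed GNUPlot: `hour_of_week` and `hourly_histogram` are made-up names, and starting the week at Sunday 7am is an assumption based on the order of the lists above.

```python
from datetime import datetime, timedelta

HOURS_PER_WEEK = 7 * 24


def hour_of_week(taken: datetime, day_start_hour: int = 7) -> int:
    """Hours elapsed since the most recent Sunday 7am, in the range 0..167."""
    shifted = taken - timedelta(hours=day_start_hour)
    # datetime.weekday() gives Monday=0 ... Sunday=6; rotate so Sunday=0
    day_index = (shifted.weekday() + 1) % 7
    return day_index * 24 + shifted.hour


def hourly_histogram(timestamps):
    """Return a list of 168 photo counts, one per hour of the week."""
    bins = [0] * HOURS_PER_WEEK
    for t in timestamps:
        bins[hour_of_week(t)] += 1
    return bins


# Hypothetical timestamps, not real Flickr data
bins = hourly_histogram([
    datetime(2005, 8, 28, 7, 0),    # Sunday 7:00am, first bin of the week
    datetime(2005, 8, 26, 23, 30),  # Friday 11:30pm
    datetime(2005, 8, 28, 6, 59),   # Sunday 6:59am, last bin of the week
])
```

Writing each bin index and count to a two-column text file would give a plotting tool something to draw, with a day boundary every 24 bins.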

It was close to 2am when we got this far... but there was one more thing we wanted to know. How does all this correlate to when people are the most drunk? Well, check out what we found:

Exhibit D: Pub/bar-crawling vs. drunkenness

There's a very clear phase shift... and as it turns out, drunkenness peaks at 7am, when everyone has gone home from the pubs and bars!

Conclusions
It's past 3:20am, and I really should go to bed. Besides, everybody knows conclusions are complete BS anyway. So let's just skip it, and let me go to bed...

Also see: Yitz's LJ post




dirvish

What's with the spike of drunkenness Monday morning?



Seriously...

That's a huge spike... How is the 'date taken' info stored at Flickr? Could we be seeing delayed uploads (from work, maybe?) of weekend hijinks?

On a personal note, I seem to find myself at the pub on Sundays and Tuesdays, mostly. Plenty of open space to enjoy a pint of Guinness and some quiet contemplation.

Nice mini project. Bonus points for not just asking a regular bar-hopper and doing it the cool way instead.




Wow

You've seriously over-engineered going to the bar and looking around.



observations

"People don't seem to need extra booze to get over "hump days" "

maybe they just don't take their camera to the pub on a monday night.. but do when 'larging it' on a friday or saturday




Some similar work...

Hi, I did something similar a couple months ago, except I used the photos themselves as datapoints.

Here's a time-mapping of "breakfast, lunch, dinner":

http://www.flickr.com/photos/krazydad/5041585/in/set-140323/

And here are sunsets:

http://www.flickr.com/photos/krazydad/4992355/in/set-140323/

Enjoy,

- jbum





Trackback: Geeks Get Their Customer Intuition On

From: http://betuitive.blogs.com/beconnected/2005/08/geeks_get_their.html




drunkenness?

what was your measure of drunkenness?




>you're monitoring the time people posted pictures

We used the date/time in the "time taken" field, so hopefully that's what we got. But it's entirely possible that that information wasn't available for some photos.

>what was your measure of drunkenness?

We looked at photos tagged "drunk".




187%

The #'s add up to ~187% in exhibit B. What gives?




>The #'s add up to ~187% in exhibit B. What gives?

Thanks, fixed.
