One of my favourite websites is Digg.com, which is a site where members click in their votes for stories that they find newsworthy or interesting, or worth a look.
Being perversely fascinated with statistics: I thought I would follow Digg for a month and see what kind of stories get the most attention (votes/diggs) based on stories submitted + keywords + actual content in the url/page/site (keywords (most common word) everywhere basically). These findings (obviously) has nothing to do with story importance, it’s more a general look in to averages and words mentioned in the actual stories etc.
No, I was never a great coder, so maybe I missed something somewhere? These findings are presented “as is” (I can’t guarantee the accuracy). Hoverer, seeing the end result – it looks good to me? In no particular order, except the last entry…
- Paris Hilton + Drunk + Purse + Vomiting + Video.
- Nintendo + Wii + Rocks + Cool + Tricks.
- Iraq + War + Neat + Dying + Bush.
- Intel + Chip + Core16 + Edible + Yummie.
- Cancer + Urban + Legend + Santa + Norad.
- Stripper + Pole + Slipping + Elf + Dentures.
- PS3 + Software + Bug + Herpes + Bill Gates.
- Murder + Gross + Horrific + Wii + Youtube.
- Wii + Controler + Dog + Rectum + Lawsuit.
- PS3 + eBay + Fucking + Nuts + Beef jerky.
- Stripper + Hat + China + Hockey + Suck.
- Keyboard + Better + Cancer + Porn + WTF.
- Airline + Security + Fun + Lightsaber + Colorado.
- Mad + Cow + Ritalin + Happy + BBQ.
- Girfriend + Very + Naked + Happy + Redneck.
- Coors + Light + Flaming + Homosexual + Constitution.
- Digg + New + Features + News+Wii.
- Nintendo + Undigg + Wii + Merger + Iminent.
- Paris Hilton + Video + Nintendo + Iraq + Intel.
- Cancer + Yummie + PS3 + Bug + Stripper + China + Suck + Wii + Dog + Rectum + Airline + Coors + Digg + Merger + Iminent + Google + Big + Bugs.