mersenneforum.org Official "New York Times Deathwatch" thread
2007-06-01, 17:29 #1 ewmayer ∂²ω=0 Sep 2002 República de California 3²×1,303 Posts

Official "New York Times Deathwatch" thread

"If it bleeds, it leads"

This thread will keep a running tally of how many times the word "kill" appears on the New York Times online world headlines page. I'm not often online on weekends, so a weekend editor would be very much appreciated, so we don't miss any days.

June 2007:
------------------------------
Fri, 01 Jun: 5 times: 1, 2, 3, 4, 5
Sat, 02 Jun: No data
Sun, 03 Jun: 7 times: 1, 2, 3, 4, 5, 6, 7
Mon, 04 Jun: 7 times: 1, 2, 3, 4, 5, 6, 7
Tue, 05 Jun: 2 times: 1, 2
Wed, 06 Jun: 4 times: 1, 2, 3, 4
Thu, 07 Jun: No data
Fri, 08 Jun: 6 times: 1, 2, 3, 4, 5, 6
Sat, 09 Jun: No data
Sun, 10 Jun: No data
Mon, 11 Jun: 7 times: 1, 2, 3, 4, 5, 6, 7
Tue, 12 Jun: 3 times: 1, 2, 3
Wed, 13 Jun: 2 times: 1, 2
Thu, 14 Jun: 3 times: 1, 2, 3
Fri, 15 Jun: 3 times: 1, 2, 3
Sat, 16 Jun: No data
Sun, 17 Jun: No data
Mon, 18 Jun: 3 times: 1, 2, 3
Tue, 19 Jun: 5 times: 1, 2, 3, 4, 5
Wed, 20 Jun: 6 times: 1, 2, 3, 4, 5, 6
Thu, 21 Jun: 6 times: 1, 2, 3, 4, 5, 6
Fri, 22 Jun: 5 times: 1, 2, 3, 4, 5
Sat, 23 Jun: No data
Sun, 24 Jun: No data
Mon, 25 Jun: 6 times: 1, 2, 3, 4, 5, 6
Tue, 26 Jun: 7 times: 1, 2, 3, 4, 5, 6, 7
Wed, 27 Jun: 4 times: 1, 2, 3, 4
Thu, 28 Jun: 3 times: 1, 2, 3
Fri, 29 Jun: 6 times: 1, 2, 3, 4, 5, 6

Last fiddled with by ewmayer on 2007-08-01 at 17:09
2007-06-01, 18:21 #2 BlisteringSheep Oct 2006 On a Suzuki Boulevard C90 2·3·41 Posts

Are you excluding the "News from AP & Reuters" box: 6. They do appear to be transient.
2007-06-01, 18:47   #3
ewmayer
∂²ω=0

Sep 2002
República de California

3²·1,303 Posts

Quote:
 Originally Posted by BlisteringSheep Are you excluding the "News from AP & Reuters" box: 6. They do appear to be transient.
Thanks - that link wasn't there when I took my snapshot. I figure a once-a-day capture is enough for my purposes, so 5 remains the official count for today.

Also not looking for similar death-and-violence-related words like die, slaughter, murder, assassinate, butcher, genocide, execute, maim, blind, torture, dead, body, corpse, mutilate, brutalize, etc. -- otherwise I could spend half of every day hunting the page.

2007-06-29, 16:32 #4 ewmayer ∂²ω=0 Sep 2002 República de California 10110111001111₂ Posts

July 2007
Sun, 01 Jul: No data
Mon, 02 Jul: 5 times: 1, 2, 3, 4, 5
Tue, 03 Jul: 8 times: 1, 2, 3, 4, 5, 6, 7, 8
Wed, 04 Jul: No data
Thu, 05 Jul: 8 times: 1, 2, 3, 4, 5, 6, 7, 8
Fri, 06 Jul: 7 times: 1, 2, 3, 4, 5, 6, 7
Sat, 07 Jul: No data
Sun, 08 Jul: No data
Mon, 09 Jul: 4 times: 1, 2, 3, 4
Tue, 10 Jul: 2 times: 1, 2
Wed, 11 Jul: 7 times: 1, 2, 3, 4, 5, 6, 7
Thu, 12 Jul: 8 times: 1, 2, 3, 4, 5, 6, 7, 8
Sat, 14 Jul: 9 times: 1, 2, 3, 4, 5, 6, 7, 8, 9
Sun, 15 Jul: No data
Mon, 16 Jul: 9 times: 1, 2, 3, 4, 5, 6, 7, 8, 9
Tue, 17 Jul: 3 times: 1, 2, 3
Wed, 18 Jul: No data
Thu, 19 Jul: 13 times: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
Fri, 20 Jul: 2 times: 1, 2
Sat, 21 Jul: No data
Sun, 22 Jul: No data
Mon, 23 Jul: 6 times: 1, 2, 3, 4, 5, 6
Tue, 24 Jul: 4 times: 1, 2, 3, 4
Wed, 25 Jul: 2 times: 1, 2
Thu, 26 Jul: No data
Fri, 27 Jul: 4 times: 1, 2, 3, 4
Sat, 28 Jul: No data
Sun, 29 Jul: No data
Mon, 30 Jul: 3 times: 1, 2, 3
Tue, 31 Jul: 2 times: 1, 2

Last fiddled with by ewmayer on 2007-08-01 at 17:10
2007-08-01, 17:08 #5 ewmayer ∂²ω=0 Sep 2002 República de California 2DCF₁₆ Posts

August 2007
Wed, 01 Aug: 4 times: 1, 2, 3, 4
Thu, 02 Aug: 6 times: 1, 2, 3, 4, 5, 6
Fri, 03 Aug: 4 times: 1, 2, 3, 4
Sat, 04 Aug: No data
Sun, 05 Aug: No data
Mon, 06 Aug: 2 times: 1, 2
Tue, 07 Aug: 0 times [A first since thread was begun - still had plenty of dies(4), dead(4), hurt(1), murder(1), though]
Wed, 08 Aug: 5 times: 1, 2, 3, 4, 5
Thu, 09 Aug: 6 times: 1, 2, 3, 4, 5, 6
Fri, 10 Aug: 2 times: 1, 2
Sat, 11 Aug: No data
Sun, 12 Aug: No data
Mon, 13 Aug: ? times: 1, 2, 3, 4, 5, 6
Tue, 14 Aug: 4 times: 1, 2, 3, 4
Wed, 15 Aug: 3 times: 1, 2, 3
Thu, 16 Aug: 4 times: 1, 2, 3, 4
Fri, 17 Aug: 4 times: 1, 2, 3, 4
Sat, 18 Aug: No data
Sun, 19 Aug: No data
Mon, 20 Aug: 3 times: 1, 2, 3
Tue, 21 Aug: 1 times: 1
Wed, 22 Aug: No data
Thu, 23 Aug: 1 times: 1
Sat, 25 Aug: 12 times: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
Sun, 26 Aug: 5 times: 1, 2, 3, 4, 5
Mon, 27 Aug: 8 times: 1, 2, 3, 4, 5, 6, 7, 8
Tue, 28 Aug: 6 times: 1, 2, 3, 4, 5, 6
Wed, 29 Aug: 2 times: 1, 2
Thu, 30 Aug: 4 times: 1, 2, 3, 4
Fri, 31 Aug: 4 times: 1, 2, 3, 4

Last fiddled with by ewmayer on 2007-09-01 at 21:59
2007-09-01, 22:08 #6 ewmayer ∂²ω=0 Sep 2002 República de California 3²×1,303 Posts

September 2007
Sat, 01 Sep: 3 times: 1, 2, 3
Sun, 02 Sep: 3 times: 1, 2, 3
Mon, 03 Sep: No data
Tue, 04 Sep: 4 times: 1, 2, 3, 4
Wed, 05 Sep: 5 times: 1, 2, 3, 4, 5
Thu, 06 Sep: 5 times: 1, 2, 3, 4, 5
Fri, 07 Sep: 3 times: 1, 2, 3
Sat, 08 Sep: 6 times: 1, 2, 3, 4, 5, 6
Sun, 09 Sep: No data
Mon, 10 Sep: 5 times: 1, 2, 3, 4, 5
Tue, 11 Sep: 4 times: 1, 2, 3, 4
Wed, 12 Sep: 0 times
Thu, 13 Sep: 7 times: 1, 2, 3, 4, 5, 6, 7
Fri, 14 Sep: 7 times: 1, 2, 3, 4, 5, 6, 7
Sat, 15 Sep: No data
Sun, 16 Sep: No data
Mon, 17 Sep: 4 times: 1, 2, 3, 4
Tue, 18 Sep: 4 times: 1, 2, 3, 4
Wed, 19 Sep: 4 times: 1, 2, 3, 4
Thu, 20 Sep: 6 times: 1, 2, 3, 4, 5, 6
Fri, 21 Sep: 3 times: 1, 2, 3
Sat, 22 Sep: No data
Sun, 23 Sep: No data
Mon, 24 Sep: 4 times: 1, 2, 3, 4
Tue, 25 Sep: 6 times: 1, 2, 3, 4, 5, 6
Thu, 27 Sep: 4 times: 1, 2, 3, 4
Fri, 28 Sep: 1 times: 1
Sat, 29 Sep: 2 times: 1, 2
Sun, 30 Sep: No data

Last fiddled with by ewmayer on 2007-10-01 at 16:42
2007-10-01, 16:45 #7 ewmayer ∂²ω=0 Sep 2002 República de California 11727₁₀ Posts

October 2007
Mon, 01 Oct: 8 times: 1, 2, 3, 4, 5, 6, 7, 8
Tue, 02 Oct: 5 times: 1, 2, 3, 4, 5
Wed, 03 Oct: 3 times: 1, 2, 3
Thu, 04 Oct: 6 times: 1, 2, 3, 4, 5, 6
Fri, 05 Oct: 7 times: 1, 2, 3, 4, 5, 6, 7
Sat, 06 Oct: No data
Sun, 07 Oct: No data
Mon, 08 Oct: 9 times: 1, 2, 3, 4, 5, 6, 7, 8, 9
Tue, 09 Oct: 9 times: 1, 2, 3, 4, 5, 6, 7, 8, 9
Wed, 10 Oct: 7 times: 1, 2, 3, 4, 5, 6, 7
Thu, 11 Oct: 6 times: 1, 2, 3, 4, 5, 6
Fri, 12 Oct: No data
Sat, 13 Oct: No data
Sun, 14 Oct: No data
Mon, 15 Oct: 3 times: 1, 2, 3
Tue, 16 Oct: 0 times
Wed, 17 Oct: 3 times: 1, 2, 3
Thu, 18 Oct: 1 times: 1
Fri, 19 Oct: 5 times: 1, 2, 3, 4, 5
Sat, 20 Oct: No data
Sun, 21 Oct: No data
Mon, 22 Oct: 4 times: 1, 2, 3, 4
Tue, 23 Oct: 4 times: 1, 2, 3, 4
Wed, 24 Oct: 5 times: 1, 2, 3, 4, 5
Thu, 25 Oct: 5 times: 1, 2, 3, 4, 5
Fri, 26 Oct: 4 times: 1, 2, 3, 4
Sat, 27 Oct: No data
Sun, 28 Oct: No data
Mon, 29 Oct: 4 times: 1, 2, 3, 4
Tue, 30 Oct: 8 times: 1, 2, 3, 4, 5, 6, 7, 8
Wed, 31 Oct: 7 times: 1, 2, 3, 4, 5, 6, 7

Last fiddled with by ewmayer on 2007-11-02 at 16:33
2007-10-02, 21:57   #8
Uncwilly
6809 > 6502

Aug 2003
101×103 Posts

17·619 Posts

Quote:
 Originally Posted by ewmayer Also not looking for similar death-and-violence-related words like die, slaughter, murder, assassinate, butcher, genocide, execute, maim, blind, torture, dead, body, corpse, mutilate, brutalize, etc. -- otherwise I could spend half of every day hunting the page.
My understanding is that Python could fetch the page for you and run the analysis.

I was trying to convince a co-worker that he should set up a homemade automated stock trading system, using Python to do the work.

2007-10-02, 22:20   #9
ewmayer
∂²ω=0

Sep 2002
República de California

3²·1,303 Posts

Quote:
 Originally Posted by Uncwilly My understanding is that Python could fetch the page for you and run the analysis. I was trying to convince a co-worker that he should set-up a homemade auto stock trading system using Python to do the work.
I'm not a scripting guru - but if someone wants to write a .py script to auto-search the above page for a list of keywords, I'd be happy to add that total to each day's link-by-link entries for the narrower "kill".

I'm not sure what if anything this little experiment is tracking, but let's gather data first, and formulate hypotheses later.

2007-10-03, 00:21 #10 Xyzzy Aug 2002 10000100000011₂ Posts

Code:
wget -q -O - 'http://nytimes.com/pages/world/index.html' | tr ' ' '\n' | grep -i kill | wc -l

This catches "kill", "killed", "killing" and any other word that contains "kill". If you want it more exclusive:

Code:
wget -q -O - 'http://nytimes.com/pages/world/index.html' | tr ' ' '\n' | grep -i '^kill$' | wc -l

Edit: Theoretically, "kill" could occur before a period, or with a comma after it, or it could be next to a "<" or a ">", but we can use "tr" to turn all those into spaces if we need to. First we have to see what edge case scenarios pop up.
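One possible way to cover the punctuation edge cases mentioned above is to split on every non-letter instead of only on spaces. The following is a minimal sketch, not a tested replacement for the pipeline above; the function name count_word is made up for illustration, and it reads the page from standard input so it can be tried on a saved snapshot.

```shell
#!/bin/sh
# count_word WORD: read text on stdin and print the number of whole-word,
# case-insensitive occurrences of WORD.
# (count_word is a hypothetical helper name, not part of any existing tool.)
count_word() {
    # tr -cs turns every run of non-letters into a single newline, so
    # "kill.", "kill," and "<b>kill</b>" each leave a bare "kill" line.
    # grep -F -x matches only lines that are exactly WORD; -c counts them.
    tr -cs 'A-Za-z' '\n' | grep -F -i -x -c -- "$1"
}

# Possible usage against the page from the thread (needs network access):
#   wget -q -O - 'http://nytimes.com/pages/world/index.html' | count_word kill
```

Because the match is on the whole word, "killed" and "Skilling" are not counted here; note also that grep exits with a non-zero status when the count is 0.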
2007-10-03, 17:56 #11 ewmayer ∂²ω=0 Sep 2002 República de California 2DCF₁₆ Posts

Nice - thanks, o Master Yoda of *nix scripting. I can use the 'kill' version as a checksum on my manual link-by-link count - but is there a way to get it to print the entire word in which each match was found? A few months ago, when Mr. Skilling of Enron infamy was in the news, this would have produced many false hits.

This will also allow me to quickly search for a whole slew of violence-related keywords and add those as a sort of "daily horror total", which should be a more reliable indicator of the overall NYT "gore quotient" than any single word. So now it's time to start putting together the master keyword list - I've made a start; additional suggestions are requested. In cases where multiple related words share the same sequence of starting letters [and no non-related words contain same], I've used just the common substring, e.g. "insurgent" and "insurgency" get collapsed to just "insurge". I've indicated such with a "substring[foo,bar]" notation, and don't even bother to explicitly list -s pluralizations and -d past-tenses, which are implied:

abduct[or,ee,ion]
assassin[ate]
assault
attack[er]
beating ["beat" by itself is too broad]
blast
blind
bodies
body
bomb[ing,er]
brawl[ing]
brutal[ize,ity]
butcher
casualt[y,ies]
corpse
crash
crush
danger[ous,en-]
dead[ly]
death
decapitat[e,ion]
destroy
destruction
die
dispute
dying
execut[e,ion]
explo[de,sion,sive]
fatal
fear
fight[er,ing]
fire
flee
feud
genocid[e,al]
ghastly
holocaust
horrendous
horrif[ic,y]
horror
hostage
hunt[er,ing]
hurt
inferno
injur[y,ies,ed]
insurge[nt,ncy]
insurrection
kidnap
kill[er,ing]
lynching [specifically added -ing because "Lynch" is a common name - "lynch mob" can be gotten via "mob"]
maim
massacre
milit[ia,ant,ancy]
missile
mob[ster,bing]
murder[er,ing,ous]
mutilat[e,ion]
prison
rage
rape
raping
rapist
rebel[led,lion,lious]
refugee
riot[ous,ing,er]
ruthless
slaughter
strangle
suicide
target
terror[ist,ize,ism]
torture
trap
victim[ize]
violen[t,ce]
weapon
wound

Mike, is there a way to make the "grep" match "any of {list of words}", or do I just have to run the above for each keyword separately? That would save a lot of time - I'd only need to access the webpage once, not once for each keyword.

Last fiddled with by ewmayer on 2007-10-03 at 18:47 Reason: Should've just read today's NYT frontpage before posting the above list: added: fire,fatal,wound,fear,dispute,missile,target,feud,danger
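Both questions above (printing the whole word behind each hit, and matching any of a list of keywords in one pass) could plausibly be handled with an extended-regex alternation in grep. This is a rough sketch under stated assumptions: the function names matched_words and tally_words are invented for illustration, and KEYWORDS holds only a short stand-in for the full list. The text is read from standard input, so the page only has to be fetched once.

```shell
#!/bin/sh
# Short stand-in for the master keyword list; substrings are joined with "|"
# so grep -E treats them as "any of these".
KEYWORDS='kill|murder|bomb|insurge'

# matched_words: read text on stdin and print, one per line and lowercased,
# every whole word that contains any of the keywords.
# (matched_words and tally_words are hypothetical helper names.)
matched_words() {
    # Split on non-letters so each line is exactly one word; grep then
    # prints the full line, i.e. the entire word in which the match occurred.
    tr -cs 'A-Za-z' '\n' | tr 'A-Z' 'a-z' | grep -E -- "$KEYWORDS"
}

# tally_words: count how often each matching word occurs, most frequent first.
tally_words() {
    matched_words | sort | uniq -c | sort -rn
}

# Possible usage (needs network access); the page is downloaded only once:
#   wget -q -O - 'http://nytimes.com/pages/world/index.html' > world.html
#   tally_words < world.html
```

Since the full word is printed, false hits such as "skilling" show up in the tally and can be discounted by eye. For a long list, keeping one pattern per line in a file and passing it with grep -f would be an alternative to the inline alternation.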

