The web speaks

Mar 04, 2009 01:41

Here's the 'best' of the first half-dozen 'poems' from my new program: it babbles at random according to a Markov model built from the most common bigrams in Google's whole-web corpus, constrained to the form of a Shakespearean sonnet.

( not safe for work )
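
For anyone wondering what "babbling according to a Markov model" looks like in practice, here's a rough sketch in Python. It is not the program's actual source. The bigram counts are a made-up toy table standing in for the Google data, the syllable counter is a crude vowel-run guess, and the only "sonnet" constraint it enforces is 14 lines of 10 syllables each; rhyme and meter are left out. All the names (`BIGRAMS`, `syllables`, `next_word`, `babble_line`, `babble_poem`) are hypothetical.

```python
import random
import re

# Hypothetical toy bigram counts, made up for illustration (not Google's data).
# BIGRAMS[w1][w2] is how often w2 was seen right after w1.
BIGRAMS = {
    "the": {"web": 5, "night": 3, "light": 2},
    "web": {"speaks": 4, "of": 2},
    "speaks": {"of": 3, "the": 1},
    "of": {"the": 6, "light": 1},
    "night": {"of": 2, "speaks": 1},
    "light": {"of": 2, "speaks": 1},
}


def syllables(word):
    """Crude estimate: count runs of vowels (an assumption, not a real dictionary)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def next_word(word, rng):
    """Sample a successor with probability proportional to its bigram count."""
    followers = BIGRAMS[word]
    return rng.choices(list(followers), weights=list(followers.values()))[0]


def babble_line(start, target, rng, max_tries=1000):
    """Random-walk the chain; retry until the line has exactly `target` syllables."""
    for _ in range(max_tries):
        words, total = [start], syllables(start)
        while total < target:
            words.append(next_word(words[-1], rng))
            total += syllables(words[-1])
        if total == target:
            return words
    raise RuntimeError("no line with the requested syllable count found")


def babble_poem(n_lines=14, target=10, seed=0):
    """Generate n_lines lines, each starting from a uniformly chosen word."""
    rng = random.Random(seed)
    starts = list(BIGRAMS)
    return [" ".join(babble_line(rng.choice(starts), target, rng))
            for _ in range(n_lines)]


if __name__ == "__main__":
    print("\n".join(babble_poem()))
```

The constraint is handled by plain rejection: generate a line, throw it away if it overshoots the syllable count, and try again. A rhyme scheme could be bolted on the same way, given some pronunciation data to compare line endings.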

Comments 10

lunza March 4 2009, 12:40:59 UTC
Amusing.


dariusk March 4 2009, 13:29:52 UTC
Oh, that is amazing. Is the source up anywhere yet? (I've been working on a different sort of sonnet-generator myself.)

dariusk March 4 2009, 13:31:32 UTC
Err, I just realized you're probably accessing that 6-DVD set of data, though?

darius March 4 2009, 18:41:48 UTC
Not directly, though I might go and buy it. The data's from someone at Google who wrote a great article about some of the cool things you can do with that dataset. The article hasn't been published yet, though, so I don't know if I can say more or give out this condensation of the data.

darius March 4 2009, 18:36:49 UTC
Thanks! I guess I'll put the code up on github as it is now (an ugly mess). Different how?


gustavolacerda May 26 2009, 09:48:34 UTC
that's pretty cool!

darius May 27 2009, 00:05:10 UTC
Thanks. :)
