Entries from May 2009

May 29, 2009

I shall be telling this with a sigh somewhere ages and ages hence

Warning: If you have a vested interest in what I do after I graduate (you know who you are; if you’re not sure if this applies to you, then it doesn’t) or you don’t have the stomach for the inherently impolitic nature of job-search aporia, you should probably stop reading.

As some of you know, I’ve [...]

May 10, 2009

Fun with names

In line with a previous post I decided to have some more fun with names.
I used the census names data to generate 200 names by taking a random first name and a random last name and combining them. The first names were chosen 50/50 from the male and female lists. [...]

May 6, 2009

Word counts vs. word presence for LDA

Every now and then someone (you know who you are) asks if the feature vectors one passes into LDA should be vectors of word counts (i.e., vectors of non-negative integers) or vectors of word presence/absence (i.e., vectors of binary values). Now the former gives strictly more information so the short answer is that you [...]