Most characteristic words in pro and

Most characteristic words in pro- and anti-feminist tweets Here are, based on my analysis (which I’ll get to in a moment) clouds of the 40 words most characteristic of anti-feminist and pro-feminist tweets, respectively. anti-feminist pro-feminist Word clouds my may be only semi-quantitative but they have other virtues, like recognizability and explorability. For the purists, … Read more Most characteristic words in pro and

Visualizing 10 unusual causes of death in the CDC mortality database

Let me make two things clear right up front: The metrics I used to decide what causes of death are unusual are purely subjective, i.e. which of the thousands of causes I skimmed through caught my eye and made me go, “Huh.” It is in no way my intention to make fun of anyone’s death. … Read more Visualizing 10 unusual causes of death in the CDC mortality database

visualizing word and letter frequencies

Visualizing word and letter frequencies in Gadsby, a novel without the letter ‘e’ In 1939, Ernest Vincent Wright published the novel¬†Gadsby¬†(gee, I wonder where he came up with that name…), 58,124 words (by my count), none of which contain the letter ‘e’. Here are a few of the features of the English language Wright was … Read more visualizing word and letter frequencies

The Most Decade-Specific Words of the Past Two Centuries

This is from an analysis of Brigham Young University’s Corpus of Historical American English, sort of a way-better-curated and easier-to-search version of Google Ngram Viewer. It covers a selected corpus of English from different genres and sources from 1810 to 2009. Of course, the analysis is biased towards words at the beginning or end of the … Read more The Most Decade-Specific Words of the Past Two Centuries