Shakespearean n-grams

  The following is about  86 pairwise combinations of EMD plays that include Shakespeare on one side and sit 2.5 standard deviations above the average for all pairwise combinations. This is a crude and arbitrary cut-off and includes  less than 1% of some 11,000...

The Top Fifty n-gram heavy play links

This blog entry continues the entry on “Authors are trumps” and looks at the top fifty play links, which score at the 99.9th percentile of shared n-grams.  What can we learn from this list without actually looking at the plays? Or, if we think about it as...

Authors are trumps

What do repeated phrases or n-grams tell us about how distant from or close to each other pairs of early modern plays are?  Do n-grams provide  dependable measures of distance, and can we learn from them about the weight of various factors that differentiate between...

Shakespeare’s dislegomena

Shakespeare’s dislegomena are lemmata that occur in only two of his plays. I use ‘dislegomenon’ in a specialized sense to refer to document rather than collection frequency.  For instance, the lemma ‘Laertes’ occurs once in Titus...