Scalable Reading

dedicated to DATA: digitally assisted text analysis

...the broad circumference
Hung on his shoulders like the Moon, whose Orb
Through Optic Glass the Tuscan Artist views
At Ev’ning from the top of Fesole,
Or in Valdarno, to descry new Lands,
Rivers or Mountains in her spotty Globe.
(Paradise Lost, 1. 286-91)

Monthly archive September, 2012

Google maps and crowdsourcing

David Pogue has an article on his New York Times blog about what makes Google maps so good. It’s a story of incremental and iterative improvement over years, combining sophisticated algorithms with a lot of manual work. There are definite lessons here for improving the TCP texts and similar corpora in the same incremental and iterative way.

EEBO-TCP 2012: The future of the TCP as a public domain and collaboratively curated corpus of Early Modern English

“Revolutionizing Early Modern Studies?” was the question that governed the recent EEBO-TCP 2012 conference sponsored by the Bodleian Library. I gave a talk there titled “Towards a Book of English: A linguistically annotated corpus of the EEBO-TCP texts.” In another blog I will write about the ways in which this project will keep Phil Burns and...