Finding a needle in a haystack: Using annotated corpora for linguistic research

Anthony Kroch and Beatrice Santorini
University of Pennsylvania

October 11, 2007

Structurally annotated corpora are becoming ever more important in empirical linguistic research, and their use is particularly widespread in historical syntax. In this workshop, we will offer a hands-on introduction to searching syntactically annotated (parsed) corpora. We will introduce participants to the nuts and bolts of using CorpusSearch (Randall 2000-2007), a program developed at Penn to search parsed corpora. We will also guide participants through a mini-investigation of one of the major syntactic changes in the history of English, to illustrate the sorts of results that can be obtained with appropriately annotated databases.