Other corpora using the same or similar annotation schemes as the
Penn-Helsinki Corpora
Parsed corpora of historical English
The following corpora are all part of an overarching project at the
University of Pennsylvania, the University of York, and elsewhere to
produce syntactically annotated corpora for all stages of the history
of English:
- Old English (before 1100)
- Middle English (1100-1500)
- Early Modern English (1500-1700)
- Modern English (1700-1914)
Parsed corpora of other languages
- Tycho Brahe Corpus,
a parsed corpus of historical Portuguese
- Charlotte Galves (University of Campinas, Brazil) and collaborators
- Modéliser le changement: les voies du français
(Modelling change: the paths of French),
a parsed corpus of historical French
- France Martineau (University of Ottawa) and collaborators
- CORDIAL-SIN Corpus, a syntax-oriented corpus of
Portuguese dialects
- Ana Maria Martins (Centro de Linguística da Universidade de
Lisboa) and collaborators
- Icelandic Parsed Historical Corpus (IcePaHC)
- Eiríkur Rögnvaldsson (University of Iceland) and
collaborators
- Word order and word order change in Western European
languages (WOChWEL) Corpus, a growing parsed corpus of Old
Portuguese
- Ana Maria Martins and Sandra Pereira (Centro de Linguística da
Universidade de Lisboa) and collaborators
- Audio-Aligned and Parsed Corpus of Appalachian English
(AAPCAppE)
- Christina Tortora (City University of New York) and collaborators
- P.S. Post Scriptum -
A Digital Archive of Ordinary Writing (Early Modern Portugal and
Spain),
April 2017
- Rita Marquilhas (Centro de Linguística da Universidade de Lisboa)
and collaborators
- NINJAL
Parsed Corpus of Modern Japanese (NPCMJ),
July 2017.
At present, the NPCMJ contains 10,000 sentences accessible over the web
via various interfaces.
- Prashant Pardeshi (National Institute of Japanese Language
and Linguistics) and collaborators
- Oxford-NINJAL
Corpus of Old Japanese (ONCOJ),
September 2018.
The ONCOJ contains the full corpus of Old Japanese poetic texts,
amounting to 90,000 words.
- Bjarke Frellesvig (project leader, University of Oxford) and an
international committee