Citation »
Galves, Charlotte, and Pablo Faria. (2017, December). Tycho Brahe Parsed Corpus of Historical Portuguese. URL: http://www.tycho.iel.unicamp.br/corpus/en/index.html.
Presentation »
The Tycho Brahe Parsed Corpus of Historical Portuguese is an electronic corpus of texts written in Portuguese by authors born between 1380 and 1845.
The corpus has been built within the following projects:
- Rhythmic Patterns, Parameter Setting and Language Change (1998-2003)
- Rhythmic Patterns, Parameter Setting and Language Change, Phase 2 (2004-2008)
- Portuguese in time and space: linguistic contact, grammars in competition and parametric change (since 2012)
Acknowledgments »
We are grateful to the following institutions and individuals:
- Fundação de Amparo à Pesquisa do Estado de São Paulo, FAPESP 04/03643-0, "Rhythmic Patterns, Parameter Setting and Language Change, Phase II".
- CNPq, 485999/2007-2, "Rhythmic Patterns, Prosodic Domains and Probabilistic Modelling in Portuguese Corpora".
- Anthony Kroch and Beatrice Santorini, for the inspiration and constant support.
- Fábio Kepler, for allowing us to use his part-of-speech tagger in our work.
- Dan Bikel, for allowing us to use his Penn dissertation parser in our work.
Other corpora »
Access to Texts
[ Computational tools page ]
[ Ordered Lists Catalog ]
[ Query POS files with CorpusSearch ]
Download Complete Corpus (compressed .zip files):
[ Complete Corpus, syntactic annotation ]
[ Complete Corpus, POS tagging ]
[ Complete Corpus, no annotation ]
Edition Guidelines
[ Texts Presentation ]
[ Complete Edition Manual ]
[ Syntactic and POS Tagging Annotation Manuals ]