© 2009 France Martineau

Structure of the MCVF Corpus

The MCVF (Modéliser le changement : les voies du français) corpus is organized along three main dimensions: dialectal, social, and historical. It consists mostly of unabridged texts and covers four historical periods:

I. Old French
II. Middle French
III. Sixteenth-century French
IV. Classical French (France and New France; 17th and 18th centuries)


Text encoding

The texts are encoded in XML following the Text Encoding Initiative (TEI) standard. Our transcription and TEI-encoding protocol can be freely consulted to help standardize formats. The tag set has been supplemented with historical descriptors that specify the social profile of the writer and the philological context of the text. The texts are then annotated with morphological and syntactic tags.
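To illustrate what working with a TEI-encoded, morphologically tagged text can look like, here is a minimal Python sketch. It is not the MCVF protocol: the sample document, its title and author, and the use of a `pos` attribute on `<w>` elements are assumptions made for the example, and the sample is pared down rather than schema-valid. Only the TEI namespace and the standard-library `xml.etree.ElementTree` calls are real.

```python
import xml.etree.ElementTree as ET

# Official TEI namespace; every TEI element lives in it.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical sample, NOT an actual MCVF text or the project's real tag set.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title>Sample letter</title>
        <author>Unknown writer</author>
      </titleStmt>
    </fileDesc>
  </teiHeader>
  <text>
    <body>
      <p><w pos="DET">la</w> <w pos="NOM">ville</w></p>
    </body>
  </text>
</TEI>"""


def read_tei(xml_string):
    """Return the title, the author, and (word, tag) pairs from a TEI string."""
    root = ET.fromstring(xml_string)
    title = root.findtext(".//tei:titleStmt/tei:title", namespaces=TEI_NS)
    author = root.findtext(".//tei:titleStmt/tei:author", namespaces=TEI_NS)
    # Assumption: each word is a <w> element carrying its tag in a "pos" attribute.
    tokens = [(w.text, w.get("pos")) for w in root.iterfind(".//tei:w", TEI_NS)]
    return {"title": title, "author": author, "tokens": tokens}


record = read_tei(SAMPLE)
print(record["title"], record["author"], record["tokens"])
```

A real corpus file would carry a much richer header, including the historical descriptors mentioned above, so the paths used here would need to be adapted to the published encoding protocol.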


Searchable MCVF corpus

The corpus may be searched by anyone who completes the access form. The texts of the MCVF Corpus are not accessible in full-text format, but users can search them and obtain results in the form of a concordance or a tree structure (depending on the search engine) for different query types.
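For readers unfamiliar with concordance output, the following Python sketch shows the general idea of a keyword-in-context (KWIC) listing. It is an illustration only, not the search engine used for the MCVF corpus; the function name `kwic`, its `width` parameter, and the sample phrase are made up for the example.

```python
def kwic(tokens, keyword, width=3):
    """Return (left context, match, right context) for each hit of keyword."""
    hits = []
    for i, tok in enumerate(tokens):
        if tok.lower() == keyword.lower():
            # Up to `width` tokens on each side of the match.
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tok, right))
    return hits


# Invented sample phrase, not taken from the corpus.
sample = "li rois est venuz et li rois parole".split()
hits = kwic(sample, "rois", width=2)
for left, match, right in hits:
    print(f"{left:>10} [{match}] {right}")
```

Each hit is shown with a few words of context on either side, which is how a concordance lets users inspect search results without access to the full text.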

MCVF corpus, XML-TEI format
MCVF corpus with morphosyntactic labels
Les Canadas vus par les Canadiens corpus (accessible without access form)

© University of Ottawa