The MCVF (Modéliser le changement : les voies du français) corpus is structured along three main directions: dialectal, social, and historical. It consists for the most part of unabridged texts. The corpus covers four historical periods:
Classical French (France and New-France) (17th and 18th century)
The texts have been processed in XML format according to the standard of the Text Encoding Initiative (TEI). Our transcription and TEI-encoding protocol can be freely consulted in order to standardize formats. The tag set has been supplemented with historical descriptors allowing us to specify the social profile of the writer and the philological context of the text. Then, the texts are annotated, with morphological and syntactical tags.
Searchable MCVF corpus
The corpus may be searched by anyone who completes the access form. The texts of the MCVF Corpus are not accessible in full-text format, but users can search them and obtain results in the form of a concordance or a tree structure (depending on the search engine) for different query types.