The Mixed Corpus: Fiction texts


This subcorpus contains Estonian fiction texts, ca. 5.8 million words altogether. Most of the texts were published after 1990. More precise information about the texts in the corpus (titles, authors, words per title) can be found in this table.

NB! Most of these texts are also part of the Balanced Corpus.

The corpus is free to use for non-commercial purposes only.



The SGML files contain the entities listed in this table.

