| | RESOURCES SPECIFICATIONS |
 | | This corpus covers the Portuguese language as spoken in Portugal, Brazil, Angola, Mozambique, Guinea, Macao, etc. It consists of about 1.5 million words of spoken language and more than 40 million words of Portuguese text extracted from fiction, technical, scientific, journalistic, legal, and political material. |
 | | For the Dutch data, frequencies have been disambiguated on the basis of the 42.4-million-word text corpora of the Dutch Instituut voor Nederlandse Lexicologie. |
 | | Frequency of the entries: disambiguated for homographic lemmata. |
| sirio.deusto.es /abaitua/konzeptu/nlp/txt_det.htm (6028 words) |