ACCURAT corpus of Wikipedia texts

Author: 
Tilde
Description: 
The corpus contains comparable texts from Wikipedia for 12 language pairs: English-Croatian, English- Greek, English-Estonian, English-Latvian, English-Lithuanian, English-Romanian, English-Slovenian, Greek-Romanian, Latvian-Lithuanian, Romanian-German, Romanian-Lithuanian and German-English.
Resource type: 
corpus
Resource availability: 
free
Tags: 
Modality: 
text
Format: 
Size: 
Romanian (58 622 Texts) - English (58 622 Texts), Estonian (20 621 Texts) - English (20 621 Texts), Lithuanian (2,209 Texts) - Romanian (2,209 Texts), Croatian (22 137 Texts) - English (22 137 Texts), Greek (4,230 Texts) - English (4 230 Texts), Latvian (
Production date: 
06/30/2012
Domain: