WikiMatrix: Bitext extraction of 135 million Wikipedia sentences
Facebook AI is sharing WikiMatrix, the largest and most complete extraction of parallel sentences across multiple languages.
Using publicly available Wikipedia articles, we extracted 135 million parallel ...
In contrast, a brute-force method of comparing the 134 million English and 51 ...