Few approaches to extract word translations from non-parallel texts have been proposed so far. Researchers have not been encouraged to work on this topic because extracting information from non-parallel corpora is a difficult task producing poor results. Whereas for parallel texts, word translation extraction can reach about 99%, the accuracy for non-parallel texts has been around 72% up to now. The current approach, which relies on the previous extraction of bilingual pairs of lexico-syntactic templates from parallel corpora, makes a significant improvement to about 89% of words translations identified correctly.
Weitere Kapitel dieses Buchs durch Wischen aufrufen
- An Approach to Acquire Word Translations from Non-parallel Texts
Pablo Gamallo Otero
José Ramom Pichel Campos
- Springer Berlin Heidelberg