New neural machine translation architectures are experimenting with pairs of neu...

rancur · on Jan 15, 2015

> Also, pairs of languages for which there is a large amount of parallel training data will still be favored.

Wouldn't Bible translations help?

ogrisel · on Jan 15, 2015

It might, but:

- the vocabulary and topics covered in the Bible are quite different from today's written and spoken text, especially phone conversations or social network messages.

- other aligned corpora such as http://www.statmt.org/europarl/ are much larger than the Bible (several million tokens for most pairs vs. fewer than 1 million for the Bible).

That said, I agree that http://www.statmt.org/europarl/ does not cover non-European languages.