2/12/2012

Corpus of parsed texts

Njyem was once an unwritten language, and it also lacked a Bible translation.
We are passionately interested in remedying both of these conditions. The two are linked: there can be no good readers of the Bible translation in progress if there are no readers of general literature, and no writers and revisers who can produce the translation.
As for sequence, people should read untranslated literature before translations, which are always more difficult to read than works arising from one's own culture.
The corpus of texts in Njyem is being built up gradually as people contribute texts big and small to us. We transcribe them and then pass them through a parser, using FLEX, a program written and supported by SIL. Once the texts pass the "FLEX-test", they are distributed for further review and comment.
Nineteen texts have reached this point so far.
Interested parties wanting access to them can contact me.
