Language identification of short text segments with n-gram models

Reference:

Tommi Vatanen, Jaakko J. Väyrynen, and Sami Virpioja. Language identification of short text segments with n-gram models. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), pages 3423–3430. European Language Resources Association (ELRA), 2010.

Suggested BibTeX entry:

@inproceedings{Vatanen10LREC,
    author = {Tommi Vatanen and Jaakko J. V{\"{a}}yrynen and Sami Virpioja},
    booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
    pages = {3423--3430},
    publisher = {European Language Resources Association (ELRA)},
    title = {Language identification of short text segments with n-gram models},
    year = {2010},
}

PDF (163 kB)