The LLab at Carnegie Mellon University meets weekly to discuss issues in human language and computation.





Prepare a corpus of East Tusom wordlist and text data for release, with phonetic and phonemic transcriptions and appropriate metadata. Produce a sketch of the phonetics and phonology of East Tusom (Tangkhulic; Tibeto-Burman) based on this corpus.


Automatic Interlinear Glossing for Under-Resourced Languages

Use neural models with two input sources (transcription and translation) to produce hard-to-obtain glosses from an easy-to-obtain parallel corpus.


Intrinsic Evaluation of Contextualized Word Embeddings

Contextualize the word translation task to bridge the gap between outdated dictionary-based word translation tasks and up-to-date evaluation of contextualized word embeddings.



Graduate Students

Nathan Anderson
Master of Language Technologies
Brendon Boldt
Master of Language Technologies
Satoru Ozaki
Master of Language Technologies
Yue Yin
Master of Language Technologies
Sean Zhang
Xingyuan Zhao
Master of Language Technologies


Jong Hyuk (Jay) Park
Master of Language Technologies, 2020. Now at the University of Edinburgh.