Research Projects

Coordinate Structures

Learning the ordering of coordinate compounds and elaborate expressions in Hmong, Lahu, and Chinese.

Read more »

East Tusom

Prepare a corpus of East Tusom wordlist and text data for release, with phonetic and phonemic transcriptions and appropriate metadata. Produce a sketch of the phonetics and phonology of East Tusom (Tangkhulic; Tibeto-Burman) based on ibid.

Read more »

Mesoamerican Morphology

Computational morphology for language documentation.

Read more »

Automatic Interlinear Glossing for Under-Resourced Languages

Attempt to leverage neural-based models with dual sources (transcription and translation) to create a hard-to-obtain gloss from an easy-to-obtain parallel corpus.

Read more »

Intrinsic Evaluation of Contextualized Word Embeddings

Contextualize a word translation task; fill the gap between outdated dictionary-based word translation tasks and update contextualized word embedding evaluation.

Read more »