East Tusom

East Tusom is a Tangkhulic (Tibeto-Burman) language of Manipur State, India. It is highly endangered and is spoken primarily in one village in Ukhrul district. Its phonetics and phonology are unusual for the area. David and collaborators are currently working to document East Tusom and to build speech resources based upon field recordings of the language.

A speech dataset from Tusom (called Tusom2021) is available on GitHub. A preprint of the paper describing this dataset is available on arXiv. This paper has been accepted for Interspeech 2021. Rather than citing the arXiv preprint, you should use the following BibTeX:

@inproceedings{mortensen2021tusom2021,
  author={David R. Mortensen and Jordan Picone and Xinjian Li and Kathleen Siminyu},
  title={Tusom2021: A Phonetically Transcribed Speech Dataset from an Endangered Language for Universal Phone Recognition Experiments},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={3660--3664},
  doi={10.21437/Interspeech.2021-1435}
}

The paper "East Tusom: A phonetic and phonological sketch of a largely undocumented Tangkhulic language" has been published in Linguistics of the Tibeto-Burman Area.

@article{mortensen2021east,
   author={Mortensen, David R. and Picone, Jordan},
   title={East {Tusom}: A phonetic and phonological sketch of a largely undocumented {Tangkhulic} language}, 
   journal={Linguistics of the Tibeto-Burman Area},
   year={2021},
   volume={44},
   number={2},
   pages={168--196},
   doi={10.1075/ltba.21009.mor},
   url={https://www.jbe-platform.com/content/journals/10.1075/ltba.21009.mor},
   publisher={John Benjamins},
   issn={0731-3500}
}

If you don’t have access to LTBA, you can read the manuscript of this paper.

People