Elizabeth Salesky

[Elizabeth Salesky]
PhD student in the CLSP at JHU.

Generally interested in:

speech and text translation
language representations
language variation
low-resource & multilingual settings

About Me

Hi! I’m Liz Salesky (/lɪz səˈlɛski/), a PhD student at the Center for Language and Speech Processing at Johns Hopkins University, advised by Matt Post and Philipp Koehn.
I am very lucky to be supported by the Apple Scholars in AI/ML PhD fellowship.

My research primarily focuses on machine translation and language representations, including how to create models which are more data-efficient and robust to variation across languages and data sources. I co-organize NLP with Friends, an online student seminar, with Abhilasha Ravichander, Yanai Elazar, and Zeerak Waseem.

Previously, I was a Masters student at the Language Technologies Institute at Carnegie Mellon University, where I was advised by Alex Waibel and often collaborated with Alan W Black and the lab at KIT, where I worked in the summers. Before that, I worked at MIT Lincoln Laboratory in the Human Language Technology group from 2012-2017, focused primarily on machine translation and language learning applications. I graduated from Dartmouth College in 2012, where I studied Linguistics and Math. My undergraduate thesis with Ann Irvine compared the linguistic validity of unsupervised segmentation methods.

When not at my computer, I like to learn languages, run, and bike to ice cream!

Kepler Track


Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP Sabrina J. Mielke, Zaid Alyafeai, Elizabeth Salesky, Colin Raffel, Manan Dey, Matthias Gallé, Arun Raja, Chenglei Si, Wilson Y. Lee, Benoît Sagot, Samson Tan arXiv preprint ·
Assessing Evaluation Metrics for Speech-to-Speech Translation Elizabeth Salesky, Julian Mäder, Severin Klinger ASRU 2021 ·
Robust Open-Vocabulary Translation from Visual Text Representations Elizabeth Salesky, David Etter, Matt Post EMNLP 2021 ·
A surprisal—duration trade-off across and within the world's languages Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell EMNLP 2021 ·
The Multilingual TEDx Corpus for Speech Recognition and Translation Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post Interspeech 2021 ·
FINDINGS OF THE IWSLT 2021 EVALUATION CAMPAIGN Antonios Anastasopoulos, Ondřej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alexander Waibel, Changhan Wang, Matthew Wiesner IWSLT 2021 ·
SIGTYP 2021 Shared Task: Robust Spoken Language Identification Elizabeth Salesky, Badr M. Abdullah, Sabrina Mielke, Elena Klyachko, Oleg Serikov, Edoardo Maria Ponti, Ritesh Kumar, Ryan Cotterell, Ekaterina Vylomova SIGTYP 2021 ·
SIGTYP 2020 Shared Task: Prediction of Typological Features Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Giuseppe G. A. Celano, Edoardo M. Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein SIGTYP 2020 ·
Relative Positional Encoding for Speech Recognition and Direct Translation Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alex Waibel INTERSPEECH 2020 ·
A Corpus For Large-Scale Phonetic Typology Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W Black, Jason Eisner ACL 2020 ·
Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing Clara Meister, Elizabeth Salesky, Ryan Cotterell ACL 2020 ·
Phone Features Improve Speech Translation Elizabeth Salesky, Alan W Black ACL 2020 ·
Findings of the 2020 IWSLT Evaluation Campaign Ebrahim Ansari, Nguyen Bach, Ondřej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alex Waibel, Changhan Wang IWSLT 2020 ·
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J Mielke, Shijie Wu, Edoardo Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff, Ryan Cotterell, Miikka Silfverberg, Mans Hulden SIGMORPHON 2020 ·
Optimizing Segmentation Granularity for Neural Machine Translation Elizabeth Salesky, Andrew Runge, Alex Coda, Jan Niehues, Graham Neubig Machine Translation 2020. arXiv:1810.08641 Oct. 2018 ·
The IWSLT 2019 Evaluation Campaign Jan Niehues, Roldano Cattoni, Sebastian Stüker, Matteo Negri, Marco Turchi, Thanh-Le Ha, Elizabeth Salesky, Ramon Sanabria, Loïc Barrault, Lucia Specia, Marcello Federico IWSLT 2019 ·
Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation Elizabeth Salesky, Matthias Sperber, Alan W Black ACL 2019 ·
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology Aditi Chaudhary, Elizabeth Salesky, Gayatri Bhat, David R. Mortensen, Jaime G. Carbonell, Yulia Tsvetkov SIGMORPHON 2019 (Interpretability Prize) ·
Fluent Translations from Disfluent Speech in End-to-End Speech Translation Elizabeth Salesky, Matthias Sperber, Alex Waibel NAACL 2019 ·
Towards Fluent Translations from Disfluent Speech Elizabeth Salesky, Susanne Burger, Alex Waibel SLT 2018 ·
KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning Florian Dessloch, Thanh-Le Ha, Markus Müller, Jan Niehues, Thai-Son Nguyen, Ngoc-Quan Pham, Elizabeth Salesky, Matthias Sperber, Sebastian Stüker, Thomas Zenkel, Alex Waibel COLING 2018 ·
KIT’s Multilingual Neural Machine Translation systems for IWSLT 2017 Ngoc-Quan Pham, Matthias Sperber, Elizabeth Salesky, Thanh-Le Ha, Jan Niehues, Alex Waibel IWSLT 2017 ·
The AFRL-MITLL WMT17 Systems: Old, New, Borrowed, BLEU Jeremy Gwinnup, Timothy Anderson, Michaeel Kazi, Elizabeth Salesky, Grant Erdmann, Katherine Young, Brian Thompson, Jonathan Taylor WMT 2017 ·
The MITLL-AFRL IWSLT 2016 Systems Michaeel Kazi, Elizabeth Salesky, Brian Thompson, Jonathon Taylor, Jeremy Gwinnup, Timothy Anderson, Grant Erdmann, Eric Hansen, Brian Ore, Katherine Young, Michael Hutt IWSLT 2016 ·
The AFRL-MITLL WMT16 News-Translation Task Systems Jeremy Gwinnup, Timothy Anderson, Michaeel Kazi, Elizabeth Salesky, Grant Erdmann, Katherine Young, Brian Thompson WMT 2016 ·
Operational Assessment of Keyword Search on Oral History Elizabeth Salesky, Jessica Ray, Wade Shen LREC 2016 ·
The MITLL-AFRL IWSLT 2015 MT System Michaeel Kazi, Brian Thompson, Elizabeth Salesky, Timothy Anderson, Grant Erdmann, Eric Hansen, Brian Ore, Jeremy Gwinnup, Katherine Young, Michael Hutt, Christina May IWSLT 2015 ·
The AFRL-MITLL WMT15 System: There’s More than One Way to Decode It! Jeremy Gwinnup, Timothy Anderson, Michaeel Kazi, Elizabeth Salesky, Grant Erdmann, Katherine Young, Brian Thompson, Christina May WMT 2015 ·
The MITLL-AFRL IWSLT 2014 MT System Michaeel Kazi, Elizabeth Salesky, Brian Thompson, Jessica Ray, Michael Coury, Wade Shen, Tim Anderson, Grant Erdmann, Jeremy Gwinnup, Katherine Young, Brian Ore, Michael Hutt IWSLT 2014 ·
Exploiting Morphological, Grammatical, and Semantic Correlates for Improved Text Difficulty Assessment Elizabeth Salesky, Wade Shen BEA 2014 ·
The MIT-LL/AFRL IWSLT-2013 MT system Michaeel Kazi, Michael Coury, Elizabeth Salesky, Jessica Ray, Wade Shen, Terry Gleason, Tim Anderson, Grant Erdmann, Lane Schwartz, Brian Ore, Raymond Slyh, Jeremy Gwinnup, Katherine Young, Michael Hutt IWSLT 2013 ·
A Language-Independent Approach to Automatic Text Difficulty Assessment for Second-Language Learners Wade Shen, Jennifer Williams, Tamas Marius, Elizabeth Salesky PITR 2013 ·

Hosted on GitHub Pages — Theme by orderedlist