About Me
Hi! I’m Liz Salesky (/lɪz səˈlɛski/), a PhD student at the Center for Language and Speech Processing at Johns Hopkins University, advised by Matt Post and Philipp Koehn.
I am very lucky to be supported by the Apple Scholars in AI/ML PhD fellowship.
My research primarily focuses on language representations for machine translation and multilinguality, including alternatives to traditional tokenization, multimodal representation learning, and how to create more data-efficient and robust models.
I am also interested in studying and modeling variation within and across languages.
Previously, I received my MSc from CMU in 2019 advised by Alex Waibel, collaborating often with the KIT ISL lab and Alan W Black.
Before that, I worked at MIT Lincoln Laboratory from 2012-2017, focused on machine translation and language learning applications.
I graduated from Dartmouth College in 2012, where I majored in Linguistics and Math.
When not at my computer, I like to learn languages, run, and bike to ice cream!
Publications
2023 |
Evaluating multilingual speech translation under realistic conditions with resegmentation and terminology
Elizabeth Salesky, Kareem Darwish, Mohamed Al-Badrashiny, Mona Diab, Jan Niehues
IWSLT 2023 ·
|
Findings of the IWSLT 2023 Evaluation Campaign
Milind Agarwal, ..., Elizabeth Salesky, ..., + many more
IWSLT 2023 ·
|
A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen
ICASSP 2023 ·
|
Language Modelling with Pixels
Phillip Rust, Jonas F. Lotz, Emanuele Bugliarello, Elizabeth Salesky, Miryam de Lhoneux, Desmond Elliott
ICLR 2023 · notable top-5% ·
|
2022 |
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop: Teven Le Scao, ..., Elizabeth Salesky, ..., + many more
arXiv preprint ·
|
BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Josh Meyer, David Ifeoluwa Adelani, Edresson Casanova, Alp Öktem, Daniel Whitenack Julian Weber, Salomon Kabongo, Elizabeth Salesky, Iroro Orife, Colin Leong, Perez Ogayo, Chris Emezue, Jonathan Mukiibi, Salomey Osei, Apelete Agbolo, Victor Akinode, Bernard Opoku, Samuel Olanrewaju, Jesujoba Alabi, Shamsuddeen Muhammad
INTERSPEECH 2022 ·
|
UniMorph 4.0: Universal Morphology
Khuyagbaatar Batsuren, Omer Goldman, ..., Elizabeth Salesky, ..., + many more
LREC 2022 ·
|
Findings of the IWSLT 2022 Evaluation Campaign
Antonios Anastasopoulos, ..., Elizabeth Salesky, ..., + many more
IWSLT 2022 ·
|
2021 |
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke, Zaid Alyafeai, Elizabeth Salesky, Colin Raffel, Manan Dey, Matthias Gallé, Arun Raja, Chenglei Si, Wilson Y. Lee, Benoît Sagot, Samson Tan
arXiv preprint ·
|
Assessing Evaluation Metrics for Speech-to-Speech Translation
Elizabeth Salesky, Julian Mäder, Severin Klinger
ASRU 2021 ·
|
Robust Open-Vocabulary Translation from Visual Text Representations
Elizabeth Salesky, David Etter, Matt Post
EMNLP 2021 ·
|
A surprisal—duration trade-off across and within the world's languages
Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell
EMNLP 2021 ·
|
The Multilingual TEDx Corpus for Speech Recognition and Translation
Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post
INTERSPEECH 2021 ·
|
Findings of the IWSLT 2021 Evaluation Campaign
Antonios Anastasopoulos, Ondřej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alexander Waibel, Changhan Wang, Matthew Wiesner
IWSLT 2021 ·
|
SIGTYP 2021 Shared Task: Robust Spoken Language Identification
Elizabeth Salesky, Badr M. Abdullah, Sabrina Mielke, Elena Klyachko, Oleg Serikov, Edoardo Maria Ponti, Ritesh Kumar, Ryan Cotterell, Ekaterina Vylomova
SIGTYP 2021 ·
|
2020 |
SIGTYP 2020 Shared Task: Prediction of Typological Features
Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Giuseppe G. A. Celano, Edoardo M. Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein
SIGTYP 2020 ·
|
Relative Positional Encoding for Speech Recognition and Direct Translation
Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stueker, Jan Niehues, Alex Waibel
INTERSPEECH 2020 ·
|
A Corpus For Large-Scale Phonetic Typology
Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W Black, Jason Eisner
ACL 2020 ·
|
Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing
Clara Meister, Elizabeth Salesky, Ryan Cotterell
ACL 2020 ·
|
Phone Features Improve Speech Translation
Elizabeth Salesky, Alan W Black
ACL 2020 ·
|
Findings of the 2020 IWSLT Evaluation Campaign
Ebrahim Ansari, Nguyen Bach, Ondřej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alex Waibel, Changhan Wang
IWSLT 2020 ·
|
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J Mielke, Shijie Wu, Edoardo Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff, Ryan Cotterell, Miikka Silfverberg, Mans Hulden
SIGMORPHON 2020 ·
|
Optimizing Segmentation Granularity for Neural Machine Translation
Elizabeth Salesky, Andrew Runge, Alex Coda, Jan Niehues, Graham Neubig
Machine Translation 2020. arXiv:1810.08641 Oct. 2018 ·
|
2019 |
The IWSLT 2019 Evaluation Campaign
Jan Niehues, Roldano Cattoni, Sebastian Stüker, Matteo Negri, Marco Turchi, Thanh-Le Ha, Elizabeth Salesky, Ramon Sanabria, Loïc Barrault, Lucia Specia, Marcello Federico
IWSLT 2019 ·
|
Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation
Elizabeth Salesky, Matthias Sperber, Alan W Black
ACL 2019 ·
|
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology
Aditi Chaudhary, Elizabeth Salesky, Gayatri Bhat, David R. Mortensen, Jaime G. Carbonell, Yulia Tsvetkov
SIGMORPHON 2019 · Interpretability Prize ·
|
Fluent Translations from Disfluent Speech in End-to-End Speech Translation
Elizabeth Salesky, Matthias Sperber, Alex Waibel
NAACL 2019 ·
|
2018 |
Towards Fluent Translations from Disfluent Speech
Elizabeth Salesky, Susanne Burger, Alex Waibel
SLT 2018 ·
|
KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning
Florian Dessloch, Thanh-Le Ha, Markus Müller, Jan Niehues, Thai-Son Nguyen, Ngoc-Quan Pham, Elizabeth Salesky, Matthias Sperber, Sebastian Stüker, Thomas Zenkel, Alex Waibel
COLING 2018 ·
|
2017 |
KIT’s Multilingual Neural Machine Translation systems for IWSLT 2017
Ngoc-Quan Pham, Matthias Sperber, Elizabeth Salesky, Thanh-Le Ha, Jan Niehues, Alex Waibel
IWSLT 2017 ·
|
The AFRL-MITLL WMT17 Systems: Old, New, Borrowed, BLEU
Jeremy Gwinnup, Timothy Anderson, Michaeel Kazi, Elizabeth Salesky, Grant Erdmann, Katherine Young, Brian Thompson, Jonathan Taylor
WMT 2017 ·
|
2016 |
The MITLL-AFRL IWSLT 2016 Systems
Michaeel Kazi, Elizabeth Salesky, Brian Thompson, Jonathon Taylor, Jeremy Gwinnup, Timothy Anderson, Grant Erdmann, Eric Hansen, Brian Ore, Katherine Young, Michael Hutt
IWSLT 2016 ·
|
The AFRL-MITLL WMT16 News-Translation Task Systems
Jeremy Gwinnup, Timothy Anderson, Michaeel Kazi, Elizabeth Salesky, Grant Erdmann, Katherine Young, Brian Thompson
WMT 2016 ·
|
Operational Assessment of Keyword Search on Oral History
Elizabeth Salesky, Jessica Ray, Wade Shen
LREC 2016 ·
|
2015 |
The MITLL-AFRL IWSLT 2015 MT System
Michaeel Kazi, Brian Thompson, Elizabeth Salesky, Timothy Anderson, Grant Erdmann, Eric Hansen, Brian Ore, Jeremy Gwinnup, Katherine Young, Michael Hutt, Christina May
IWSLT 2015 ·
|
The AFRL-MITLL WMT15 System: There’s More than One Way to Decode It!
Jeremy Gwinnup, Timothy Anderson, Michaeel Kazi, Elizabeth Salesky, Grant Erdmann, Katherine Young, Brian Thompson, Christina May
WMT 2015 ·
|
2014 |
The MITLL-AFRL IWSLT 2014 MT System
Michaeel Kazi, Elizabeth Salesky, Brian Thompson, Jessica Ray, Michael Coury, Wade Shen, Tim Anderson, Grant Erdmann, Jeremy Gwinnup, Katherine Young, Brian Ore, Michael Hutt
IWSLT 2014 ·
|
Exploiting Morphological, Grammatical, and Semantic Correlates for Improved Text Difficulty Assessment
Elizabeth Salesky, Wade Shen
BEA 2014 ·
|
2013 |
The MIT-LL/AFRL IWSLT-2013 MT system
Michaeel Kazi, Michael Coury, Elizabeth Salesky, Jessica Ray, Wade Shen, Terry Gleason, Tim Anderson, Grant Erdmann, Lane Schwartz, Brian Ore, Raymond Slyh, Jeremy Gwinnup, Katherine Young, Michael Hutt
IWSLT 2013 ·
|
A Language-Independent Approach to Automatic Text Difficulty Assessment for Second-Language Learners
Wade Shen, Jennifer Williams, Tamas Marius, Elizabeth Salesky
PITR 2013 ·
|