Siqi Ouyang (欧阳思琦)

selfie-4.webp

Hi, I’m currently a PhD student at the Language Technologies Institute of the School of Computer Science at Carnegie Mellon University, advised by Prof. Lei Li. Before I came to CMU, I spent two years as a PhD student at the Computer Science Department at UC Santa Barbara also with Lei. Before PhD, I received my B.Eng. from the Institute for Interdisciplinary Information Sciences at Tsinghua University (a.k.a. Yao Class), advised by Prof. Yi Wu.

My research aims to build the foundation for real-time communication across languages. I study simultaneous translation with large language models, with the goal of enabling systems that can listen, understand, and translate as people speak, making multilingual communication as natural and immediate as conversation itself.

Office: GHC 6715, 4902 Forbes Ave, Pittsburgh, PA 15213

Email: siqiouya@andrew.cmu.edu

[Twitter/X] [GitHub] [LinkedIn] [Google Scholar]

News

Mar 11, 2026 Give a lecture at Speech Technology for Conversational AI course at CMU.
Feb 26, 2026 Give a talk at Speech Lunch. Here is the slide.
Sep 05, 2025 Give a talk at TTIC Summer Workshop on Foundations of Speech and Audio Foundation Models.
May 12, 2025 Intern at NVIDIA NeMo again, advised by Shouyang Ding, Oleksii Hrinchuk, and Vitaly Lavrukhin.
Apr 30, 2025 Presented Anticipating Future with Large Language Model for Simultaneous Machine Translation orally at NAACL 2025.
Oct 03, 2024 Give a talk at Speech Lunch. Here is the slide.
May 13, 2024 Intern at NVIDIA NeMo, advised by Zhehuai Chen, Oleksii Hrinchuk, and Vitaly Lavrukhin.
Jan 16, 2024 Join the Language Technologies Institute at Carnegie Mellon University as a PhD student.

Selected Papers

  1. RASST: Fast Cross-modal Retrieval-Augmented Simultaneous Speech Translation
    Jiaxuan Luo, Siqi Ouyang, and Lei Li
    Jan 2026
  2. InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model
    Siqi Ouyang, Xi Xu, and Lei Li
    In Findings of the Association for Computational Linguistics: ACL 2025, Jul 2025
  3. Anticipating Future with Large Language Model for Simultaneous Machine Translation
    Siqi Ouyang, Oleksii Hrinchuk, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Lei Li, and Boris Ginsburg
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
  4. CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation
    Xi Xu, Wenda Xu, Siqi Ouyang, and Lei Li
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  5. FASST: Fast LLM-based Simultaneous Speech Translation
    Siqi Ouyang, Xi Xu, Chinmay Dandekar, and Lei Li
    Aug 2024
  6. CMU‘s IWSLT 2024 Simultaneous Speech Translation System
    Xi Xu, Siqi Ouyang, Brian Yan, Patrick Fernandes, William Chen, Lei Li, Graham Neubig, and Shinji Watanabe
    In Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024), Aug 2024
    Top 1 Human Rating

Selected Awards

Waibel Presidential Fellowship 2024
Tsinghua University Yao Recognition Prize 2021
Gold Medal, Chinese National Olympiad in Informatics 2016