Siqi Ouyang (欧阳思琦)


Hi, I’m currently a PhD student at the Language Technologies Institute in the School of Computer Science at Carnegie Mellon University, advised by Prof. Lei Li. Before coming to CMU, I spent two years as a PhD student in the Computer Science Department at UC Santa Barbara, also advised by Lei. Before my PhD, I received my B.Eng. from the Institute for Interdisciplinary Information Sciences at Tsinghua University (a.k.a. Yao Class), advised by Prof. Yi Wu.

My current research focuses on low-latency simultaneous speech translation with large language models. The ambition is to build a system that can translate speech in real time (with less than one second of latency), enabling face-to-face communication between people who speak different languages.

Office: GHC 6715, 4902 Forbes Ave, Pittsburgh, PA 15213

Email: siqiouya@andrew.cmu.edu

[Twitter/X] [GitHub] [LinkedIn] [Google Scholar]

News

Apr 30, 2025 Presented the oral paper "Anticipating Future with Large Language Model for Simultaneous Machine Translation" at NAACL 2025.
Oct 03, 2024 Gave a talk at Speech Lunch. Here are the slides.
May 13, 2024 Interned at NVIDIA Speech AI, advised by Zhehuai Chen, Oleksii Hrinchuk, and Vitaly Lavrukhin.
Jan 16, 2024 Joined the Language Technologies Institute at Carnegie Mellon University as a PhD student.

Selected Papers

  1. InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model
    Siqi Ouyang, Xi Xu, and Lei Li
    Mar 2025
  2. Anticipating Future with Large Language Model for Simultaneous Machine Translation
    Siqi Ouyang, Oleksii Hrinchuk, Zhehuai Chen, Vitaly Lavrukhin, Jagadeesh Balam, Lei Li, and Boris Ginsburg
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
  3. CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation
    Xi Xu, Wenda Xu, Siqi Ouyang, and Lei Li
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  4. FASST: Fast LLM-based Simultaneous Speech Translation
    Siqi Ouyang, Xi Xu, Chinmay Dandekar, and Lei Li
    Aug 2024
  5. CMU’s IWSLT 2024 Simultaneous Speech Translation System
    Xi Xu, Siqi Ouyang, Brian Yan, Patrick Fernandes, William Chen, Lei Li, Graham Neubig, and Shinji Watanabe
    In Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024), Aug 2024
    Top 1 Human Rating

Selected Awards

Waibel Presidential Fellowship 2024
Tsinghua University Yao Recognition Prize 2021
Gold Medal, Chinese National Olympiad in Informatics 2016