Po-Yao (Bernie) Huang

Researcher, FAIR

Greetings! I am Po-Yao (Bernie) Huang. I am a research scientist at Facebook AI Research (FAIR) Labs. I obtained my Ph.D. degree from the Language Technologies Institute (LTI) of School of Computer Science (SCS) at Carnegie Mellon University (CMU). My research focus lies at the intersection of multimodal machine learning, encompassing vision, language, and audio, with a particular emphasis on multimodal large language models (MLLMs).

Contact: berniebear_at_gmail.com
Google Scholar: https://scholar.google.com/citations?user=E8K25LIAAAAJ


Work Experience

Facebook

Senior Research Scientist (FAIR Labs) Aug 2022 - present
Research Scientist (FAIR Labs) Aug 2021 - Aug 2022
Research Intern May 2020 - May 2021

MicroSoft

Research Intern (Microsoft Research) Jun 2017 - Aug 2017

MediaTek

Senior Software Engineer Jun 2012 - Jun 2014
Software Engineer Sep 2010 - May 2012

Education

Carnegie Mellon University

Ph.D. in Computer Science - Language and Information Technologies Aug 2016 - Jul 2021

GPA: 4.33/4.33

M.S. in Computer Science - Language Technologies Aug 2014 - Jul 2016

GPA: 4.21/4.33

National Taiwan University

M.S. in Computer Engineering Aug 2007 - Jul 2009

GPA: 4.00/4.00

B.S. in Electrical Engineering Sep 2003 - Jul 2007

GPA: 3.78/4.00


Publication

  • Dinov2: Learning robust visual features without supervision
    Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski
    JMLR, Jan 2024.
  • Demystifying clip data (MetaCLIP)
    Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer
    ICLR, 2024.
  • MAViL: Masked Audio-Video Learners
    Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer
    NeurIPS, 2023.
  • Masked autoencoders that listen
    Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer
    NeurIPS, 2022.

Awards


Scholarship

  • ACL, ICMR, AAAI, ACM MM, CVPR travel awards/grants, 2015-2019
  • NSF travel awards, 2015-2019
  • CMU Research Fellowship 2014-2019
  • Taiwan's Study Abroad Scholarship, 2016
  • Siebel Scholarship, 2016
  • Din-Jing Memorial Scholarship, 2009