publications

Publications and patents in reversed chronological order. * denotes equal contribution.

2025

  1. Preprint
    Exploring Model Invariance with Discrete Search for Ultra-Low-Bit Quantization
    Yuqiao Wen, Yanshuai Cao, and Lili Mou
    2025
  2. AAAI
    ebbs.png
    EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
    Yuqiao Wen, Behzad Shayegh, Chenyang Huang, Yanshuai Cao, and Lili Mou
    The 39th Annual AAAI Conference on Artificial Intelligence, 2025

2024

  1. Preprint
    neuzip.png
    NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks
    Yongchang Hao, Yanshuai Cao, and Lili Mou
    arXiv preprint arXiv:2410.20650, 2024
  2. NeurIPS
    llm_pddl.png
    Leveraging Environment Interaction for Automated PDDL Translation and Planning with Large Language Models
    Sadegh Mahdavi, Raquel Aoki, Keyi Tang, and Yanshuai Cao
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
  3. NeurIPS
    llm_abstraction.png
    Do LLMs Build World Representations? Probing Through the Lens of State Abstraction
    Zichao Li, Yanshuai Cao, and Jackie CK Cheung
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
  4. ACL
    jumpstart.png
    Jump Starting Bandits with LLM-Generated Prior Knowledge
    Parand A. Alamdari, Yanshuai Cao, and Kevin H. Wilson
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  5. ICML
    flora.png
    Flora: Low-Rank Adapters Are Secretly Gradient Compressors
    Yongchang Hao, Yanshuai Cao, and Lili Mou
    In Forty-first International Conference on Machine Learning, Nov 2024
  6. Preprint
    Ginger: An Efficient Curvature Approximation with Linear Complexity for General Neural Networks
    Yongchang Hao, Yanshuai Cao, and Lili Mou
    arXiv preprint arXiv:2402.03295, Nov 2024
  7. ICLR
    Ensemble distillation for unsupervised constituency parsing
    Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu, Jackie CK Cheung, and Lili Mou
    International Conference on Learning Representations, Nov 2024
  8. Patent
    System and method for improved neural network training
    CAO Yanshuai, Yik Chau Lui, Weiguang Ding, and Ruitong Huang
    Aug 2024
    US Patent 12,056,605
  9. Patent
    System and method for machine learning architecture for partially-observed multimodal data
    Yu Gong, Jiawei He, Thibaut Durand, Megha Nawhal, CAO Yanshuai, MORI Gregory, and Seyed Hossein Hajimirsadeghi
    Jul 2024
    US Patent 12,033,083
  10. Patent
    System and method for machine learning architecture with variational autoencoder pooling
    Teng Long, CAO Yanshuai, and Jackie CK Cheung
    Feb 2024
    US Patent 11,914,955
  11. Patent
    Transformer-based architecture for density ratio estimation
    TANG Keyi, and CAO Yanshuai
    May 2024
    US Patent App. 18/491,417

2023

  1. ICLR
    An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation
    Yuqiao Wen, Yongchang Hao, Yanshuai Cao, and Lili Mou
    International Conference on Learning Representations, May 2023
  2. Patent
    Method and device for conducting measurements for an N-dimensional data structure
    Weiguang Ding, Ruitong Huang, Luyu Wang, and CAO Yanshuai
    Jan 2023
    US Patent 11,551,041
  3. Patent
    Robust pruned neural networks via adversarial training
    Luyu Wang, Weiguang Ding, Ruitong Huang, CAO Yanshuai, and Yik Chau Lui
    Jan 2023
    US Patent 11,562,244
  4. Patent
    System and method for improving deep neural network performance
    CAO Yanshuai, Ruitong Huang, and Junfeng Wen
    Sep 2023
    US Patent 11,755,916
  5. Patent
    System and method for machine learning with long-range dependency
    CAO Yanshuai, and Peng Xu
    Sep 2023
    US Patent 11,763,129
  6. Patent
    System and method for controllable machine text generation architecture
    Peng Xu, CAO Yanshuai, and Jackie CK Cheung
    Sep 2023
    US Patent 11,763,100
  7. Patent
    System and method for machine learning architecture with variational hyper-RNN
    DENG Ruizhi, CAO Yanshuai, Bo Chang, and Marcus Brubaker
    Mar 2023
    US Patent 11,615,305

2022

  1. Patent
    System and method for cross-domain transferable neural coherence model
    CAO Yanshuai, Hamidreza SAGHIR, Jin Sung KANG, Teng Long, Jackie CK CHEUNG, and  others
    Mar 2022
    US Patent 11,270,072
  2. Patent
    System and method for transferable natural language interface
    CAO Yanshuai, Peng Xu, TANG Keyi, Wei Yang, ZI Wenjie, Teng Long, Jackie Chit Kit Cheung, Chenyang Huang, MOU Lili, Hamidreza Shahidi, and  others
    Apr 2022
    US Patent App. 17/508,914

2021

  1. Preprint
    hier_syn_sempar.png
    Hierarchical Neural Data Synthesis for Semantic Parsing
    Wei Yang, Peng Xu, and Yanshuai Cao
    arXiv preprint arXiv:2112.02212, Apr 2021
  2. ACL
    codegen_mono.png
    Code Generation from Natural Language with Less Prior Knowledge and More Monolingual Data
    Sajad Norouzi, Keyi Tang, and Yanshuai Cao
    In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics, Aug 2021
  3. ACL
    dt_fixup.png
    Optimizing Deeper Transformers on Small Datasets
    Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie Chi Kit Cheung, Simon J.D. Prince, and Yanshuai Cao
    In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics, Aug 2021
  4. ACL-Demo
    TURING: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface
    Peng Xu, Wenjie Zi, Hamidreza Shahidi, Ákos Kádár, Keyi Tang, Wei Yang, Jawad Ateeq, Harsh Barot, Meidan Alon, and Yanshuai Cao
    In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Aug 2021
  5. Workshop
    A Globally Normalized Neural Model for Semantic Parsing
    Chenyang Huang, Wei Yang, Yanshuai Cao, Osmar Zaı̈ane, and Lili Mou
    In Proceedings of the 5th Workshop on Structured Prediction for NLP (SPNLP 2021), Aug 2021
  6. Patent
    Method and device for generative adversarial network training
    BOSE Avishek, and CAO Yanshuai
    Jul 2021
    US Patent 11,062,179
  7. Patent
    System, methods, and devices for visual construction of operations for data querying
    CAO Yanshuai, and Luyu Wang
    Aug 2021
    US Patent 11,080,292
  8. Patent
    System and method for testing machine learning
    Yik Chau Lui, and CAO Yanshuai
    Oct 2021
    US Patent App. 17/227,086

2020

  1. AISTATS
    llm_mi.png
    Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer
    Yanshuai Cao*, and Peng Xu*
    In Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, 26–28 aug 2020
  2. ICML
    rd_eval.png
    Evaluating Lossy Compression Rates of Deep Generative Models
    Sicong Huang*, Alireza Makhzani*Yanshuai Cao, and Roger Grosse
    In Proceedings of the 37th International Conference on Machine Learning, 13–18 jul 2020
  3. ICML
    vae_controllable.png
    On Variational Learning of Controllable Representations for Text without Supervision
    Peng Xu, Jackie Chi Kit Cheung, and Yanshuai Cao
    In Proceedings of the 37th International Conference on Machine Learning, 13–18 jul 2020
  4. Preprint
    Variational Hyper RNN for Sequence Modeling
    Ruizhi Deng, Yanshuai Cao, Bo Chang, Leonid Sigal, Greg Mori, and Marcus A Brubaker
    arXiv preprint arXiv:2002.10501, 13–18 jul 2020
  5. Patent
    System and method for adaptive data visualization
    Luyu Wang, and CAO Yanshuai
    Aug 2020
    US Patent 10,739,955
  6. Patent
    Systems and methods for cyberbot network detection
    Ashkan Amiri, Bryce Croll, FONG Cory, Athinthra Krishnaswamy Sethurajan, Vikash Yadav, Sylvester King Chun Chiang, QIN Zhengyi, Cathal Smyth, Yik Chau Lui, CAO Yanshuai, and  others
    Oct 2020
    US Patent 10,819,724
  7. Patent
    Systems and methods for malicious code detection
    Cathal Smyth, FONG Cory, Yik Chau Lui, and CAO Yanshuai
    Jun 2020
    US Patent 10,685,284
  8. Patent
    System and method for reproducible machine learning
    Weiguang Ding, and CAO Yanshuai
    Oct 2020
    US Patent 10,802,822

2019

  1. Preprint
    Preventing Posterior Collapse in Sequence VAEs with Pooling
    Teng Long, Yanshuai Cao, and Jackie Chi Kit Cheung
    arXiv preprint arXiv:1911.03976, Oct 2019
  2. ACL
    A Cross-Domain Transferable Neural Coherence Model
    Peng Xu, Hamidreza Saghir, Jin Sung Kang, Teng Long, Avishek Joey Bose, Yanshuai Cao, and Jackie Chi Kit Cheung
    In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Jul 2019

2018

  1. Preprint
    Few-shot Self-Reminder to Overcome Catastrophic Forgetting
    Junfeng Wen, Yanshuai Cao, and Ruitong Huang
    NeurIPS 2018 Workshop on Continual Learning, Jul 2018
  2. Workshop
    Compositional Hard Negatives for Visual Semantic Embeddings via an Adversary
    A. Bose, Huan Ling, and Yanshuai Cao
    NeurIPS 2018 Workshop on ViGIL, Jul 2018
  3. ICLR
    bre_gan.png
    Improving GAN Training via Binarized Representation Entropy (BRE) Regularization
    Yanshuai Cao, Gavin Weiguang Ding, Kry Yik-Chau Lui, and Ruitong Huang
    International Conference on Learning Representations, Jul 2018
  4. Preprint
    Adversarial Robustness of Pruned Neural Networks
    Luyu Wang, Gavin Weiguang Ding, Ruitong Huang, Yanshuai Cao, and Yik Chau Lui
    Jul 2018
  5. ACL
    Adversarial Contrastive Estimation
    Avishek Joey Bose*, Huan Ling*, and Yanshuai Cao*
    In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2018
  6. PhD Thesis
    Scaling Gaussian Processes
    Yanshuai Cao
    Jul 2018

2017

  1. Workshop
    Implicit Manifold Learning on Generative Adversarial Networks
    Kry Yik Chau Lui, Yanshuai Cao, Maxime Gazeau, and Kelvin Shuangjian Zhang
    ICML 2017 Workshop on Implicit Models, Jul 2017
  2. Workshop
    Automatic Selection of t-SNE Perplexity
    Yanshuai Cao, and Luyu Wang
    ICML 2017 Workshop on AutoML, Jul 2017

2016

  1. ICLR
    feat_adv.png
    Adversarial Manipulation of Deep Representations
    Sara Sabour*Yanshuai Cao*, Fartash Faghri, and David J. Fleet
    Jul 2016

2015

  1. TPAMI
    Efficient Optimization for Sparse Gaussian Process Regression
    Yanshuai Cao, Marcus A Brubaker, David J Fleet, and Aaron Hertzmann
    IEEE Transactions on Pattern Analysis and Machine Intelligence, Jul 2015
  2. Workshop
    Transductive Log Opinion Pool of Gaussian Process Experts
    Yanshuai Cao, and David J Fleet
    NIPS2015 Workshop on Nonparametric Methods for Large Scale Representation Learning, Jul 2015

2014

  1. Workshop
    Generalized Product of Experts for Automatic and Principled Fusion of Gaussian Process Predictions
    Yanshuai Cao, and David J Fleet
    Modern Nonparametrics 3: Automating the Learning Pipeline Workshop at NIPS, Jul 2014

2013

  1. NeurIPS
    cholqr.png
    Efficient Optimization for Sparse Gaussian Process Regression
    Yanshuai Cao, Marcus A Brubaker, David J Fleet, and Aaron Hertzmann
    In Advances in Neural Information Processing Systems, Jul 2013