Shanshan Han

About

I'm an incoming research scientist at Tiktok. My research focuses on building trustworthy AI systems, addressing safety and security challenges across the entire AI lifecycle.

Keywords: AI Safety Data Security LLM Guardrails FL Security Data Management

Research Overview

My research addresses AI safety and security across multiple layers of AI systems, including:

  • Infrastructure Layer: Secure Data Foundations. Link
  • Training Layer: Robust Federated Learning. Link 1 Link 2
  • Inference Layer: Trustworthy LLM Inference and LLM Guardrail Pipelines. Link
  • Application Layer: Access Control for Retrieval-Augmented Generation Systems. Link
Research overview

Selected Works

  • FedSecurity: A Benchmark for Attacks and Defenses in Federated Learning and Federated LLMs
    Shanshan Han, Baturalp Buyukates, Zijian Hu, Han Jin, Weizhao Jin, Lichao Sun, Xiaoyang Wang, Wenxuan Wu, Chulin Xie, Yuhang Yao, Kai Zhang, Qifan Zhang, Yuhui Zhang, Salman Avestimehr, Chaoyang He
    Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024 (KDD 2024)
    Link Invited Talk @ AI TIME
  • Kick Bad Guys Out! Conditionally Activated Anomaly Detection in Federated Learning with Zero-Knowledge Proof Verification
    Shanshan Han, Wenxuan Wu, Baturalp Buyukates, Weizhao Jin, Qifan Zhang, Yuhang Yao, Salman Avestimehr
    NDSS-PRISM 2026
  • Veil: Storage and Communication Efficient Volume Hiding Algorithms
    Shanshan Han, Vishal Chakraborty, Michael Goodrich, Sharad Mehrotra, Shantanu Sharma
    Proceedings of the ACM on Management of Data, 2023 (SIGMOD 2024)
    Link Invited Talk @ Cryptography Group, MongoDB Inc.
  • An Iterative Scheme for Leverage-based Approximate Aggregation
    Shanshan Han, Hongzhi Wang, Jialin Wan, Jianzhong Li
    IEEE 35th International Conference on Data Engineering (ICDE 2019)
  • Don't Be a Pot Stirrer! Authorized Vector Data Retrieval via Access-Aware Indexing
    Shanshan Han, Vishal Chakraborty, Sharad Mehrotra
  • Bridging the Safety Gap: A Guardrail Pipeline for Trustworthy LLM Inferences
    Shanshan Han, Salman Avestimehr, Chaoyang He
    Link Invited Talk @ Ploutos AI Community
  • FedML-HE: An Efficient Homomorphic-Encryption-Based Privacy-Preserving Federated Learning System
    Weizhao Jin, Yuhang Yao, Shanshan Han, Carlee Joe-Wong, Srivatsan Ravi, Salman Avestimehr, Chaoyang He
    FL@FM-NeurIPS 2023 Workshop
  • Fox-1: Open Small Language Model For Cloud And Edge
    Zijian Hu, Jipeng Zhang, Rui Pan, Zhaozhuo Xu, Shanshan Han, Han Jin, Alay Dilipbhai Shah, Dimitris Stripelis, Yuhang Yao, Salman Avestimehr, Chaoyang He, Tong Zhang
  • Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs
    Yide Ran, Zhaozhuo Xu, Yuhang Yao, Zijian Hu, Shanshan Han, Han Jin, Alay Dilipbhai Shah, Jipeng Zhang, Dimitris Stripelis, Tong Zhang, Salman Avestimehr, Chaoyang He

Vision Papers

  • Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond
    Shanshan Han
  • LLM Multi-Agent Systems: Challenges and Open Problems
    Shanshan Han, Qifan Zhang, Weizhao Jin, Zhaozhuo Xu

PhD Thesis

  • Safeguarding AI Lifecycles in the Cloud: Secure Data Management for Data at Rest, in Transit, and in Use.