Aston Zhang is a research scientist at Meta Generative AI, building large language models (Llama). Prior to this, he was a scientist/manager at Amazon Web Services AI Research, studying language and multimodal models. He received an ICLR Outstanding Paper Award, an ACM Ubicomp Distinguished Paper Award, and an ACM SenSys Best Paper Award Nomination. His Dive into Deep Learning textbook is adopted worldwide. He obtained a Ph.D. in Computer Science from University of Illinois Urbana-Champaign.

[Filled, see you next year] If you are interested in research internship on LLMs with our team in 2024, just email me.

Books

  • A. Zhang, Z. C. Lipton, M. Li, and A. J. Smola
    Dive into Deep Learning
    Cambridge University Press, 2023
    • Adopted at 500 universities from 70 countries
    • Featured in the AWS re:Invent keynote by Swami, Head of AWS AI, Database, and Analytics
  • A. Zhang, M. Li, Z. C. Lipton, and A. J. Smola
    动手学深度学习
    人民邮电出版社, 2nd ed., 2023, 1st ed., 2019

Papers (All)

Tutorials

  • with A. J. Smola
    Attention in Deep Learning [Keynote] [PDF] [Video]
    In The 36th International Conference on Machine Learning (ICML), 2019

  • with H. Lin, X. Shi, L. Lausen, H. He, S. Zha, and A. J. Smola
    Dive into Deep Learning for Natural Language Processing
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019

  • with H. Lin, L. Lausen, S. Zha, A. J. Smola, C. Wang, and M. Li
    From Shallow to Deep Language Representations: Pre-training, Fine-tuning, and Beyond [Website]
    In The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2019

  • with H. Zhang, T. He, Z. Zhang, Z. Zhang, H. Lin, and M. Li
    Everything You Need to Know to Reproduce SOTA Deep Learning Models from Hands-on Tutorial
    In International Conference on Computer Vision (ICCV), 2019

Services

  • Area Chair
    • Annual Meeting of the Association for Computational Linguistics (ACL)
    • Conference on Empirical Methods in Natural Language Processing (EMNLP)
    • International Conference on Computational Linguistics (COLING)