Contact Us

Give us a call or drop by anytime, we endeavor to answer all inquiries within 24 hours.

map

Find us

PO Box 16122 Collins Street West Victoria, Australia

Email us

info@domain.com / example@domain.com

Phone support

Phone: + (066) 0760 0260 / + (057) 0760 0560

Loading Events

« All Events

  • This event has passed.
Event Series Event Series: Special Seminar Series

Building and Deploying Large Language Model Applications Efficiently and Verifiably | Ying Sheng

April 23 @ 11:00 am - 12:00 pm

Abstract: 

The applications of large language models (LLMs) are increasingly complex and diverse, necessitating efficient and reliable frameworks for building and deploying them. In this talk, I will begin with algorithms and systems for serving LLMs for everyone (FlexGen, S-LoRA, VTC), highlighting the growing trend of personalized LLM services. My work addresses the need to run LLMs locally for isolated individual needs. It also tackles the problem of efficiency and service fairness when resource sharing among many users is required. Once we have efficient deployment, a primary concern is the reliability of generation. The second part of this talk aims to address this issue by exploring verifiable code generation. To achieve this, I adopt tools in formal verification to facilitate LLMs in generating correctness certificates alongside other artifacts (Clover). Finally, I will touch on future research avenues, such as integrating formal methods with LLMs and developing programming systems for generative AI.

Bio: 

Ying Sheng is a Ph.D. candidate in Computer Science at Stanford University, advised by Clark Barrett. Her research focuses on building and deploying large language model applications, emphasizing accessibility, efficiency, programmability, and verifiability. Ying has authored numerous papers in top-tier AI, system, and automated reasoning conferences and journals, such as NeurIPS, ICML, ICLR, OSDI, SOSP, IJCAR, and JAR. Her work has received a Best Paper award (as first author) at IJCAR and a Best Tool Paper award at TACAS. As a core member of the LMSYS Org, she has developed influential open models, datasets, systems, and evaluation tools, such as Vicuna, Chatbot Arena, and SGLang. Ying is a recipient of the Machine Learning and Systems Rising Stars Award (2023) and the a16z Open Source AI Grant (2023). More information about her can be found at https://sites.google.com/view/yingsheng.

Details

Date:
April 23
Time:
11:00 am - 12:00 pm
Series:
Event Category:

Venue

Computer Science & Engineering Building (CSE), Room 1242
3234 Matthews Ln
La Jolla, CA 92093 United States

Organizer

HDSI General