BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Halıcıoğlu Data Science Institute - UC San Diego - ECPv6.16.2//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Halıcıoğlu Data Science Institute - UC San Diego
X-ORIGINAL-URL:https://datascience.ucsd.edu
X-WR-CALDESC:Events for Halıcıoğlu Data Science Institute - UC San Diego
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Los_Angeles
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20230312T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20231105T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20240310T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20241103T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20250309T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20251102T090000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240423T110000
DTEND;TZID=America/Los_Angeles:20240423T120000
DTSTAMP:20260527T153250
CREATED:20240501T163343Z
LAST-MODIFIED:20240501T164107Z
UID:10000474-1713870000-1713873600@datascience.ucsd.edu
SUMMARY:Building and Deploying Large Language Model Applications Efficiently and Verifiably | Ying Sheng
DESCRIPTION:Abstract:  \n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nThe applications of large language models (LLMs) are increasingly complex and diverse\, necessitating efficient and reliable frameworks for building and deploying them. In this talk\, I will begin with algorithms and systems for serving LLMs for everyone (FlexGen\, S-LoRA\, VTC)\, highlighting the growing trend of personalized LLM services. My work addresses the need to run LLMs locally for isolated individual needs. It also tackles the problem of efficiency and service fairness when resource sharing among many users is required. Once we have efficient deployment\, a primary concern is the reliability of generation. The second part of this talk aims to address this issue by exploring verifiable code generation. To achieve this\, I adopt tools in formal verification to facilitate LLMs in generating correctness certificates alongside other artifacts (Clover). Finally\, I will touch on future research avenues\, such as integrating formal methods with LLMs and developing programming systems for generative AI. \nBio:  \n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nYing Sheng is a Ph.D. candidate in Computer Science at Stanford University\, advised by Clark Barrett. Her research focuses on building and deploying large language model applications\, emphasizing accessibility\, efficiency\, programmability\, and verifiability. Ying has authored numerous papers in top-tier AI\, system\, and automated reasoning conferences and journals\, such as NeurIPS\, ICML\, ICLR\, OSDI\, SOSP\, IJCAR\, and JAR. Her work has received a Best Paper award (as first author) at IJCAR and a Best Tool Paper award at TACAS. As a core member of the LMSYS Org\, she has developed influential open models\, datasets\, systems\, and evaluation tools\, such as Vicuna\, Chatbot Arena\, and SGLang. Ying is a recipient of the Machine Learning and Systems Rising Stars Award (2023) and the a16z Open Source AI Grant (2023). More information about her can be found at https://sites.google.com/view/yingsheng.
URL:https://datascience.ucsd.edu/event/building-and-deploying-large-language-model-applications-efficiently-and-verifiably-ying-sheng/
LOCATION:Computer Science & Engineering Building (CSE)\, Room 1242\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
END:VEVENT
END:VCALENDAR