Event Series: Special Seminar Series

Building and Deploying Large Language Model Applications Efficiently and Verifiably | Ying Sheng

Computer Science & Engineering Building (CSE), Room 1242, 3234 Matthews Ln, La Jolla

The applications of large language models (LLMs) are increasingly complex and diverse, necessitating efficient and reliable frameworks for building and deploying them. In this talk, I will begin with algorithms and systems for serving LLMs for everyone (FlexGen, S-LoRA, VTC), highlighting the growing trend of personalized LLM services. My work addresses the need to run LLMs locally for isolated individual needs. It also tackles the problems of efficiency and service fairness that arise when many users must share resources. Once deployment is efficient, a primary concern is the reliability of generation. The second part of this talk addresses this issue by exploring verifiable code generation. To achieve this, I adopt tools from formal verification to help LLMs generate correctness certificates alongside other artifacts (Clover). Finally, I will touch on future research avenues, such as integrating formal methods with LLMs and developing programming systems for generative AI.