BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Halıcıoğlu Data Science Institute - UC San Diego - ECPv6.16.2//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Halıcıoğlu Data Science Institute - UC San Diego
X-ORIGINAL-URL:https://datascience.ucsd.edu
X-WR-CALDESC:Events for Halıcıoğlu Data Science Institute - UC San Diego
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Los_Angeles
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20230312T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20231105T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20240310T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20241103T090000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0800
TZOFFSETTO:-0700
TZNAME:PDT
DTSTART:20250309T100000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0700
TZOFFSETTO:-0800
TZNAME:PST
DTSTART:20251102T090000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240401T140000
DTEND;TZID=America/Los_Angeles:20240401T153000
DTSTAMP:20260601T091604
CREATED:20240304T172031Z
LAST-MODIFIED:20240329T000153Z
UID:10000453-1711980000-1711985400@datascience.ucsd.edu
SUMMARY:"Instance-Optimization: Rethinking Database Design for the Next 1000X" | Jialin Ding
DESCRIPTION:Abstract: “Modern database systems aim to support a large class of different use cases while simultaneously achieving high performance. However\, as a result of their generality\, databases often achieve adequate performance for the average use case but do not achieve the best performance for any individual use case. In this talk\, I will describe my work on designing databases that use machine learning and optimization techniques to automatically achieve performance much closer to the optimal for each individual use case. In particular\, I will present my work on instance-optimized database storage layouts\, in which the co-design of data structures and optimization policies improves query performance in analytic databases by orders of magnitude. I will highlight how these instance-optimized data layouts address various challenges posed by real-world database workloads and how I implemented and deployed them in production within Amazon Redshift\, a widely-used commercial database system.” \nBio: “Jialin Ding is an Applied Scientist at AWS. Before that\, he received his PhD in computer science from MIT\, advised by Tim Kraska. He works broadly on applying machine learning and optimization techniques to improve data management systems\, with a focus on building databases that automatically self-optimize to achieve high performance for any specific application. His work has appeared in top conferences such as SIGMOD\, VLDB\, and CIDR\, and has been recognized by a Meta Research PhD Fellowship. To learn more about Jialin’s work\, please visit https://jialinding.github.io/.”
URL:https://datascience.ucsd.edu/event/special-seminar-jialin-ding/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240401T110000
DTEND;TZID=America/Los_Angeles:20240401T123000
DTSTAMP:20260601T091604
CREATED:20240328T234406Z
LAST-MODIFIED:20240328T234737Z
UID:10000467-1711969200-1711974600@datascience.ucsd.edu
SUMMARY:How Do We Get There?: Toward Intelligent Behavior Intervention | Xuhai Xu
DESCRIPTION:Abstract: As the intelligence of everyday smart devices continues to evolve\, they can already monitor basic health behaviors such as physical activities and heart rates. The vision of an intelligent behavior change intervention pipeline for health — combining behavior modeling & interaction design — seems to be within reach. How do we get there? \nIn this talk\, I will introduce a comprehensive intervention pipeline that bridges behavior science theory-driven designs and generalizable behavior models. I will also introduce my efforts on passive sensing datasets\, human-centered algorithms\, and a benchmark platform that drives the community toward more robust and deployable intervention systems for health and well-being. \nBio: Xuhai “Orson” Xu is a postdoc at MIT EECS. He received his PhD at the University of Washington. Specializing in human-computer interaction\, applied machine learning\, and health\, Xu develops intelligent behavior intervention systems to promote human health and well-being. His research covers two aspects — 1) building deployable human-centered behavior models and 2) designing interactive user experiences — to establish a complete system to improve end-users’ well-being. Moreover\, his research also goes beyond end-users and supports health experts by designing new human-AI collaboration paradigms in clinical settings. Xu has earned several awards\, including 9 Best Paper\, Best Paper Honorable Mention\, and Best Artifact awards. His research has been covered by media outlets such as the Washington Post and ACM News. He was recognized as the Outstanding Student Award Winner at UbiComp 2022\, the 2023 UW Distinguished Dissertation Award\, and the 2024 Innovation and Technology Award at the Western Association of Graduate Schools.  \nZoom:  https://ucsd.zoom.us/j/92792843021\nPassword: 741675
URL:https://datascience.ucsd.edu/event/how-do-we-get-there-toward-intelligent-behavior-intervention-xuhai-xu/
LOCATION:Computer Science & Engineering Building (CSE)\, Room 1242\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240328T140000
DTEND;TZID=America/Los_Angeles:20240328T153000
DTSTAMP:20260601T091604
CREATED:20240326T031112Z
LAST-MODIFIED:20240326T031112Z
UID:10000464-1711634400-1711639800@datascience.ucsd.edu
SUMMARY:The Emergence of Reproducibility and Generalizability in Diffusion Models | Qing Qu
DESCRIPTION:Abstract: We reveal an intriguing and prevalent phenomenon of diffusion models which we term as “consistent model reproducibility”: given the same starting noise input and a deterministic sampler\, different diffusion models often yield remarkably similar outputs while they generate new samples. We demonstrate this phenomenon through comprehensive experiments and theoretical studies\, implying that different diffusion models consistently reach the same data distribution and scoring function regardless of frameworks\, model architectures\, or training procedures. More strikingly\, our further investigation implies that diffusion models are learning distinct distributions affected by the training data size and model capacity\, so that the model reproducibility manifests in two distinct training regimes with phase transition: (i) “memorization regime”\, where the diffusion model overfits to the training data distribution\, and (ii) “generalization regime”\, where the model learns the underlying data distribution and generate new samples with finite training data. Finally\, our results have strong practical implications regarding training efficiency\, model privacy\, and controllable generation of diffusion models\, and our work raises numerous intriguing theoretical questions for future investigation. \nSpeaker Bio: Qing Qu is an assistant professor in EECS department at the University of Michigan. Prior to that\, he was a Moore-Sloan data science fellow at Center for Data Science\, New York University\, from 2018 to 2020. He received his Ph.D from Columbia University in Electrical Engineering in Oct. 2018. He received his B.Eng. from Tsinghua University in Jul. 2011\, and a M.Sc.from the Johns Hopkins University in Dec. 2012\, both in Electrical and Computer Engineering. His research interest lies at the intersection of foundation of data science\, machine learning\, numerical optimization\, and signal/image processing\, with focus on developing efficient nonconvex methods and global optimality guarantees for solving representation learning and nonlinear inverse problems in engineering and imaging sciences. He is the recipient of Best Student Paper Award at SPARS’15\, and the recipient of Microsoft PhD Fellowship in machine learning in 2016\, and best paper awards in NeurIPS Diffusion Model Workshop in 2023. He received the NSF Career Award in 2022\, and Amazon Research Award (AWS AI) in 2023. He is the program chair of the new Conference on Parsimony & Learning.
URL:https://datascience.ucsd.edu/event/the-emergence-of-reproducibility-and-generalizability-in-diffusion-models-qing-qu/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240328T140000
DTEND;TZID=America/Los_Angeles:20240328T153000
DTSTAMP:20260601T091604
CREATED:20240304T171827Z
LAST-MODIFIED:20240323T082150Z
UID:10000452-1711634400-1711639800@datascience.ucsd.edu
SUMMARY:The Emergence of Reproducibility and Generalizability in Diffusion Models | Qing Qu
DESCRIPTION:Abstract: We reveal an intriguing and prevalent phenomenon of diffusion models which we term as “consistent model reproducibility”: given the same starting noise input and a deterministic sampler\, different diffusion models often yield remarkably similar outputs while they generate new samples. We demonstrate this phenomenon through comprehensive experiments and theoretical studies\, implying that different diffusion models consistently reach the same data distribution and scoring function regardless of frameworks\, model architectures\, or training procedures. More strikingly\, our further investigation implies that diffusion models are learning distinct distributions affected by the training data size and model capacity\, so that the model reproducibility manifests in two distinct training regimes with phase transition: (i) “memorization regime”\, where the diffusion model overfits to the training data distribution\, and (ii) “generalization regime”\, where the model learns the underlying data distribution and generate new samples with finite training data. Finally\, our results have strong practical implications regarding training efficiency\, model privacy\, and controllable generation of diffusion models\, and our work raises numerous intriguing theoretical questions for future investigation. \nBio: “Qing Qu is an assistant professor in EECS department at the University of Michigan. Prior to that\, he was a Moore-Sloan data science fellow at Center for Data Science\, New York University\, from 2018 to 2020. He received his Ph.D from Columbia University in Electrical Engineering in Oct. 2018. He received his B.Eng. from Tsinghua University in Jul. 2011\, and a M.Sc.from the Johns Hopkins University in Dec. 2012\, both in Electrical and Computer Engineering. His research interest lies at the intersection of foundation of data science\, machine learning\, numerical optimization\, and signal/image processing\, with focus on developing efficient nonconvex methods and global optimality guarantees for solving representation learning and nonlinear inverse problems in engineering and imaging sciences.\nHe is the recipient of Best Student Paper Award at SPARS’15\, and the recipient of Microsoft PhD Fellowship in machine learning in 2016\, and best paper awards in NeurIPS Diffusion Model Workshop in 2023. He received the NSF Career Award in 2022\, and Amazon Research Award (AWS AI) in 2023. He is the program chair of the new Conference on Parsimony & Learning.”
URL:https://datascience.ucsd.edu/event/special-seminar-qing-qu/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240327T140000
DTEND;TZID=America/Los_Angeles:20240327T153000
DTSTAMP:20260601T091604
CREATED:20240313T191359Z
LAST-MODIFIED:20240323T081955Z
UID:10000458-1711548000-1711553400@datascience.ucsd.edu
SUMMARY:Towards a Machine Capable of Learning Everything | Hao Liu
DESCRIPTION:Abstract: Large generative models such as ChatGPT have led to amazing results and revolutionized artificial intelligence. In this talk\, I will discuss my research on advancing the foundation of these models\, centered around addressing the architectural bottlenecks of learning from everything. First\, I will describe our efforts to remove context size limitations of the transformer architecture. Our new model architecture and training method allow for nearly infinitely large context sizes without approximations. Our proposed technique has been used for building state-of-the-art open-source and proprietary models. I will then discuss the applications of large context in world model learning and in reinforcement learning\, including Large World Model\, the world’s first multimodal model of million-length scale\, and the required training methodologies. Next\, I will introduce my research on unsupervised exploration that pioneered learning beyond existing knowledge\, allowing unsupervised pretrained models to outperform human experts in gameplay and paving the road for learning beyond imitating existing knowledge. Finally\, I will envision the modeling and training paradigms for the next generation of large generative models we should build\, focusing on advances in neural net architecture\, efficient scaling\, large context reasoning\, and discovery.” \nBio: Hao Liu is a final-year Ph.D. candidate in the Department of Electrical Engineering and Computer Sciences at UC Berkeley\, where he is advised by Pieter Abbeel. During his PhD\, he has also spent two years part-time at Google Brain and DeepMind. His research interests focus on the foundations of generative models\, including machine learning and neural networks\, with the goal of developing computationally scalable solutions for generalization. He recently developed Large World Model (LWM) and architectural advances (BlockwiseTransformers\, and RingAttention) for scaling transformers. Earlier\, he pioneered general and scalable unsupervised exploration (APT and APS). His work on million-length contexts has been influential at Google\, Meta\, and the broader industry. Several of his papers have been presented as spotlight and oral presentations at top-tier machine learning conferences\, and have also been featured in popular media\, including MarkTechPost\, Business Insider\, and ZDNet.
URL:https://datascience.ucsd.edu/event/special-seminar-hao-liu/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240326T140000
DTEND;TZID=America/Los_Angeles:20240326T153000
DTSTAMP:20260601T091604
CREATED:20240323T081729Z
LAST-MODIFIED:20240323T081729Z
UID:10000463-1711461600-1711467000@datascience.ucsd.edu
SUMMARY:Making machine learning predictably reliable | Andrew Ilyas
DESCRIPTION:Abstract: “Despite ML models’ impressive performance\, training and deploying them is currently a somewhat messy endeavor. But does it have to be? In this talk\, I overview my work on making ML “predictably reliable”—enabling developers to know when their models will work\, when they will fail\, and why. \nTo begin\, we use a case study of adversarial inputs to show that human intuition can be a poor predictor of how ML models operate. Motivated by this\, we present a line of work that aims to develop a precise understanding of the ML pipeline\, combining statistical tools with large-scale experiments to characterize the role of each individual design choice: from how to collect data\, to what dataset to train on\, to what learning algorithm to use.” \nBio: “Andrew Ilyas is a PhD student in Computer Science at MIT\, where he is advised by Aleksander Madry and Constantinos Daskalakis. His research aims to improve the reliability and predictability of machine learning systems. He was previously supported by an Open Philanthropy AI Fellowship.”
URL:https://datascience.ucsd.edu/event/making-machine-learning-predictably-reliable-andrew-ilyas/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240326T140000
DTEND;TZID=America/Los_Angeles:20240326T153000
DTSTAMP:20260601T091604
CREATED:20240304T171618Z
LAST-MODIFIED:20240326T030740Z
UID:10000451-1711461600-1711467000@datascience.ucsd.edu
SUMMARY:Making machine learning predictably reliable | Andrew Ilyas
DESCRIPTION:Abstract: “Despite ML models’ impressive performance\, training and deploying them is currently a somewhat messy endeavor. But does it have to be? In this talk\, I overview my work on making ML “predictably reliable”—enabling developers to know when their models will work\, when they will fail\, and why. \nTo begin\, we use a case study of adversarial inputs to show that human intuition can be a poor predictor of how ML models operate. Motivated by this\, we present a line of work that aims to develop a precise understanding of the ML pipeline\, combining statistical tools with large-scale experiments to characterize the role of each individual design choice: from how to collect data\, to what dataset to train on\, to what learning algorithm to use.” \n\nBio “Andrew Ilyas is a PhD student in Computer Science at MIT\, where he is advised by Aleksander Madry and Constantinos Daskalakis. His research aims to improve the reliability and predictability of machine learning systems. He was previously supported by an Open Philanthropy AI Fellowship.”
URL:https://datascience.ucsd.edu/event/special-seminar-andrew-ilyas/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240322T140000
DTEND;TZID=America/Los_Angeles:20240322T153000
DTSTAMP:20260601T091604
CREATED:20240313T190501Z
LAST-MODIFIED:20240321T174552Z
UID:10000457-1711116000-1711121400@datascience.ucsd.edu
SUMMARY:Efficient Deep Learning with Sparsity: Algorithms\, Systems\, and Applications | Zhijian Liu
DESCRIPTION:Abstract: Deep learning is used across a broad spectrum of applications. However\, behind its remarkable performance lies an increasing gap between the demand for and supply of computation. On the demand side\, the computational costs of deep learning models have surged dramatically\, driven by ever-larger input and model sizes. On the supply side\, as Moore’s Law slows down\, hardware no longer delivers increasing performance within the same power budget. \nIn this talk\, I will discuss my research efforts to bridge this demand-supply gap through the lens of sparsity. I will begin with my research on input sparsity. First\, I will introduce algorithms that systematically eliminate the least important patches/tokens from dense input data\, such as images\, enabling up to 60% sparsity without any loss in accuracy. Then\, I will present the system library that we have developed to effectively translate the theoretical savings from sparsity to practical speedups on hardware. Our system is up to 3 times faster than the leading industry solution from NVIDIA. Following this\, I will touch on my research on model sparsity\, highlighting a family of automated\, hardware-aware model compression frameworks that surpass manual solutions in accuracy and reduce the design cycle from weeks of human efforts to mere hours of GPU computation. Finally\, I will demonstrate the use of sparsity to accelerate a wide range of computation-intensive AI applications\, such as autonomous driving\, language modeling\, and high-energy physics. I will conclude this talk with my vision towards building more efficient and accessible AI. \nBio: Zhijian Liu is a Ph.D. candidate at MIT\, advised by Song Han. His research focuses on efficient machine learning and systems. He has developed efficient ML algorithms and provided them with effective system support. He has also contributed to accelerating computation-intensive AI applications in computer vision\, natural language processing\, and scientific discovery. His work has been featured as oral and spotlight presentations at conferences such as NeurIPS\, ICLR\, and CVPR. He was selected as the recipient of the Qualcomm Innovation Fellowship and the NVIDIA Graduate Fellowship. He was also recognized as a Rising Star in ML and Systems by MLCommons and a Rising Star in Data Science by UChicago and UCSD. Previously\, he was the founding research scientist at OmniML\, which was acquired by NVIDIA.
URL:https://datascience.ucsd.edu/event/special-seminar-zhijian-liu/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240320T140000
DTEND;TZID=America/Los_Angeles:20240320T153000
DTSTAMP:20260601T091604
CREATED:20240313T184800Z
LAST-MODIFIED:20240318T230405Z
UID:10000456-1710943200-1710948600@datascience.ucsd.edu
SUMMARY:Understanding Deep Learning through Optimization Geometry|  Nati (Nathan) Srebro
DESCRIPTION:Abstract: How can models with more parameters than training examples generalize well\, and generalize even better when we add even more parameters\, even without explicit complexity control?  In recent years\, it is becoming increasingly clear that much\, or perhaps all\, of the complexity control and generalization ability of deep learning comes from the optimization bias\, or implicit bias\, of the training procedures.  In this talk\, I will survey our work from the past several years on highlighting the role of optimization geometry in determining such implicit bias\, and understanding deep learning through it\, and how this view influences the study of further deep learning phenomena. \nBio: Nati (Nathan) Srebro is a professor at the Toyota Technological Institute at Chicago\, with cross-appointments at the University of Chicago’s Department of Computer Science\, and Committee on Computational and Applied Mathematics. He obtained his PhD from the Massachusetts Institute of Technology in 2004\, and previously was a postdoctoral fellow at the University of Toronto\, a visiting scientist at IBM\, and an associate professor at the Technion\, and held visiting position at the Weizmann Institute and at École Polytechnique Fédérale de Lausanne. \nDr. Srebro’s research encompasses methodological\, statistical and computational aspects of machine learning\, as well as related problems in optimization. Some of Srebro’s significant contributions include work on learning “wider” Markov networks\, introducing the use of the nuclear norm for machine learning\, introducing the “equalized odds” fairness notion for non-discrimination\, work on fast optimization techniques for machine learning\, and on the relationship between learning and optimization. \nWebsite: https://nati.ttic.edu/
URL:https://datascience.ucsd.edu/event/special-seminar-nathan-srebro/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240320T100000
DTEND;TZID=America/Los_Angeles:20240320T110000
DTSTAMP:20260601T091604
CREATED:20240318T224758Z
LAST-MODIFIED:20240318T224925Z
UID:10000461-1710928800-1710932400@datascience.ucsd.edu
SUMMARY:TILOS Seminar: How Large Models of Language and Vision Help Agents to Learn to Behave
DESCRIPTION:Roy Fox\, Assistant Professor and Director of the Intelligent Dynamics Lab at UC Irvine\nHDSI 123 and Zoom (Link below) \nAbstract: If learning from data is valuable\, can learning from big data be very valuable? So far\, it has been so in vision and language\, for which foundation models can be trained on web-scale data to support a plethora of downstream tasks; not so much in control\, for which scalable learning remains elusive. Can information encoded in vision and language models guide reinforcement learning of control policies? In this talk\, I will discuss several ways for foundation models to help agents to learn to behave. Language models can provide better context for decision-making: we will see how they can succinctly describe the world state to focus the agent on relevant features; and how they can form generalizable skills that identify key subgoals. Vision and vision–language models can help the agent to model the world: we will see how they can block visual distractions to keep state representations task-relevant; and how they can hypothesize about abstract world models that guide exploration and planning. \nBio: Roy Fox is an Assistant Professor of Computer Science at the University of California\, Irvine. His research interests include theory and applications of control learning: reinforcement learning (RL)\, control theory\, information theory\, and robotics. His current research focuses on structured and model-based RL\, language for RL and RL for language\, and optimization in deep control learning of virtual and physical agents.
URL:https://datascience.ucsd.edu/event/tilos-seminar-how-large-models-of-language-and-vision-help-agents-to-learn-to-behave/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2023/10/TILOS-Square_HDSI-Website-e1712854679822.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240315T110000
DTEND;TZID=America/Los_Angeles:20240315T143000
DTSTAMP:20260601T091604
CREATED:20240122T000022Z
LAST-MODIFIED:20240315T011310Z
UID:10000428-1710500400-1710513000@datascience.ucsd.edu
SUMMARY:HDSI Capstone Showcase
DESCRIPTION:
URL:https://dsc-capstone.org/showcase-24/
LOCATION:Price Center East Ballroom\, 9500 Gilman Drive\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Student Event
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image-1.png
END:VEVENT
BEGIN:VEVENT
DTSTART;VALUE=DATE:20240315
DTEND;VALUE=DATE:20240317
DTSTAMP:20260601T091604
CREATED:20240307T172716Z
LAST-MODIFIED:20240307T172716Z
UID:10000454-1710460800-1710633599@datascience.ucsd.edu
SUMMARY:HDSI LLM Workshop
DESCRIPTION:
URL:https://ucsd-hdsi-llm-workshop.github.io/2024/index.html
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Workshops
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image-e1712856546428.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240313T130000
DTEND;TZID=America/Los_Angeles:20240313T140000
DTSTAMP:20260601T091604
CREATED:20240313T195909Z
LAST-MODIFIED:20240313T195909Z
UID:10000460-1710334800-1710338400@datascience.ucsd.edu
SUMMARY:Domain Counterfactuals for Trustworthy ML via Sparse Interventions | David I. Inouye
DESCRIPTION:Talk Abstract: \nAlthough incorporating causal concepts into deep learning shows promise for increasing explainability\, fairness\, and robustness\, existing methods require unrealistic assumptions and aim to recover the full latent causal model. This talk proposes an alternative: domain counterfactuals. Domain counterfactuals ask a more concrete question: “What would a sample look like if it had been generated in a different domain (or environment)?”   This avoids the challenges of full causal recovery while answering an important causal query. I will theoretically analyze the domain counterfactual problem for invertible causal models and prove an estimation bound that depends on the sparsity of intervention\, i.e.\, the number of intervened causal variables.  Leveraging this theory\, I will introduce a practical counterfactual estimation algorithm that outperforms baselines. Additionally\, I will showcase the potential of domain counterfactuals for counterfactual fairness and domain generalization through preliminary results. Finally\, I will connect this work to my broader research focus on distribution matching\, highlighting its potential as a foundational tool for building trustworthy machine learning systems. \nBio: \nProf. David I. Inouye is an assistant professor in the Elmore Family School of Electrical and Computer Engineering at Purdue University. His lab focuses on trustworthy machine learning (ML)\, which aims to make ML systems more robust\, causal and explainable. Currently\, he is interested in advancing distribution matching algorithms and applications such as causality\, domain generalization\, and distribution shift explanations. He is also interested in highly robust distributed learning algorithms on a network of devices\, called Internet Learning. His research is funded by ARL\, ONR\, and NSF. Previously\, he was a postdoc at Carnegie Mellon University working with Prof. Pradeep Ravikumar. He completed his Computer Science PhD at The University of Texas at Austin in 2017 advised by Prof. Inderjit Dhillon and Prof. Pradeep Ravikumar. He was awarded the NSF Graduate Research Fellowship (NSF GRFP).
URL:https://datascience.ucsd.edu/event/domain-counterfactuals-for-trustworthy-ml-via-sparse-interventions-david-i-inouye/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 404\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240312T140000
DTEND;TZID=America/Los_Angeles:20240312T153000
DTSTAMP:20260601T091604
CREATED:20240304T171426Z
LAST-MODIFIED:20240311T181154Z
UID:10000450-1710252000-1710257400@datascience.ucsd.edu
SUMMARY:From Pixels to Measurements: Understanding the Dynamic World ~ Adam Harley
DESCRIPTION:In computer vision\, “video understanding” typically concerns summarization: tracking the main objects\, or describing the main actions. While progress here has been impressive\, many practical applications require extracting information which is much more fine-grained. For example\, biologists are highly interested in tracking specific key points of organisms in long video recordings. Algorithms for such tasks require the generality and precision of low-level vision methods (e.g.\, optical flow)\, but benefit from knowledge about the physical world (e.g.\, things continue to exist while they are occluded). In this talk\, I will present our progress on this crucial space of problems. Our central contribution is to widen the window of “temporal context” used for inference: instead of tracking entities from one frame to the next\, we inspect dozens of frames simultaneously\, and return an answer that makes sense for the full clip. I will discuss the methods and datasets that we have created to drive progress along these lines\, and highlight natural science applications of the work. Finally\, I will introduce our ongoing effort to produce a “foundation model” of motion\, aiming to deliver arbitrary-granularity tracking for a huge variety of real-world situations. \nAdam is a postdoctoral scholar at Stanford University\, working with Leonidas Guibas. He received a Ph.D. in robotics from Carnegie Mellon University\, where he worked with Katerina Fragkiadaki. He received his M.S. in Computer Science at Toronto Metropolitan University\, working with Kosta Derpanis. Adam is a recipient of the NSERC PGS-D scholarship\, and the Toronto Metropolitan University Gold Medal. His research interests lie in Computer Vision and Machine Learning\, particularly for 3D understanding and fine-grained tracking.
URL:https://datascience.ucsd.edu/event/special-seminar-adam-harley/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240311T140000
DTEND;TZID=America/Los_Angeles:20240311T153000
DTSTAMP:20260601T091604
CREATED:20240304T171239Z
LAST-MODIFIED:20240304T171239Z
UID:10000449-1710165600-1710171000@datascience.ucsd.edu
SUMMARY:Special Seminar | Zhuang Liu
DESCRIPTION:Talk info: to be provided
URL:https://datascience.ucsd.edu/event/special-seminar-zhuang-liu/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240308T120000
DTEND;TZID=America/Los_Angeles:20240308T130000
DTSTAMP:20260601T091604
CREATED:20240226T212933Z
LAST-MODIFIED:20240318T225308Z
UID:10000447-1709899200-1709902800@datascience.ucsd.edu
SUMMARY:TILOS Webinar: AI Ethics in Research
DESCRIPTION:The Ethics and Early Career Committee would like to invite you to our upcoming webinar on AI Ethics in Research. This will take place virtually through Zoom on Friday\, March 8th at noon Pacific\, 2pm Central\, 3pm Eastern (https://nu.zoom.us/j/2183621123). \nPlease join Dr. Nisheeth Vishnoi from Yale and Dr. David Danks from UC San Diego who will discuss their Research in AI Ethics. Professor Danks develops practical frameworks and methods to incorporate ethical and policy considerations throughout the AI lifecycle\, including different ways to include them in optimization steps. Bias and fairness have been a particular focus given the multiple ways in which they can be measured\, represented\, and used. Professor Vishnoi uses optimization as a lens to study how subjective human and societal biases emerge in the objective world of artificial algorithms\, as well as how to design strategies to mitigate these biases. \nThis event is a great opportunity to learn about the constantly evolving issues of AI Ethics in research and the societal impact of AI. It will also provide a platform for students to gain insights and valuable advice that can help them in their future career pursuits. \nNisheeth Vishnoi is the A. Bartlett Giamatti Professor of Computer Science and a co-founder of the Computation and Society Initiative at Yale University. He studies the foundations of computation\, and his research spans several areas of theoretical computer science\, optimization\, and machine learning.  He is also interested in understanding nature and society from a computational viewpoint. Here\, his current focus includes understanding the emergence of intelligence and developing methods to address ethical issues at the interface of artificial intelligence and humanity. \nDavid Danks is Professor of Data Science & Philosophy and affiliate faculty in Computer Science & Engineering at University of California\, San Diego. His research interests range widely across philosophy\, cognitive science\, and machine learning\, including their intersection. Danks has examined the ethical\, psychological\, and policy issues around AI and robotics across multiple sectors\, including transportation\, healthcare\, privacy\, and security. He has also done significant research in computational cognitive science and developed multiple novel causal discovery algorithms for complex types of observational and experimental data. Danks is the recipient of a James S. McDonnell Foundation Scholar Award\, as well as an Andrew Carnegie Fellowship. He currently serves on multiple advisory boards\, including the National AI Advisory Committee.
URL:https://datascience.ucsd.edu/event/tilos-webinar-ai-ethics-in-research/
LOCATION:Virtual
CATEGORIES:Webinar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2023/10/TILOS-Square_HDSI-Website-e1712854679822.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240306T170000
DTEND;TZID=America/Los_Angeles:20240306T183000
DTSTAMP:20260601T091604
CREATED:20240209T162735Z
LAST-MODIFIED:20240209T163214Z
UID:10000438-1709744400-1709749800@datascience.ucsd.edu
SUMMARY:The Ethical and Policy Implications of Artificial Intelligence
DESCRIPTION:The Institute for Practical Ethics welcomes David Danks as the 2024 keynote speaker. \nDanks\, a UC San Diego professor in the Department of Philosophy and Halıcıoğlu Data Science Institute\, is an expert researcher at the intersection of philosophy\, cognitive science and machine learning. He serves on multiple boards\, including the United States National AI Advisory Committee. \nArtificial intelligence is seemingly everywhere today\, both in public perception and in our everyday lives. This growth has led to many stories about the widespread harms that can result from AI done poorly. As a result\, there are now numerous demands for ‘ethical AI\,’ but relatively little understanding of what that might involve. \nIn this keynote\, David Danks will explore the nature of responsible AI\, arguing that it involves much more than code or data. He will critically assess current approaches to producing more responsible AI\, then suggest key policy and practical approaches that would likely be more effective. It is critical we create more responsible AI\, but that will require rethinking many of our current practices in academia\, government and industry.
URL:https://www.eventbrite.com/e/the-ethical-and-policy-implications-of-artificial-intelligence-tickets-817599541237?aff=ipewebsite
LOCATION:Sanford Consortium
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/jpeg:https://datascience.ucsd.edu/wp-content/uploads/2024/02/IPE_David-Danks.jpg
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240304T140000
DTEND;TZID=America/Los_Angeles:20240304T150000
DTSTAMP:20260601T091604
CREATED:20240304T170615Z
LAST-MODIFIED:20240304T170615Z
UID:10000448-1709560800-1709564400@datascience.ucsd.edu
SUMMARY:Evaluating and Designing Computing Systems for the Future of Work | Hancheng Cao
DESCRIPTION:Abstract: From collaborative software to generative AI\, computing technologies are redefining the way we work\, communicate and collaborate. Yet with the growing complexities of computing platforms\, it becomes increasingly challenging to foresee their impacts on human behavior\, leading to not only poor user experience but also problematic applications that mirror and amplify societal issues. How can we better understand machine behavior and machine-mediated user behavior over computing platforms? How can we build applications that align with our needs and values with emerging computing technologies? My research aims to answer these questions through novel measurements and computational methods inspired by social science insights\, such as mining increasingly available large-scale data on how people build\, adopt\, and interact with computing systems. In this talk\, I will present my work demonstrating this approach in the future of work context\, where I develop data-driven\, AI-powered and human-centered methods to understand\, evaluate and design sociotechnical systems at the workplace. I will present an analysis of remote meeting experience through mining millions of meetings\, a study on how an AI algorithm can be built to predict team fracture\, and a development and evaluation study on a generative AI-based scientific feedback system for researchers. These projects exemplify the opportunities to leverage computation and data to better understand\, support and augment work practices.         \nBio: Hancheng Cao is a final year PhD candidate in computer science (with a PhD minor in management science and engineering) at Stanford University working with Prof. Daniel McFarland and Prof. Michael Bernstein. He works in the field of computational social science and human computer interaction\, where he mines large-scale data\, develops algorithms and builds systems to study human behavior. Recognized as a Stanford Interdisciplinary Graduate Fellow\, he has published 30 academic papers across fields\, with three works he led recognized as Best Paper (CHI 2023) or Honorable Mention (CSCW 2020\, CHI 2021) awards. His research has also appeared in leading social science journals (e.g. American Sociological Review). His research has been widely covered in the media\, including Wired\, Forbes\, New Scientist\, TED among others.
URL:https://datascience.ucsd.edu/event/evaluating-and-designing-computing-systems-for-the-future-of-work-hancheng-cao/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;VALUE=DATE:20240304
DTEND;VALUE=DATE:20240307
DTSTAMP:20260601T091604
CREATED:20240304T170710Z
LAST-MODIFIED:20240304T171028Z
UID:10000445-1709510400-1709769599@datascience.ucsd.edu
SUMMARY:EnCORE Workshop | Old Questions and New Directions in Theory of Clustering
DESCRIPTION:We are hosting an EnCORE workshop on Old Questions and New Directions in Theory of Clustering at UCSD from March 4th to 6th\, 2024. While in person registration is closed due to limited seats availability\, you can register to attend the workshop virtually here: https://sites.google.com/view/clusteringinsandiego \nWe have a stellar lineup of speakers and we hope the workshop will generate many new questions to work on. \n 
URL:https://sites.google.com/view/clusteringinsandiego
LOCATION:Virtual
CATEGORIES:Workshops
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2023/10/Encore-logo_HDSI-Website.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240226T123000
DTEND;TZID=America/Los_Angeles:20240226T140000
DTSTAMP:20260601T091604
CREATED:20240213T221414Z
LAST-MODIFIED:20240220T235645Z
UID:10000443-1708950600-1708956000@datascience.ucsd.edu
SUMMARY:Building Human-AI Alignment: Specifying\, Inspecting\, and Modeling AI Behaviors | Serena Booth
DESCRIPTION:Abstract: The learned behaviors of AI and robot agents should align with the intentions of their human designers. Alignment is necessary for AI systems to be used in many sectors of the economy\, and so the process of aligning AI systems becomes critical to study for defining effective AI policy. Toward this goal\, people must be able to easily specify\, inspect\, and model agent behaviors. For specifications\, we will consider expert-written reward functions for reinforcement learning (RL) and non-expert preferences for reinforcement learning from human feedback (RLHF). I will show evidence that experts are bad at writing reward functions: even in a trivial setting\, experts write specifications that are overfit to a particular RL algorithm\, and they often write erroneous specifications for agents that fail to encode their true intent. I will also show that the common approach to learning a reward function from non-experts in RLHF uses an inductive bias that fails to encode how humans express preferences\, and that our proposed bias better encodes human preferences both theoretically and empirically. I will discuss the policy implications: namely\, that engineers’ design processes and embedded assumptions in building AI must be considered. For inspection\, humans must be able to assess the behaviors an agent learns from a given specification. I will discuss a method to find settings that exhibit particular behaviors\, like out-of-distribution failures. I will discuss the policy implications for testing AI systems\, for example through red teaming. Lastly\, cognitive science theories attempt to show how people build conceptual models that explain agent behaviors. I will show evidence that some of these theories are used in research to support humans\, but that we can still build better curricula for modeling. I will discuss the policy need for careful onboarding to AI systems. I will end by discussing my current work in the U.S. Senate on responding to the proliferation of AI. Collectively\, my research provides evidence that—even with the best of intentions— current human-AI systems often fail to induce alignment\, and my research proposes promising directions for how to build better aligned human-AI systems.
URL:https://datascience.ucsd.edu/event/special-seminar-serena-booth/
LOCATION:GPS\, Robinson Building Complex (RBC)\, 3106
CATEGORIES:Seminar
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240222T140000
DTEND;TZID=America/Los_Angeles:20240222T150000
DTSTAMP:20260601T091604
CREATED:20240126T183316Z
LAST-MODIFIED:20240202T225717Z
UID:10000431-1708610400-1708614000@datascience.ucsd.edu
SUMMARY:The continuum of gene regulation at single cell resolution\, from Drosophila development to human complex traits | Diego Calderon
DESCRIPTION:Single-cell technologies have emerged as powerful tools for studying development\, enabling comprehensive surveys of cellular diversity at profiled timepoints. They shed light on the dynamics of regulatory element activity and gene expression changes during the emergence of each cell type. Despite their potential\, nearly all atlases of embryogenesis are constrained by sampling density\, i.e.\, the number of discrete time points at which individual embryos are harvested. This limitation affects the resolution at which regulatory transitions can be characterized. In this talk\, I present a novel cell collection approach capable of constructing a continuous representation of dynamic regulatory processes. I applied this approach to generate a continuous\, single-cell atlas of chromatin accessibility and gene expression spanning Drosophila embryogenesis. Additionally\, I will discuss my past and future research\, applying new genomic technologies to characterize gene regulation important for human diseases.
URL:https://datascience.ucsd.edu/event/special-seminar-diego-calderon/
LOCATION:Powell-Focht Bioengineering Hall (PFBH)\, FUNG Auditorium
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240221T140000
DTEND;TZID=America/Los_Angeles:20240221T153000
DTSTAMP:20260601T091604
CREATED:20240205T171619Z
LAST-MODIFIED:20240220T172031Z
UID:10000436-1708524000-1708529400@datascience.ucsd.edu
SUMMARY:The Synergy between Machine Learning and the Natural Sciences | Max Welling
DESCRIPTION:Abstract: Traditionally machine learning has been heavily influenced by neuroscience (hence the name artificial neural networks) and physics (e.g. MCMC\, Belief Propagation\, and Diffusion based Generative AI). We have recently witnessed that the flow of information has also reversed\, with new tools developed in the ML community impacting physics\, chemistry and biology. Examples include faster DFT\, Force-Field accelerated MD simulations\, PDE Neural Surrogate models\, generating druglike molecules\, and many more. In this talk I will review the exciting opportunities for further cross fertilization between these fields\, ranging from faster (classical) DFT calculations and enhanced transition path sampling to traveling waves in artificial neural networks. \nBio: Prof. Max Welling is a research chair in Machine Learning at the University of Amsterdam and a Distinguished Scientist at MSR. He is a fellow at the Canadian Institute for Advanced Research (CIFAR) and the European Lab for Learning and Intelligent Systems (ELLIS) where he also serves on the founding board. His previous appointments include VP at Qualcomm Technologies\, professor at UC Irvine\, postdoc at U. Toronto and UCL under supervision of prof. Geoffrey Hinton\, and postdoc at Caltech under supervision of prof. Pietro Perona. He finished his PhD in theoretical high energy physics under supervision of Nobel laureate prof. Gerard ‘t Hooft. \nMax Welling has served as associate editor in chief of IEEE TPAMI from 2011-2015\, he serves on the advisory board of the Neurips foundation since 2015 and has been program chair and general chair of Neurips in 2013 and 2014 respectively. He was also program chair of AISTATS in 2009 and ECCV in 2016 and general chair of MIDL 2018. Max Welling is recipient of the ECCV Koenderink Prize in 2010 and the ICML Test of Time award in 2021. He directs the Amsterdam Machine Learning Lab (AMLAB) and co-directs the Qualcomm-UvA deep learning lab (QUVA) and the Bosch-UvA Deep Learning lab (DELTA).
URL:https://datascience.ucsd.edu/event/distinguished-colloquium-max-welling/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Guest Lecture
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/02/Max_Welling_DLS_1240x650.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240221T123000
DTEND;TZID=America/Los_Angeles:20240221T140000
DTSTAMP:20260601T091604
CREATED:20240220T163858Z
LAST-MODIFIED:20240220T163858Z
UID:10000444-1708518600-1708524000@datascience.ucsd.edu
SUMMARY:Computational approaches for uncovering implicit strategies in political discourse | Julia Mendelsohn
DESCRIPTION:When discussing politics\, people often use subtle linguistic strategies to influence how their audience thinks about issues\, which can then impact public opinion and policy. For example\, anti-immigration activists may frame immigration as a threat to native born citizens’ jobs\, describe immigrants with dehumanizing vermin-related metaphors\, or even use coded expressions to covertly connect immigration with antisemitic conspiracy theories. This talk will focus on the development of computational approaches to analyze three strategies: framing\, dehumanization\, and dogwhistle communication. I will discuss how I draw from multiple social science disciplines to develop typologies and curate data resources\, as well as how I build and evaluate natural language processing models for detecting these strategies. I further analyze the use of these strategies in political discourse across several domains\, and assess the implications of such nuanced rhetoric for both society and technology.
URL:https://datascience.ucsd.edu/event/computational-approaches-for-uncovering-implicit-strategies-in-political-discourse-julia-mendelsohn-2/
LOCATION:GPS\, Robinson Building Complex (RBC)\, 3106
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240220T130000
DTEND;TZID=America/Los_Angeles:20240220T143000
DTSTAMP:20260601T091604
CREATED:20240220T182547Z
LAST-MODIFIED:20240220T182547Z
UID:10000446-1708434000-1708439400@datascience.ucsd.edu
SUMMARY:Learning Inductive Representations for Reasoning over Knowledge Graphs | Zhaocheng Zhu
DESCRIPTION:Abstract: Reasoning\, the ability to logically draw conclusions from existing knowledge\, has been long pursued as a goal of artificial intelligence. Although numerous learning algorithms have been developed for reasoning\, most of them are limited to the domain they are trained on. By contrast\, humans often derive high-level rules or principles from experience and apply them to new domains — an ability referred as inductive generalization. In this talk\, we present a series of works that learn inductive representations for reasoning over knowledge graphs. First\, we introduce Neural Bellman-Ford Networks (NBFNet) that captures paths between entities and can generalize to graphs of new entities. Then we discuss Graph Neural Network Query Executor (GNN-QE)\, an extension of NBNet that answers multi-hop logical queries and generalizes well on our inductive benchmark. Finally\, by learning inductive representations for both entities and relations\, we demonstrate that a model can generalize to any graph with arbitrary entity and relation vocabularies\, paving the way for foundation models for knowledge graph reasoning. \n \nBio: Zhaocheng Zhu is a final-year Ph.D. candidate advised by Prof. Jian Tang at Mila – Quebec AI Institute\, University of Montreal. His research interests include reasoning\, knowledge graphs and large language models. His works\, among the first to study inductive generalization across structures\, have led to a paradigm shift away from traditional knowledge graph embedding methods that have been used for years. He gave a tutorial on knowledge graph reasoning at AAAI 2022. He is also an active developer of machine learning systems\, and led the development of two open-source libraries\, GraphVite for large-scale embedding training and TorchDrug for drug discovery research.
URL:https://datascience.ucsd.edu/event/learning-inductive-representations-for-reasoning-over-knowledge-graphs-zhaocheng-zhu/
LOCATION:Computer Science & Engineering Building (CSE)\, Room 1242\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240215T123000
DTEND;TZID=America/Los_Angeles:20240215T140000
DTSTAMP:20260601T091604
CREATED:20240213T220919Z
LAST-MODIFIED:20240213T221018Z
UID:10000441-1708000200-1708005600@datascience.ucsd.edu
SUMMARY:Targeting humanitarian aid with machine learning and digital data | Emily Aiken
DESCRIPTION:Abstract: The majority of humanitarian aid and social protection programs globally are targeted\, providing assistance to individuals or communities identified to be poorest or most in need. In low- and middle-income countries\, the targeting of aid programs is often limited by low-quality\, out-of-date\, or missing data on poverty and vulnerability. Novel “big” digital data sources\, such as those captured by satellites\, mobile phones\, and financial services providers — when combined with advances in machine learning — can improve the accuracy of aid program targeting. In this talk\, I will cover empirical results on the accuracy of these new data-driven and algorithmic approaches to aid allocation\, and will discuss emergent implications for fairness\, privacy\, transparency\, and community dynamics.
URL:https://datascience.ucsd.edu/event/targeting-humanitarian-aid-with-machine-learning-and-digital-data-emily-aiken/
LOCATION:GPS\, Robinson Building Complex (RBC)\, 3203
CATEGORIES:Seminar
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240214T140000
DTEND;TZID=America/Los_Angeles:20240214T153000
DTSTAMP:20260601T091604
CREATED:20240201T193829Z
LAST-MODIFIED:20240220T172238Z
UID:10000434-1707919200-1707924600@datascience.ucsd.edu
SUMMARY:Enabling Performant and Trustworthy Learning-enabled CPS-IoT Systems | Mani Srivastava
DESCRIPTION:Abstract: “The previously discrete technologies of IoT and AI have now entered a tight virtuous embrace. IoT allows sensing and actuation in our physical\, social\, and urban spaces with unimaginable ubiquity. AI allows sophisticated inferences and decisions to be made algorithmically using deep neural networks\, even from unstructured and high- dimensional data\, with uncanny performance. Together they seek to perform sophisticated perception-cognition-communication-action loops in diverse applications. However\, designers of learning-enabled IoT systems face the challenge of extremely resource-constrained edge platforms operating in uncertain environments while assuring performance and trustworthiness. Moreover\, in many applications\, the systems go beyond taking actions based on rich inferences about the world state to perform long-term reasoning about complex events and obey the underlying physics\, rules\, and constraints. Based on our experience in designing such systems in applications including mHealth\, ocean animal health\, agriculture robotics\, and military\, This talk explores meeting these challenges through a combination of (i) neurosymbolic architectures that allow the incorporation of physics awareness and human knowledge while enhancing user trust\, (ii) automatic platform-aware architecture search and code generation\, and (iii) techniques to efficiently adapt to the deployment environment.”            \n \nBio: “Mani Srivastava is Distinguished Professor and Vice Chair at UCLA’s ECE Department with a joint appointment in the CS Department. His research is broadly in human-cyber-physical and IoT systems that are learning-enabled\, resource- constrained\, and trustworthy. It spans problems across the entire spectrum of applications\, architectures\, algorithms\, and technologies in the context of systems and applications for mHealth\, sustainable buildings\, smart environments\, etc. He is a Fellow of the ACM and the IEEE.”    
URL:https://datascience.ucsd.edu/event/special-seminar-mani-srivastava/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240213T130000
DTEND;TZID=America/Los_Angeles:20240213T140000
DTSTAMP:20260601T091604
CREATED:20240209T164305Z
LAST-MODIFIED:20240209T164305Z
UID:10000440-1707829200-1707832800@datascience.ucsd.edu
SUMMARY:On Data Ecology\, Data Markets\, the Value of Data\, and Dataflow Governance | Raul Castro Fernandez
DESCRIPTION:Abstract:\nData shapes our social\, economic\, cultural\, and technological environments. Data is valuable\, so people seek it\, inducing data to flow. The resulting dataflows distribute data and thus value. For example\, large Internet companies profit from accessing data from their users\, and engineers of large language models seek large and diverse data sources to train powerful models. It is possible to judge the impact of data in an environment by analyzing how the dataflows in that environment impact the participating agents. My research hypothesizes that it is also possible to design (better) data environments by controlling what dataflows materialize; not only can we analyze environments but also synthesize them. In this talk\, I present the research agenda on “data ecology\,” which seeks to build the principles\, theory\, algorithms\, and systems to design beneficial data environments. I will also present examples of data environments my group has designed\, including data markets for machine learning\, data-sharing\, and data integration. I will conclude by discussing the impact of dataflows in data governance and how the ideas are interwoven with the concepts of trust\, privacy\, and the elusive notion of “data value.” As part of the technical discussion\, I will complement the data market designs with the design of a data escrow system that permits controlling dataflows.\nBio:\nIn my research\, I ask what is the value of data and explore the potential of data markets to unlock that value. My group collaborates with economists\, legal scholars\, statisticians\, and domain scientists. We build systems to share\, discover\, prepare\, integrate\, and process data. I have traditionally worked on distributed query processing systems and continue to do so. I have received a SIGMOD’23 Test-of-time-Award. I am an assistant professor in the Department of Computer Science and on the Committee of Data Science at The University of Chicago. Before UChicago\, I did a postdoc at MIT with Sam Madden and Mike Stonebraker. And before that\, I completed a PhD at Imperial College London with Peter Pietzuch.
URL:https://datascience.ucsd.edu/event/on-data-ecology-data-markets-the-value-of-data-and-dataflow-governance-raul-castro-fernandez/
LOCATION:CA
CATEGORIES:Colloquium,Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image-e1712856546428.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240213T080000
DTEND;TZID=America/Los_Angeles:20240213T170000
DTSTAMP:20260601T091605
CREATED:20240213T221205Z
LAST-MODIFIED:20240213T221205Z
UID:10000442-1707811200-1707843600@datascience.ucsd.edu
SUMMARY:Computational approaches for uncovering implicit strategies in political discourse | Julia Mendelsohn
DESCRIPTION:Abstract: When discussing politics\, people often use subtle linguistic strategies to influence how their audience thinks about issues\, which can then impact public opinion and policy. For example\, anti-immigration activists may frame immigration as a threat to native born citizens’ jobs\, describe immigrants with dehumanizing vermin-related metaphors\, or even use coded expressions to covertly connect immigration with antisemitic conspiracy theories. This talk will focus on the development of computational approaches to analyze three strategies: framing\, dehumanization\, and dogwhistle communication. I will discuss how I draw from multiple social science disciplines to develop typologies and curate data resources\, as well as how I build and evaluate natural language processing models for detecting these strategies. I further analyze the use of these strategies in political discourse across several domains\, and assess the implications of such nuanced rhetoric for both society and technology.
URL:https://datascience.ucsd.edu/event/computational-approaches-for-uncovering-implicit-strategies-in-political-discourse-julia-mendelsohn/
LOCATION:GPS\, Robinson Building Complex (RBC)\, 3106
CATEGORIES:Seminar
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240213T023000
DTEND;TZID=America/Los_Angeles:20240213T160000
DTSTAMP:20260601T091605
CREATED:20240207T223735Z
LAST-MODIFIED:20240220T172208Z
UID:10000437-1707791400-1707840000@datascience.ucsd.edu
SUMMARY:Principled Approaches for Trustworthy Algorithms\, Statistics\, and Machine Learning | Gautam Kamath
DESCRIPTION:Abstract: Despite impressive recent advances\, machine learning models exhibit a number of critical deficiencies. They are prone to leaking sensitive information about their training data. They remain alarmingly brittle to attacks by malicious parties. Troublingly\, these issues stem from more fundamental statistical vulnerabilities\, which remain unresolved even decades later\, highlighting significant gaps in our understanding of how to deal with these important considerations. As long as these problems remain\, our models will not be appropriate for use beyond deployment in toy settings. In this talk\, I will discuss recent advances on a number of these problems\, which give key new algorithmic insights into how to address these considerations\, and enable real-world deployments that were previously thought infeasible. In a first vignette\, we will explore how to guarantee individual privacy in machine learning models\, with a particular focus on large language models and the important role played by public data in the training pipeline. In a second vignette\, we focus on how to robustly perform mean estimation\, giving the first efficient and accurate algorithms for multivariate settings. We will go on to discuss connections to robustness against data poisoning attacks\, robust exploratory data analysis\, and surprising conceptual and technical connections with privacy. \nBio: Gautam Kamath is an Assistant Professor at the University of Waterloo\, and a Faculty Member and Canada CIFAR AI Chair at the Vector Institute for Artificial Intelligence. His research interests are in trustworthy algorithms\, statistics\, and machine learning\, particularly focusing on considerations like data privacy and robustness. He has a B.S. from Cornell University and a Ph.D. from MIT. He is the recipient of the 2023 Golden Jubilee Research Excellence Award\, recognizing him as the most outstanding junior researcher in the University of Waterloo’s Faculty of Math. Beyond research\, he is celebrated for his teaching. His course on differential privacy is the most popular resource for learning the topic\, with his lecture videos having over 100\,000 views. He has also given invited tutorials on the topic in multiple different countries. He is further well known for his passion and commitment to service and improving the community. Besides organizing and chairing several workshops and conferences\, he is an Editor-in-Chief of Transactions on Machine Learning Research\, and on the Executive Committee of the Learning Theory Alliance.
URL:https://datascience.ucsd.edu/event/principled-approaches-for-trustworthy-algorithms-statistics-and-machine-learning-gautam-kamath/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
BEGIN:VEVENT
DTSTART;TZID=America/Los_Angeles:20240212T140000
DTEND;TZID=America/Los_Angeles:20240212T153000
DTSTAMP:20260601T091605
CREATED:20240126T182854Z
LAST-MODIFIED:20240220T172343Z
UID:10000430-1707746400-1707751800@datascience.ucsd.edu
SUMMARY:Integrating Longitudinal Multimodal Data To Realize Precision Medicine | Samantha Piekos
DESCRIPTION:Abstract: The interplay of biology\, environment\, and lifestyle direct the development and progression of complex diseases and other health outcomes. Therefore\, integration of longitudinal multimodal data is needed to understand the mechanisms underpinning major molecular transitions. Previously during my doctoral work at Stanford\, I integrated multiomics data to elucidate the epigenetic mechanism of human surface ectoderm differentiation. I also built a pipeline to investigate the role of polymorphism\, particularly non-coding genetic variants\, in complex diseases. To address the common pain point of data silos limiting the interpretation of multimodal data integration\, I formed a collaboration with Google Data Commons to build a free\, open-source biomedical knowledge graph with a common schema and API. Currently it is composed of approximately 130 million nodes and 1.7 trillion triples (node-edge-node) from 22 publicly available biomedical datasets. Knowledge graphs are a key tool for hypothesis generation\, data interpretation\, and dimensionality reduction required for systems medicine research. Upon starting my postdoctoral work at the Institute for Systems Biology\, I identified pregnancy as an excellent model system for prototyping precision medicine approaches. I used electronic healthcare records (EHR) from Providence St. Joseph Healthcare to investigate the impact of COVID-19 maternal infection and vaccination on maternal-fetal outcomes. In addition\, I integrated multiomics placental data to investigate molecular network changes (interomics and intraomics) in common obstetric disorders. In a follow-up study (enrollment complete) we have longitudinal deep-phenotyping data of 435 people throughout pregnancy 80 of which have pregnancy complications. This includes multiomics\, survey\, EHR\, and air quality data collected from first prenatal visit through delivery. My lab will use this data to define major molecular transition states throughout pregnancy. I will also investigate the disease mechanisms of common obstetric disorders including identifying for an individual the earliest possible point of deviation from a healthy trajectory. This interdisciplinary approach will identify potential drug targets\, biomarker panels\, and individualized clinical interventions. \n  \nBio: Samantha completed her PhD in Stem Cell Biology and Regenerative Medicine with a PhD minor in Biomedical Informatics at Stanford University under the advisement of Dr. Anthony Oro. Using a multiomics approach\, Samantha demonstrated how transcription factors direct keratinocyte differentiation by changing the epigenetic landscape\, including chromatin looping\, thereby effecting the cell transcriptional program. Samantha has also been collaborating with Google since June 2019 to build Biomedical Data Commons\, a knowledge graph that integrates biomedical data from a wide array of sources into a single searchable database thereby increasing data accessibility. Upon completion of her PhD in 2020\, began her postdoctoral fellowship at the Institute for Systems Biology under the advisement of Drs. Lee Hood and Nathan Price. Using electronic healthcare records (EHR)\, she has provided insight into the impact of maternal COVID-19 and vaccination on maternal-fetal outcomes. In addition to her EHR research\, Samantha is using multidimensional omics placental data to understand the molecular mechanism of common obstetric disorders. Upon transitioning to Assistant Professor\, she intends to perform multimodal data integration of longitudinal deep-phenotyping data to evaluate changes in molecular networks in complex diseases.
URL:https://datascience.ucsd.edu/event/special-seminar-samantha-piekos/
LOCATION:Halıcıoğlu Data Science Institute (HDSI)\, Room 123\, 3234 Matthews Ln\, La Jolla\, CA\, 92093\, United States
CATEGORIES:Seminar
ATTACH;FMTTYPE=image/png:https://datascience.ucsd.edu/wp-content/uploads/2024/01/HDSI-UCSD-Image_Dark-blue-e1710178042629.png
END:VEVENT
END:VCALENDAR