Data, Inference, Prediction, and Computation
The following 5 courses together provide the basic skills to deal with data statistically and computationally:
Give us a call or drop by anytime, we endeavor to answer all inquiries within 24 hours.
PO Box 16122 Collins Street West Victoria, Australia
info@domain.com / example@domain.com
Phone: + (066) 0760 0260 / + (057) 0760 0560
The following 5 courses together provide the basic skills to deal with data statistically and computationally:
This first course in data science introduces students to data exploration, statistical inference, and prediction. It introduces the Python programming language as a tool for tabular data manipulation, visualization, and simulation. Through homework assignments and projects, students are given an opportunity to develop their analytical skills while working with real-world datasets from a variety of domains.
Prerequisites:
Provides an understanding of the structures that underlie the programs, algorithms, and languages used in data science by expanding the repertoire of computational concepts introduced in DSC 10 and exposing students to techniques of abstraction. Course will be taught in Python and will cover topics including recursion, higher-order functions, function composition, object-oriented programming, interpreters, classes, and simple data structures such as arrays, lists, and linked lists.
Prerequisites:
Builds on topics covered in DSC 20 and provides practical experience in composing larger computational systems through several significant programming projects using Java. Students will study advanced programming techniques including encapsulation, abstract data types, interfaces, algorithms and complexity, and data structures such as stacks, queues, priority queues, heaps, linked lists, binary trees, binary search trees, and hash tables.
Prerequisites:
Students will learn about a variety of topics in data science through interactive presentations from faculty and industry professionals.
Prerequisites:
Students will explore topics and tools relevant to the practice of data science in a workshop format. The instructor works with students on guided projects to help students acquire knowledge and skills to complement their course work in the core data science classes. Topics vary from quarter to quarter.
Prerequisites:
The following 2 courses provide a theoretical foundation for data science at the lower-division level:
The sequence DSC 40A-B introduces the theoretical foundations of data science. DSC 40A, the first course in the sequence, exposes students to the mathematical theory underlying fundamental topics in machine learning. Topics include empirical risk minimization, optimization, regression, classification, and discrete probability. Students practice creative problem-solving while learning how to rigorously justify and communicate mathematical ideas.
Prerequisites:
The sequence DSC 40A-B introduces the theoretical foundations of data science. DSC 40B, the second course in the sequence, covers the fundamentals of computer science with applications to data science. Topics include time complexity analysis, the analysis of recursive algorithms, graph theory, and graph search algorithms. Whereas other courses in the curriculum, such as DSC 20 and DSC 30, may touch on these topics briefly, this course aims to develop a deeper, theoretical understanding. DSC 40A-B connect to DSC 10, 20, and 30 by providing the theoretical foundation for the methods that underlie data science.
Prerequisites:
The marriage of data, computation, and inferential thinking, or “data science,” is redefining how people and organizations solve challenging problems and understand the world. This course bridges lower- and upper-division data science courses as well as methods courses in other fields. Students master the data science life-cycle and learn many of the fundamental principles and techniques of data science spanning algorithms, statistics, machine learning, visualization, and data systems.
Prerequisites:
Individual research on a topic related to data science, by special arrangement with and under the direction of a UC San Diego faculty member, in connection with an internship at an organization. It is the students’ responsibility to find an internship related to data science prior to enrolling in this course. Internship work will inform but not necessarily define the research topic. The research topic is expected to promote the study of the principles and techniques involved in the internship work. May be taken for credit up to two times.
Prerequisites:
Note: This course does NOT fulfill requirements for the DSC major or minor.
For information on eligibility and the enrollment process, visit the Enrolling in Classes page of the website.
Students will investigate a topic in data science through directed reading, discussion, and project work under the supervision of a faculty member. May be taken for credit up to two times.
Prerequisites:
Note: This course does NOT fulfill requirements for the DSC major or minor.
For information on eligibility and the enrollment process, visit the Enrolling in Classes page of the website.
Students will participate in independent study or research in data science under the direction of a UC San Diego faculty member. May be taken for credit up to two times.
Prerequisites:
Note: This course does NOT fulfill requirements for the DSC major or minor.
For information on eligibility and the enrollment process, visit the Enrolling in Classes page of the website.
This course is an introduction to storage and management of large-scale data using classical relational (SQL) systems, with an eye toward applications in data science. The course covers topics including the SQL data model and query language, relational data modeling and schema design, elements of cost-based query optimizations, relational data base architecture, and database-backed applications.
Prerequisites:
This course introduces the principles of computing systems and infrastructure for scaling analytics to large datasets. Topics include memory hierarchy, distributed systems, model selection, heterogeneous datasets, and deployment at scale. The course will also discuss the design of systems such as MapReduce/Hadoop and Spark, in conjunction with their implementation. Students will also learn how dataflow operations can be used to perform data preparation, cleaning, and feature engineering.
Prerequisites:
Data visualization helps explore and interpret data through interaction. This course introduces the principles, techniques, and algorithms for creating effective visualizations. The course draws on the knowledge from several disciplines including computer graphics, human-computer interaction, cognitive psychology, design, and statistical graphics and synthesizes relevant ideas. Students will design visualization systems using D3 or other web-based software and evaluate their effectiveness.
Prerequisites:
The course covers learning and using probabilistic models for knowledge representation and decision-making. Topics covered include graphical models, temporal models, and online learning, as well as applications to natural language processing, adversarial learning, computational biology, and robotics. Prior completion of MATH 181A is strongly recommended.
Prerequisites:
Note: This course substitutes the core CSE 150A requirement. Students cannot receive major credit for both CSE 150A and DSC 140A, as only 1 course can fulfill this major core requirement.
This course is an introduction to machine learning which explores techniques for learning suitable representations from data. Topics include clustering, dimensionality reduction, manifold learning, principal component analysis, spectral embeddings, multilayer perceptrons, autoencoders, convolutional and recurrent neural networks, and other aspects of deep learning. The course focuses on the underlying mathematical principles, but some attention is also given to implementation.
Prerequisites:
Note: This course substitutes the core CSE 151A requirement. Students cannot receive major credit for both CSE 151A and DSC 140B, as only 1 course can fulfill this major core requirement.
This course mainly focuses on introducing current methods and models that are useful in analyzing and mining real-world data. It will cover frequent pattern mining, regression and classification, clustering, and representation learning. All participants should be comfortable with programming, and with basic optimization and linear algebra.
Prerequisites:
Note: This course substitutes the core CSE 158 requirement. Students cannot receive major credit for both CSE 158 and DSC 148, as only 1 course can fulfill this major core requirement.
In this two-course sequence students will investigate a topic and design a system to produce statistically informed output. The investigation will span the entire lifecycle, including assessing the problem, learning domain knowledge, collecting/cleaning data, creating a model, addressing ethical issues, designing the system, analyzing the output, and presenting the results. 180A deals with research, methodology, and system design. Students will produce a research summary and a project proposal.
Prerequisites:
In this two-course sequence students will investigate a topic and design a system to produce statistically informed output. The investigation will span the entire lifecycle, including assessing the problem, learning domain knowledge, collecting/cleaning data, creating a model, addressing ethical issues, designing the system, analyzing the output, and presenting the results. 180B will consist of implementing the project while studying the best practices for evaluation.
Prerequisites:
The course will introduce a variety of No-SQL data formats, data models, high-level query languages, and programming abstractions representative of the needs of modern data analytic tasks. Topics include hierarchical graph database systems, unrestricted graph database systems, array databases, comparison of expressive power of the data models, and parallel programming abstractions, including Map/Reduce and its descendants.
Prerequisites:
This course will focus on ideas from classical and modern signal processing, with the main themes of sampling continuous data and building informative representations using orthonormal bases, frames, and data dependent operators. Topics include sampling theory, Fourier analysis, lossy transformations and compression, time and spatial filters, and random Fourier features and connections to kernel methods. Sources of data include time series and streaming signals and various imaging modalities.
Prerequisites:
This course addresses the intersection of data science and contemporary arts and culture, exploring four main themes of authorship, representation, visualization, and data provenance. The course is not solely an introduction to data science techniques nor merely an arts practice course, but explores significant new possibilities for both fields arising from their intersection. Students will examine problems from complementary perspectives of artist-researchers and data scientists.
Prerequisites:
This class explores statistical and computational methods to enable students to use text as a data source in the social sciences. Hands-on examples will equip students to work with text data in final projects.
Prerequisites:
Note: Students will not receive credit for both POLI 176 and DSC 161.
This course examines the greater context under which the practice of data science exists and explores concrete ways these issues surface in technical work. Students learn frameworks for understanding how individuals relate to social institutions, how to use them to identify how issues of fairness arise in the “life of a data scientist,” and use them to propose and critique potential solutions.
Prerequisites:
Spatial data science is a set of concepts and methods that deal with accessing, managing, visualizing, analyzing, and reasoning about spatial data in applications where location, shape and size of objects, and their mutual arrangement are important. This upper-division course explores advanced data science concepts for spatial data, introducing students to principles and techniques of spatial data analysis, including geographic information systems, spatial big data management, and geostatistics.
Prerequisites:
Introduction to Robotics Perception and Navigation is a fast-paced Data Science-centric robotics class where students are divided into teams to work on increasingly difficult challenges resulting in a robotics scale car navigating autonomously on simulated city-like tracks (indoor and outdoors) using Spatial-AI avoiding obstacles using ROS (Robot Operating System), LIDAR, Active and Stereo 3D cameras, Inertial Measurement Unit (IMU), and odometry.
Prerequisites:
Students will participate in a competition to design and develop a machine learning model that achieves state-of-the-art performance on real-world prediction tasks. Competitions will be selected from industry partners or from online platforms (e.g., Kaggle), and focus on prediction tasks from real-world applications in areas such as medicine, finance, marketing, and scientific discovery. Students will work in teams to develop a model for a single competition over the quarter, developing their strategies through milestone assignments. Students will be evaluated based on their performance in the competition, reproducibility, teamwork, and their assignments. Students should have extensive experience programming in Python and training machine learning models using mainstream packages. This is an advanced topics course for graduate students and senior undergraduate students.
Prerequisites:
Note: This course is an ELECTIVE and should be automatically applied as such in your degree audit.
This course is infused with hands-on computer engineering knowledge and techniques. Students will initially work individually with their own Internet-of-Things kit, then collaboratively in groups of 3 or 4 on a final project developing an entire IoT system for data collection and analysis from the physical world.
Prerequisites:
Note: This course is an ELECTIVE and should be automatically applied as such in your degree audit.
Directed study and research at laboratories/institutions outside of campus. It is the students’ responsibility to find an internship related to data science prior to enrolling in this course.
Prerequisites:
Note: This course does NOT fulfill requirements for the DS25 major or minor.
For information on eligibility and the enrollment process, visit the Enrolling in Classes page of the website.
Data science topics whose study involves reading and discussion by a small group of students under supervision of a faculty member. May be taken for credit up to two times.
Prerequisites:
Note: This course does NOT fulfill requirements for the DS25 major or minor.
For information on eligibility and the enrollment process, visit the Enrolling in Classes page of the website.
Independent reading or research on a topic related to data science by special arrangement with a faculty member. May be taken for credit up to two times.
Prerequisites:
Note: This course does NOT fulfill requirements for the DS25 major or minor.
For information on eligibility and the enrollment process, visit the Enrolling in Classes page of the website.