E4620 Numerical Methods for Data Analysis
Department of Electrical Engineering, Columbia University
E4620 is typically taught once per year in the Fall semester. The information below is meant to provide a snapshot of the material covered. OverviewCourse descriptionE4620 is intended to provide students with an introduction to the mathematical and computational foundations of data analysis. Students will learn how to understand data analysis through the lens of linear algebra, specifically through the use of matrix factorization techniques. The course will take a principled approach to addressing the theory and computational complexity of numerical linear algebra algorithms for a range of problems including: data fitting, data classification, clustering, and data reduction. Theory and algorithms will be illustrated using a wide range of engineering applications. The course can be loosely broken down into 4 parts:
Lecture slidesFall’25 update: This is a new course and slides after topic 13 and additional material will be added soon. Slides for the least squares topics are taken from Lieven Vandenberghe's EECE133A class at UCLA. 1. Vector spaces Additional material a. K-means clustering TextbookThere is no official textbook for the class, however the material loosly follows 1--2 below. 1. Introduction to Applied Linear Algebra – Vectors, Matrices, and Least Squares, Boyd & Vandenberghe, Cambridge University Press Note: 1 and 3 are freely availablbe from the websites listed. PrerequisitesTherre are no formal prerequisites for this class beyond teh ability to write short scripts in a high-level scripting language such as Python, Julia, or MATLAB. |