WINTER 2021

MATH - BIOINF - STATS 547: Mathematics of Data

Instructor: Prof. INDIKA Rajapakse

Teaching Assistant: STEPHEN Lindsly

Class Time: Tuesday and Thursday, 4:00 - 5:30PM

Office Hours: Tuesday and Thursday, 5:30 - 7:00PM

Syllabus

Timeline

Simplicity, Rigor, and Magic

References of Interest

PROBLEM OF THE DAY

POD1: Missing Entries Solution

POD2: Class Expectations

POD3: Least squares

POD4: Clustering

POD5: Learning


NOTES, SLIDES, AND PAPERS

Date: 1-21-2021

Topics

  • Basic introduction to data representation: Vectors, matrices, and tensors

  • Introduction to Eigenvalues and Eigenvectors

Notes

Extra slides: The following slides present the data I showed you on 1-19-2021, with a bit more explanation of where the data comes from.

Papers

  1. Donoho D. "50 years of data science." Journal of Computational and Graphical Statistics. 2017 Oct 2;26(4):745-66.

  2. Lieberman-Aiden, Erez, ..., Groudine Mark, ..., Lander Eric. "Comprehensive mapping of long-range interactions reveals folding principles of the human genome." Science 326.5950 (2009): 289-293.


Date: 1-26-2021

Notes

Resources

Extra slides: The following slides list the guest lecturers and include helpful advice and the course syllabus.


Date: 1-28-2021

Guest Lecture: Dr. Cleve Moler


Date: 2-2-2021

Notes

Resources

Papers

Classics (just browse)

  1. Turk, Matthew, and Alex Pentland. "Eigenfaces for recognition." Journal of cognitive neuroscience 3.1 (1991): 71-86.

  2. Bott, Raoul. "Morse theory indomitable." Publications Mathématiques de l'IHÉS 68 (1988): 99-114.

Preparation for Spectral Clustering

  1. Ng, Andrew Y., Michael I. Jordan, and Yair Weiss. "On spectral clustering: Analysis and an algorithm." Advances in neural information processing systems 2 (2002): 849-856.

  2. Von Luxburg, Ulrike. "A tutorial on spectral clustering." Statistics and computing 17.4 (2007): 395-416.


Date: 2-4-2021

Notes

Papers

  1. Gavish, Matan, and David L. Donoho. "The optimal hard threshold for singular values is 4/sqrt(3)." IEEE Transactions on Information Theory 60.8 (2014): 5040-5053. (Amazing Paper!)

  2. Chen, Jie, Alfred O. Hero III, and Indika Rajapakse. "Spectral identification of topological domains." Bioinformatics 32.14 (2016): 2151-2158.


Date: 2-9-2021

Resources

Notes


Date: 2-11-2021

Guest Lecture: Prof. Gilbert Strang

Resources


Date: 2-16-2021

Notes

Papers

  1. Ng, Andrew Y., Michael I. Jordan, and Yair Weiss. "On spectral clustering: Analysis and an algorithm." Advances in neural information processing systems 2 (2002): 849-856.

  2. Belkin, Mikhail, and Partha Niyogi. "Laplacian eigenmaps and spectral techniques for embedding and clustering." Advances in neural information processing systems. 2002.

  3. Von Luxburg, Ulrike. "A tutorial on spectral clustering." Statistics and computing 17.4 (2007): 395-416. (Excellent Review!)


Date: 2-18-2021

Guest Lecture: Col. Chris Macedonia


Date: 2-23-2021

Notes

Book:

  • Kutz, J. Nathan, et al. Dynamic mode decomposition: data-driven modeling of complex systems. Society for Industrial and Applied Mathematics, 2016.

Chapter 1: Dynamic Mode Decomposition: An Introduction

Nice Video!


Date: 2-25-2021

Notes

Book:

  • Kutz, J. Nathan, et al. Dynamic mode decomposition: data-driven modeling of complex systems. Society for Industrial and Applied Mathematics, 2016.

Chapter 1: Dynamic Mode Decomposition: An Introduction

Code

Papers


Date: 3-2-2021

Notes

Slides

Papers


Date: 3-4-2021

Book:

  • Kutz, J. Nathan. Data-driven modeling & scientific computation: methods for complex systems & big data. Oxford University Press, 2013.

Independent Component Analysis

Papers

Data-guided Control (DGC)

Non-negative Matrix Factorization (NMF or NNMF)

DMD with Control

Video: Dynamic Mode Decomposition (DMD) with Control


Date: 3-9-2021

Guest Lecture: Dr. Reza Ghanadan from Google

  • Assuring AI for Real-World Decision Making with Robust AI System Design


Date: 3-11-2021

Notes


Date: 3-16-2021

Guest Lecture: Dr. David Heckerman from Amazon


Date: 3-18-2021

Notes

Papers


Date: 3-25-2021

Notes

Papers


Date: 3-30-2021

Notes

Papers


Date: 4-1-2021

Slides

Notes

Book:

  • Eldén, Lars. Matrix methods in data mining and pattern recognition. Society for Industrial and Applied Mathematics, 2007.

Chapter 8: Tensor Decomposition

Papers


Date: 4-6-2021

Slides

Notes

Book:

  • Eldén, Lars. Matrix methods in data mining and pattern recognition. Society for Industrial and Applied Mathematics, 2007.

Chapter 8: Tensor Decomposition

Papers

  1. Kolda, Tamara G., and Brett W. Bader. "Tensor decompositions and applications." SIAM review 51.3 (2009): 455-500. (Excellent Review!)

  2. Chen C, Rajapakse I. "Tensor Entropy for Uniform Hypergraphs." IEEE Transactions on Network Science and Engineering 7.4 (2020): 2889-2900.

  3. Sweeney P, Chen C, Rajapakse I, Cone R. "Network Dynamics of Hypothalamic Feeding Neurons." Proceedings of the National Academy of Sciences (2021).

  4. Williams, Alex H., et al. "Unsupervised discovery of demixed, low-dimensional neural dynamics across multiple timescales through tensor component analysis." Neuron 98.6 (2018): 1099-1115.

  5. Wolf, Michael M., Alicia M. Klinvex, and Daniel M. Dunlavy. "Advantages to modeling relational data using hypergraphs versus graphs." 2016 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 2016.


Date: 4-8-2021

Slides

Papers

  1. Carlsson, Gunnar. "The shape of biomedical data." Current Opinion in Systems Biology 1 (2017): 109-113.

  2. Carlsson, Gunnar. "Topology and data." Bulletin of the American Mathematical Society 46.2 (2009): 255-308.

  3. Lum, Pek Y., et al. "Extracting insights from the shape of complex data using topology." Scientific reports 3.1 (2013): 1-8.


Date: 4-13-2021

Notes

Papers

  1. Brunton, Steven L., Joshua L. Proctor, and J. Nathan Kutz. "Discovering governing equations from data by sparse identification of nonlinear dynamical systems." Proceedings of the national academy of sciences 113.15 (2016): 3932-3937.


Date: 4-15-2021

Data-guided Control

Notes

Papers

  1. Ronquist S, Patterson G, Muir LA, Lindsly S, Chen H, Brown M, Wicha M, Bloch A, Brockett R and Rajapakse I. "Algorithm for Cellular Reprogramming." Proceedings of the National Academy of Sciences 114.45 (2017): 11832-11837.

  2. Liu, Yang-Yu, Jean-Jacques Slotine, and Albert-László Barabási. "Controllability of complex networks." Nature 473.7346 (2011): 167-173.


Date: 4-20-2021

Slides

Notes