Teaching: 2016, Fall (École Polytechnique):
Functional Data Analysis (MAP579):
- The goal of the course is to enrich the participants with practical tools to be able to process and analyse data of functional nature, typically arising in climate research
(for example, temperature curves, wind speed, and other time-series data). During the semester we will learn how to denoise functions (smoothing), align sample curves
(time warping; shift registration, landmark registration, continuous registration), discover dominant patterns present in the data (dimensionality reduction), represent unequally spaced observations (functional representation), and visualize the obtained results. The course will also provide the possibility to work on hourly, atmospheric observations provided by
the SIRTA-ReOBS project. The methods will be implemented in Matlab.
- What you will learn: denoise experimental data, find recurrent patterns within the data, work with time series, and see applications of machine learning to process experimental data.
- The course requires a basic knowledge of probability theory (random variable, density function, mean, variance) at the level of 'A First Course in Probability by Sheldon Ross', calculus (function, derivative, integrals) at the level of 'Calculus by James Stewart', linear algebra (vector, matrix) at the level of 'Introduction to Linear Algebra by Serge Lang'.
- Lecture 1 (Oct. 4):
- Keywords: smoothing by least squares, basis function technique, linear smoother, localized least squares (kernel smoothing), Nadaraya-Watson estimator, Gasser-Müller estimator, localized basis function estimator, local polynomial smoothing.
- Slides: maths, quick summary of Matlab.
- Data: weather data.
- Covered: Chapter 1-4 of .
- Lecture 2 (Oct. 11):
- Keywords (maths): smoothing with roughness penalty (regularization approach), harmonic acceleration operator, spline smoothing, B-spline basis, degree of freedom, quadrature rules (trapezoid rule, Simpson's rule), (generalized) cross-validation, bi-resolution analysis.
- Keywords (FDA toolbox): basis object, create/evaluate/plot basis systems.
- Slides: maths, FDA package: basics.
- Covered: Chapter 5 of , Chapter 1-3 of .
- Lecture 3 (Oct. 18):
- Keywords (maths): smoothing with constraints (positivity, monotonicity, probability density function), maximum-likelihood estimation, curve registration, amplitude/phase variability, shift registration, Procrustes method, feature or landmark registration, time-warping function, continuous registration, modified Newton-Raphson method.
- Keywords (FDA toolbox): fd, fdnames, Lfd, fdPar objects.
- Slides: maths, FDA package: smoothing.
- Covered: Chapter 6-7 of , Chapter 4-5 of .
- Lecture 4 (Nov. 8):
- Keywords: dimensionality reduction, principal component analysis (Karhunen-Loeve transformation, Hoteling transformation), constrained optimization (Lagrange multipliers).
- Slides: maths, PCA tasks.
- Covered: 'Rd part' of Chapter 8 in , and .
- Lecture 5 (Nov. 15):
- Lecture 6 (Nov. 22):
- Lecture 7 (Dec. 13):
- Keywords: inner product space, norm, Hilbert space, CBS inequality, kernel, reproducing property, RKHS, kernel ridge regression, representer theorem, covariance operator, kernel PCA.
- References: [4-6].
-  J.O. Ramsay, B.W. Silverman. Functional Data Analysis. Springer, 2005. [link]
-  J.O. Ramsay, Giles Hooker, Spencer Graves. Functional Data Analysis with R and Matlab. Springer, 2009. [link]
-  Cosma Shalizi. Notes on 'Principal Components Analysis', 2016. [link]
-  Bernhard Schölkopf, Alex Smola, Klaus-Robert Müller. Kernel Principal Component Analysis, pages 583-588, ICANN-1997.
-  Sebastian Mika, Bernhard Schölkopf, Alex Smola, Klaus-Robert Müller, Matthias Scholz, Gunnar Rätsch. Kernel PCA and De-Noising in Feature Spaces, pages 536-542, NIPS-2009.
-  Ingo Steinwart, Andreas Christmann. Support Vector Machines, 2008.
- Code/dataset (external):