# Diffusion Maps, Spectral Clustering and Reaction Coordinates of Dynamical Systems

A central problem in data analysis is the low dimensional representation of high dimensional data, and the concise description of its underlying geometry and density. In the analysis of large scale simulations of complex dynamical systems, where the notion of time evolution comes into play, important problems are the identification of the slow variables and the representation of the reaction coordinates that parameterize them.

In this paper we provide a unifying view of these apparently different tasks, by considering a family of diffusion maps, defined as the embedding of complex data onto a low dimensional Euclidian space, via the eigenvectors of suitably normalized random walks defined on the given datasets. We show, both theoretically and by examples how this embedding can be used for dimensionality reduction, manifold learning, geometric analysis of complex data sets and fast simulations of stochastic dynamical systems. Joint work with R.R. Coifman, S. Lafon, M. Maggioni and I.G. Kevrekidis

*Author: Boaz Nadler, Weizmann Institute*