LIX Laboratory, Ecole polytechnique.

Research Interests

My research interests are in the areas of Machine Learning, Data Science and Mining, and Artificial Intelligence in general. More specifically, my research interests are a bit eclectic, but there are two major themes:

  • Multi-label classification, multi-target and structured-output prediction

  • Learning from sequential data and data streams, which includes sequential decision making, autonomous agents, and reinforcement learning

The two themes are often connected; since sequential implies multiple and often vice versa. In both, I am specifically interested in themes of explainability, uncertainty analysis, robustness and reliability, domain shift/concept drift, transfer learning and continual learning.

I’ll use any tools suited to the task, including deep neural network architectures and deep learning, probabilistic graphical models, Monte Carlo methods and MCMC and other methods from computational statistics, and classical machine learning algorithms such as decision trees.

I also enjoy tackling real-world problems (some listed below), and data science applications; including sensor networks and sensory data, transport and energy systems and medicine, biology and the natural sciences.


See a short presentation (pdf) of some topics of interest and of the wider DaSciM team (see also the DaSciM team web page). As follows a selection of research activity, a lot of which involves the work of PhD students (and other colleagues).


Topic: Multi-label Classification and Multi-Target Prediction

In multi-label classification, multiple target variables are associated with each instance, as opposed to the traditional supervised learning problem where a single class label is assigned to each instance. This involves standard tasks such as image and text categorization, as well as recommender systems, missing value imputation, and more general structured-output prediction problems such as time series and trajectory forecasting.

Particular challenges: explainability and interpretability, distribution learning, distribution shift.

Methods of choice: classifier chains, regressor chains, probabilistic graphical models.

CC

Highlights:


Topic: Learning from Data Streams

Many real-world applications are found in the context of data streams, where data instances arrive rapidly and continuously in a theoretically-infinite stream, for example from sensors, online social media and text streams. Reinforcement learning is typically carried out in such a scenario; where a stream of observations comes from an environment.

Challenges: learning with weak, partial, noisy, and delayed labels; adaptation to concept drift; online learning.

Applications: anomaly and event detection, time series and trajectory forecasting, complex energy systems.

Highlights:




Application: Modelling and diagnosis of sleep disorders (Updated: 04/2019)

Working with Olivier Pallanca (neurophysiologist) we are building predictive models for diagnosing different types of insomnia and, more importantly, predicting the response to different treatment options based on the personal characteristics of each patient ; with data such as psychological questionnaires, overnight EEG and ECG signals, skin conductance, eye movement, reaction time. This involves a number of subtasks such as event detection in sequences and streams involving multiple correlated outputs. Interpretability of models is a key aspect.


Application: Trajectory prediction (2015)

Given only a week or so of location data from a mobile phone device, it was possible to make reasonably accurate predictions about the a traveller’s route and future destination in an urban setting. Here is a Demo Animation (the captions explain what is going on)

— using real data collecting in the greater Helsinki area.


Application: Modelling tree growth in Scots pine (2015)

We worked with forestry scientists from the University of Helsikni to model intra-annual growth of Scots pine trees at sites in Finland and France using time series and machine learning models (paper).


Application: Tracking on very low-power sensor motes (2012)

We formulated and implementing a distributed particle filter on very low-power motes (4 MHz CPU) for real-time target tracking. This video of a demo of testbed deployment shows tracking using only light sensor observations.