Department Seminar Series

Applications and possibilities for signal processing, machine learning and network sciences in proteomics and metabolomics

3rd February 2015, 13:00 add to calenderAshton Lecture Theater
Dr Andrew Dowsey
Department of Electrical Engineering and Electronics
University of Liverpool

Abstract

Liquid Chromatography interfaced to Mass Spectrometry (LC-MS) has emerged as the central technique for discovering patterns of protein or metabolite changes between groups of biological or clinical samples in systems biology and translational medicine. Various statistical models have been introduced that facilitate the analysis of large-scale experiments including appropriate treatment of often substantial sources of variation e.g. batch effects, technical vs biological variation, combining multiple peptide measurements per protein. These models are dependent on sensitive and robust feature extraction from a preceding raw data processing pipeline. However, current pipelines are ad-hoc, involving a set of self-contained error-propagating steps that reduce the data down to a symbolic feature-based representation at the earliest stage.
In this talk we will overview some unique properties of raw mass spectrometry signals, highlight the challenges faced by processing this big data (up to 40Gb per sample) and discuss the potential role of signal processing, machine learning and network sciences in the field. We will then describe our progress in developing a statistically rigorous raw data analysis workflow (Liao et al, Proc IEEE ISBI, 2014). In this workflow, the LC-MS output for each sample is first transformed into an image; these are then registered to account for chromatography inconsistency; and finally, a functional mixed-effects model is applied. We will also describe progress towards complementary domain-specialised modelling: Using non-negative matrix factorisation with a minimum volume constraint, we have learnt a low-dimensional representation that approximates all possible peptide signals. This dictionary is incorporated into an L1-regularised sparse regression approach with Poisson noise model, resulting in a unique approach that separates and accurately quantifies overlapping peptide signals.

Bio:
-----
Andrew has just joined the EEE Department at Liverpool as Reader in Biomedical Signal Processing & Informatics. In his previous role as Bioinformatics Research Lead for the Centre for Advanced Discovery and Experimental Therapeutics (CADET) at The University of Manchester, he spearheaded the application of computational approaches to CADET's biological mass spectrometry, drug discovery and drug development programmes. During PhD and postdoctoral training at the Royal Society/Wolfson Medical Image Computing Laboratory, Imperial College London, he was awarded a MICCAI Young Investigator Award and held an EPSRC Overseas Postdoctoral Fellowship at the Life Sciences Interface. Andrew is currently an MRC New Investigator and Associate Editor for the IEEE Journal of Biomedical and Health Informatics.
add to calender (including abstract)