X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

A statistical framework for genomic data fusion

A statistical framework for genomic data fusion is a computational framework for integrating and drawing inferences from a collection of genome-wide measurements. Each dataset is represented via a kernel function, which defines generalized similarity relationships between pairs of entities, such as genes or proteins. The kernel representation is both flexible and efficient, and can be applied to many different types of data. Furthermore, kernel functions derived from different types of data can be combined in a straightforward fashion. Recent advances in the theory of kernel methods have provided efficient algorithms to perform such combinations in a way that minimizes a statistical loss function. These methods exploit semidefinite programming techniques to reduce the problem of finding optimizing kernel combinations to a convex optimization problem. Computational experiments performed using yeast genome-wide datasets, including amino acid sequences, hydropathy profiles, gene expression data and known protein-protein interactions, demonstrate the utility of this approach. A statistical learning algorithm trained from all of these data to recognize particular classes of proteins--membrane proteins and ribosomal proteins--performs significantly better than the same algorithm trained on any single type of data. Matlab code to center a kernel matrix and Matlab code for normalization are available.

URL: http://noble.gs.washington.edu/proj/sdp-svm/

Resource ID: nlx_149420     Resource Type: Resource     Version: Latest Version

Keywords

kernel matrix, random, gene expression, blast, smith-waterman, pfam hmm, hydrophobicity fft, linear interaction, diffusion kernel, protein, membrane, ribosomal

Additional Resource Types

data set, source code

Supercategory

Resource

Parent Organization

Original Submitter

Anonymous

Version Status

Curated

Submitted On

12:00am April 29, 2011

Originated From

SciCrunch

Changes from Previous Version

First Version

Version 1

Created 4 years ago by Anonymous