PINS: A Perturbation Clustering Approach for Data Integration and Disease Subtyping

Abstract

Disease subtyping is accomplished by a computer-implemented algorithm that manipulates a first genetic dataset to construct a set of first connectivity matrices. To this set of matrices Gaussian noise is introduced to generate a perturbed dataset. The computer-implemented algorithm assesses which of the set of first connectivity matrices was least affected by introduction of noise and that matric is used to define the optimal clustering. Once the optimal clustering is determined, computer-implemented supervised classification is performed to determine, for a particular patient, with which disease subtype cluster that person’s genetic data most closely aligns. Armed with this knowledge, the treatment regimen is specified with much higher likelihood of success.

Publication
US patent application number 15068048

Supplemental file