A sparse Bayesian learning method for structural equation model-based gene regulatory network inference

Li, Yan, Liu, Dayou, Chu, Jianfeng, Zhu, Yungang, Liu, Jie and Cheng, Xiaochun ORCID: https://orcid.org/0000-0003-0371-9646 (2020) A sparse Bayesian learning method for structural equation model-based gene regulatory network inference. IEEE Access, 8, pp. 40067-40080. ISSN 2169-3536 [Article] (doi:10.1109/ACCESS.2020.2976743)

PDF - Published version (with publisher's formatting)
Available under License Creative Commons Attribution.


Abstract

Gene regulatory networks (GRNs) are the underlying networks defined by the interactions between genes. Reconstructing GRNs from massive genetic data is important for understanding gene functions and biological mechanisms, and can support medical treatment and genetic research. A series of artificial-intelligence-based methods have been proposed to infer GRNs from both gene expression data and genetic perturbations. Such algorithms can be more accurate than models that consider only gene expression data. A structural equation model (SEM), which provides a systematic framework for conveniently integrating both types of gene data, is a commonly used model for GRN inference. Considering the sparsity of GRNs, in this paper we develop a novel sparse Bayesian inference algorithm based on a Normal-Exponential-Gamma (NEG) type hierarchical prior (BaNEG) to infer GRNs modeled with SEMs more accurately. First, we reparameterize an SEM as a linear model by integrating the endogenous and exogenous variables; then, a Bayesian adaptive lasso with a three-level NEG prior is applied to derive the corresponding posterior mode and estimate the parameters. Simulations on synthetic data are run to compare the performance of BaNEG with some state-of-the-art algorithms; the results demonstrate that the proposed algorithm visibly outperforms the others. Moreover, BaNEG is applied to infer underlying GRNs from a real data set composed of 47 yeast genes from Saccharomyces cerevisiae to discover potential relationships between genes.
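
The reparameterization step mentioned in the abstract can be illustrated with a minimal sketch. It assumes a commonly used SEM form for GRNs, Y = BY + FX + E, with a zero diagonal in B and a diagonal F; this form, the function names, and the toy data are assumptions made for illustration and are not taken from this record. The sparse estimator shown is a plain lasso solved by coordinate descent, used only as a stand-in: it is not BaNEG and does not implement the NEG prior.

```python
import numpy as np

def build_design(Y, X, i):
    """Rewrite row i of Y = B Y + F X + E as a linear model y = Z w + e.

    Z stacks the expression of all other genes (endogenous) with gene i's
    own perturbation (exogenous); w = [B[i, -i], F[i, i]].
    """
    others = np.delete(Y, i, axis=0)            # (N-1, M)
    Z = np.vstack([others, X[i:i + 1]]).T       # (M, N)
    return Z, Y[i]

def soft_threshold(a, t):
    return np.sign(a) * np.maximum(np.abs(a) - t, 0.0)

def lasso_cd(Z, y, lam, n_iter=200):
    """Plain lasso (stand-in for BaNEG): min 0.5*||y - Z w||^2 + lam*||w||_1."""
    w = np.zeros(Z.shape[1])
    col_sq = (Z ** 2).sum(axis=0)
    r = y - Z @ w                               # current residual
    for _ in range(n_iter):
        for j in range(Z.shape[1]):
            if col_sq[j] == 0.0:
                continue
            r += Z[:, j] * w[j]                 # remove coordinate j's contribution
            w[j] = soft_threshold(Z[:, j] @ r, lam) / col_sq[j]
            r -= Z[:, j] * w[j]                 # add the updated contribution back
    return w

def lasso_objective(Z, y, w, lam):
    return 0.5 * np.sum((y - Z @ w) ** 2) + lam * np.sum(np.abs(w))

# Toy data (hypothetical): 5 genes, 50 samples, an acyclic sparse B.
rng = np.random.default_rng(0)
N, M = 5, 50
B = np.zeros((N, N))
B[0, 1], B[2, 0], B[3, 4] = 0.8, -0.6, 0.5
F = np.eye(N)
X = rng.integers(0, 3, size=(N, M)).astype(float)
E = 0.01 * rng.standard_normal((N, M))
Y = np.linalg.solve(np.eye(N) - B, F @ X + E)   # Y = (I - B)^{-1} (F X + E)

i = 0
Z, y = build_design(Y, X, i)
w_true = np.append(np.delete(B[i], i), F[i, i])
w_hat = lasso_cd(Z, y, lam=0.1)
```

Running the per-gene regression for every i and reading the nonzero entries of each estimated w would give one candidate network; the choice of `lam` here is arbitrary and would normally be tuned.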

Item Type: Article
Keywords (uncontrolled): Sparse Bayesian learning, high-dimensional data, gene regulatory network, gene expression data, structural equation model
Research Areas: A. > School of Science and Technology > Computer Science
Item ID: 29519
Notes on copyright: This work is licensed under a Creative Commons Attribution 4.0 License.
Depositing User: Xiaochun Cheng
Date Deposited: 12 Mar 2020 14:26
Last Modified: 12 Mar 2020 15:34
URI: https://eprints.mdx.ac.uk/id/eprint/29519
