Document Detail


De novo prediction of RNA-protein interactions from sequence information.
MedLine Citation:
PMID:  23138266     Owner:  NLM     Status:  Publisher    
Abstract/OtherAbstract:
Protein-RNA interactions are fundamentally important in understanding cellular processes. In particular, non-coding RNA-protein interactions play an important role to facilitate biological functions in signalling, transcriptional regulation, and even the progression of complex diseases. However, experimental determination of protein-RNA interactions remains time-consuming and labour-intensive. Here, we develop a novel extended naïve-Bayes-classifier for de novo prediction of protein-RNA interactions, only using protein and RNA sequence information. Specifically, we first collect a set of known protein-RNA interactions as gold-standard positives and extract sequence-based features to represent each protein-RNA pair. To fill the gap between high dimensional features and scarcity of gold-standard positives, we select effective features by cutting a likelihood ratio score, which not only reduces the computational complexity but also allows transparent feature integration during prediction. An extended naïve Bayes classifier is then constructed using these effective features to train a protein-RNA interaction prediction model. Numerical experiments show that our method can achieve the prediction accuracy of 0.77 even though only a small number of protein-RNA interaction data are available. In particular, we demonstrate that the extended naïve-Bayes-classifier is superior to the naïve-Bayes-classifier by fully considering the dependences among features. Importantly, we conduct ncRNA pull-down experiments to validate the predicted novel protein-RNA interactions and identify the interacting proteins of sbRNA CeN72 in C. elegans, which further demonstrates the effectiveness of our method.
Authors:
Ying Wang; Xiaowei Chen; Zhi-Ping Liu; Qiang Huang; Yong Wang; Derong Xu; Xiang-Sun Zhang; Runsheng Chen; Luonan Chen
Related Documents :
20572276 - Bioresorbable electrospun fibers for immobilization of thiol-containing compounds.
9853956 - A system for stable indirect immobilization of multimeric recombinant proteins.
23362196 - Recent advances in the electrochemistry and spectroelectrochemistry of membrane proteins.
23259596 - Tio(2)-based phosphoproteomic analysis of schistosomes: characterization of phosphoryla...
10526366 - Ab initio folding of proteins using restraints derived from evolutionary information.
20572276 - Bioresorbable electrospun fibers for immobilization of thiol-containing compounds.
Publication Detail:
Type:  JOURNAL ARTICLE     Date:  2012-11-9
Journal Detail:
Title:  Molecular bioSystems     Volume:  -     ISSN:  1742-2051     ISO Abbreviation:  Mol Biosyst     Publication Date:  2012 Nov 
Date Detail:
Created Date:  2012-11-9     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  101251620     Medline TA:  Mol Biosyst     Country:  -    
Other Details:
Languages:  ENG     Pagination:  -     Citation Subset:  -    
Affiliation:
School of Science, Dalian Jiaotong University, Dalian 116028, China.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Three cases of IgG4-related orbital inflammation presented as unilateral pseudotumor and review of t...
Next Document:  ?-Linolenic acid suppresses cholesterol and triacylglycerol biosynthesis pathway by suppressing SREB...