Document Detail

Interconnection between the Protein Solubility and Amino Acid and Dipeptide Compositions.
MedLine Citation:
PMID:  22789104     Owner:  NLM     Status:  Publisher    
Obtaining soluble proteins in sufficient concentrations helps increase the overall success rate of various experimental studies. Protein solubility is an individual trait ultimately determined by the protein's primary sequence. Exploring the interconnection between protein solubility and sequence composition is instrumental for prioritizing targets in large-scale proteomics projects. In this paper, the amino acid composition (20 dimensions) and the dipeptide composition (400 dimensions) were extracted to form the candidate feature pool (420 dimensions). The features were sorted by the absolute value of the correlation coefficient and added to the feature vector one at a time. With a Support Vector Machine (SVM) as the prediction engine, we evaluated and recorded the results for all 420 feature vectors. According to the SVM results, the first 208 of the 420 features were chosen as the efficient ones. By analyzing the composition of these 208 features, we found that protein solubility was significantly influenced by the occurrence frequencies of the acidic amino acids, basic amino acids, non-polar hydrophobic amino acids and two polar neutral amino acids (C, Q) in the protein sequences. Additionally, we found that the dipeptides composed of the acidic amino acids (D, E) and basic amino acids (K, R and H), especially those composed of the acidic amino acids (D, E), had a strong interconnection with protein solubility.
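The feature extraction and ranking described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. It assumes the standard 20-letter amino acid alphabet, frequencies normalized by sequence length (and by length minus one for dipeptides), numeric solubility labels, and the Pearson correlation coefficient; the abstract says only "correlation coefficient". The function names are hypothetical, and the incremental SVM evaluation step is not shown.

```python
from itertools import product
from math import sqrt

# Assumption: the standard 20 amino acids, in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs
FEATURE_NAMES = list(AMINO_ACIDS) + DIPEPTIDES  # 420 names in total


def composition_features(seq):
    """Return 20 amino acid frequencies followed by 400 dipeptide frequencies."""
    seq = seq.upper()
    n = len(seq)
    aa_counts = {a: 0 for a in AMINO_ACIDS}
    dp_counts = {d: 0 for d in DIPEPTIDES}
    for ch in seq:
        if ch in aa_counts:  # non-standard residues are ignored
            aa_counts[ch] += 1
    for i in range(n - 1):
        pair = seq[i:i + 2]
        if pair in dp_counts:
            dp_counts[pair] += 1
    aa = [aa_counts[a] / n if n else 0.0 for a in AMINO_ACIDS]
    dp = [dp_counts[d] / (n - 1) if n > 1 else 0.0 for d in DIPEPTIDES]
    return aa + dp


def pearson(x, y):
    """Pearson correlation of two equal-length lists; 0.0 if either is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def rank_features(sequences, labels):
    """Sort the 420 features by |correlation| with the labels, strongest first."""
    rows = [composition_features(s) for s in sequences]
    scores = []
    for j, name in enumerate(FEATURE_NAMES):
        column = [row[j] for row in rows]
        scores.append((name, abs(pearson(column, labels))))
    return sorted(scores, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Toy data for illustration only; not taken from the paper.
    toy_sequences = ["DDDD", "DDEE", "KKKK", "KKRR"]
    toy_labels = [1, 1, 0, 0]
    for name, score in rank_features(toy_sequences, toy_labels)[:5]:
        print(name, round(score, 3))
```

In a workflow like the one the abstract describes, the ranked list would then be consumed from the top, adding one feature at a time and retraining a classifier at each step to find the prefix that performs best.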
Xiaohui Niu; Nana Li; Dinyan Chen; Zengzhen Wang
Publication Detail:
Type:  JOURNAL ARTICLE     Date:  2012-7-12
Journal Detail:
Title:  Protein and peptide letters     Volume:  -     ISSN:  1875-5305     ISO Abbreviation:  -     Publication Date:  2012 Jul 
Date Detail:
Created Date:  2012-7-13     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  9441434     Medline TA:  Protein Pept Lett     Country:  -    
Other Details:
Languages:  ENG     Pagination:  -     Citation Subset:  -    
Department of Epidemiology and Health Statistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, PR China.

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
