| Interconnection between the Protein Solubility and Amino Acid and Dipeptide Compositions. | |
| | |
MedLine Citation:
|
PMID: 22789104 Owner: NLM Status: Publisher |
Abstract/OtherAbstract:
|
Obtaining soluble proteins in sufficient concentrations helps increase the overall success rate of various experimental studies. Protein solubility is an individual trait ultimately determined by the protein's primary sequence. Exploring the interconnection between protein solubility and sequence composition is instrumental for prioritizing targets in large-scale proteomics projects. In this paper, the amino acid composition (20 dimensions) and the dipeptide composition (400 dimensions) were extracted to form the total candidate feature pool (420 dimensions). The features were sorted by the absolute value of the correlation coefficient and added to the feature vector one by one. We evaluated and recorded the 420 resulting feature sets using a Support Vector Machine (SVM) as the prediction engine. According to the SVM results, the first 208 of the 420 features were chosen as the efficient ones. By analyzing the composition of these 208 features, we found that protein solubility was significantly influenced by the occurrence frequencies of the acidic amino acids, basic amino acids, non-polar hydrophobic amino acids and the two polar neutral amino acids (C, Q) in the protein sequences. Additionally, we detected that the dipeptides composed of the acidic amino acids (D, E) and basic amino acids (K, R and H), especially the dipeptides composed of the acidic amino acids (D, E), had a strong interconnection with protein solubility. |
| | |
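The feature construction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the compositions are normalized or which variable the correlation coefficient is computed against, so the sketch assumes that counts are divided by sequence length (or by length minus one for dipeptides) and that each feature is correlated, using Pearson's coefficient, with a binary solubility label (1 = soluble, 0 = insoluble). The function names `composition_features`, `pearson`, and `rank_features` are hypothetical.

```python
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"                      # 20 standard residues
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs
FEATURE_NAMES = list(AMINO_ACIDS) + DIPEPTIDES            # 420 feature names


def composition_features(seq):
    """Return the 420-dimensional composition vector of one sequence.

    Assumption: frequencies are counts divided by the number of residues
    (amino acids) or the number of adjacent pairs (dipeptides).
    Non-standard residue codes are ignored.
    """
    seq = seq.upper()
    n = len(seq)
    aa_counts = {a: 0 for a in AMINO_ACIDS}
    dp_counts = {d: 0 for d in DIPEPTIDES}
    for ch in seq:
        if ch in aa_counts:
            aa_counts[ch] += 1
    for i in range(n - 1):
        pair = seq[i:i + 2]
        if pair in dp_counts:
            dp_counts[pair] += 1
    aa = [aa_counts[a] / n if n else 0.0 for a in AMINO_ACIDS]
    dp = [dp_counts[d] / (n - 1) if n > 1 else 0.0 for d in DIPEPTIDES]
    return aa + dp


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 for a constant input."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def rank_features(sequences, labels):
    """Sort the 420 feature names by |correlation with the label|, descending."""
    rows = [composition_features(s) for s in sequences]
    scored = []
    for j, name in enumerate(FEATURE_NAMES):
        column = [row[j] for row in rows]
        scored.append((abs(pearson(column, labels)), name))
    scored.sort(key=lambda t: -t[0])  # stable sort: ties keep original order
    return [name for _, name in scored]


# Toy data, invented purely for illustration (not from the paper).
toy_sequences = ["DEDE", "DDEE", "LLLL", "LVLV"]
toy_labels = [1, 1, 0, 0]
ranked = rank_features(toy_sequences, toy_labels)
```

Given such a ranking, the incremental evaluation in the abstract would train and score a classifier on the top 1, top 2, ..., top 420 features in turn and keep the prefix that performs best (208 features in the paper). That step is omitted here because the abstract does not specify the SVM kernel, parameters, or evaluation protocol.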
Authors:
|
Xiaohui Niu; Nana Li; Dinyan Chen; Zengzhen Wang |
Publication Detail:
|
Type: JOURNAL ARTICLE Date: 2012-7-12 |
Journal Detail:
|
Title: Protein and peptide letters Volume: - ISSN: 1875-5305 ISO Abbreviation: - Publication Date: 2012 Jul |
Date Detail:
|
Created Date: 2012-7-13 Completed Date: - Revised Date: - |
Medline Journal Info:
|
Nlm Unique ID: 9441434 Medline TA: Protein Pept Lett Country: - |
Other Details:
|
Languages: ENG Pagination: - Citation Subset: - |
Affiliation:
|
Department of Epidemiology and Health Statistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, PR China. wzzh@mails.tjmu.edu.cn. |
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine