Document Detail

Interconnection between the Protein Solubility and Amino Acid and Dipeptide Compositions.
MedLine Citation:
PMID:  22789104     Owner:  NLM     Status:  Publisher    
Obtaining soluble proteins in sufficient concentrations increases the overall success rate of many experimental studies. Protein solubility is an individual trait ultimately determined by the protein's primary sequence. Exploring the interconnection between protein solubility and sequence composition is therefore instrumental for prioritizing targets in large-scale proteomics projects. In this paper, the amino acid composition (20 dimensions) and the dipeptide composition (400 dimensions) were extracted to form the total candidate feature pool (420 dimensions). The features were sorted by the absolute value of the correlation coefficient and added to the feature vector one at a time in that order. A Support Vector Machine (SVM) served as the prediction engine, and its result was evaluated and recorded for each of the 420 resulting feature vectors. According to the SVM results, the first 208 of the 420 features were chosen as the efficient ones. By analyzing the composition of these 208 features, we found that protein solubility was significantly influenced by the occurrence frequencies of the acidic amino acids, the basic amino acids, the non-polar hydrophobic amino acids, and two polar neutral amino acids (C, Q) in the protein sequences. Additionally, we found that the dipeptides composed of the acidic amino acids (D, E) and the basic amino acids (K, R and H), especially the dipeptides composed of the acidic amino acids (D, E), had a strong interconnection with protein solubility.
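The feature construction and ranking described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how the compositions are normalized or which variable the correlation is computed against, so the sketch assumes frequencies normalized by sequence length and a Pearson correlation with a numeric solubility label. The function names are hypothetical, and the incremental SVM evaluation step is not reproduced here.

```python
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
FEATURE_NAMES = list(AMINO_ACIDS) + DIPEPTIDES  # 20 + 400 = 420 features


def composition_features(sequence):
    """Return the 420-dimensional composition vector of one sequence.

    Assumption: counts are normalized to frequencies (residues by length,
    dipeptides by the number of adjacent pairs).
    """
    seq = sequence.upper()
    n = len(seq)
    aa_counts = dict.fromkeys(AMINO_ACIDS, 0)
    dp_counts = dict.fromkeys(DIPEPTIDES, 0)
    for residue in seq:
        if residue in aa_counts:  # skip non-standard symbols
            aa_counts[residue] += 1
    for i in range(n - 1):
        pair = seq[i:i + 2]
        if pair in dp_counts:
            dp_counts[pair] += 1
    aa = [aa_counts[a] / n if n else 0.0 for a in AMINO_ACIDS]
    dp = [dp_counts[d] / (n - 1) if n > 1 else 0.0 for d in DIPEPTIDES]
    return aa + dp


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either input is constant."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def rank_features(sequences, labels):
    """Sort the 420 features by |correlation| with the labels, descending."""
    vectors = [composition_features(s) for s in sequences]
    scores = []
    for j, name in enumerate(FEATURE_NAMES):
        column = [v[j] for v in vectors]
        scores.append((name, abs(pearson(column, labels))))
    return sorted(scores, key=lambda item: item[1], reverse=True)
```

Calling `rank_features(sequences, labels)` on a list of sequences and matching labels (for example 1 for soluble, 0 for insoluble) returns `(feature, score)` pairs in ranked order. A classifier such as an SVM could then be trained on the top 1, 2, ..., 420 features in turn to find the best-performing prefix, which is the role the 208-feature cutoff plays in the abstract.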
Xiaohui Niu; Nana Li; Dinyan Chen; Zengzhen Wang
Publication Detail:
Type:  JOURNAL ARTICLE     Date:  2012-7-12
Journal Detail:
Title:  Protein and peptide letters     Volume:  -     ISSN:  1875-5305     ISO Abbreviation:  -     Publication Date:  2012 Jul 
Date Detail:
Created Date:  2012-7-13     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  9441434     Medline TA:  Protein Pept Lett     Country:  -    
Other Details:
Languages:  ENG     Pagination:  -     Citation Subset:  -    
Department of Epidemiology and Health Statistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, PR China.

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
