List of features used for predicting peptide detectability

Reference: H. Tang, R. J. Arnold, P. Alves, Z. Xun, D. E. Clemmer, M. V. Novotny, J. P. Reilly and P. Radivojac, A computational approach toward label-free protein quantification using predicted peptide detectability. ISMB (Supplement of Bioinformatics) 2006: 481-488.


Some features were pre-calculated using 14 methods as following:


  • 4 protein disorder predictors (ref: PMID 12910457) server implementation): 1) VL2; 2) VL2V; 3) VL2C; 4) VL2S.
  • 5) VLXT protein disorder predictor (ref: PMID: 11093259)
  • 6) Flexibility (ref: PMID: 8090708)
  • Hydrophobic moments with different parameters: 7) window=11, angle=100deg; 8) window=11, angle=160deg; 9) window=11, angle=120deg; (ref: PMID 6582470).
  • 10) VL3 disorder predictor (ref: PMID: 14579347 and PMID: 15751111, and server implementation)
  • 11) B-factor predictor (ref: PMID: 14691223)
  • 12) DisPhos predictor (ref: PMID: 14960716 and server implementation)
  • Calmodulin binding site predictors: 13) residue level; 14) region level; (ref: PMID: 16493654).


  • All 175 features include:


  • 42 features: 3 features for each method 1-14 calculated in +/-5, +/-10 and +/-15 windows;
  • 3 features for the maximal value calculated by method 11 in +/-5, +/-10 and +/-15 windows;
  • 20 features for the amino acid content of the peptide;
  • 20 features for the first amino acid residue of the peptide;
  • 20 features for the last amino acid residue of the peptide;
  • 20 features for the existance of an amino acid residue in the first three residues of the peptide;
  • 20 features for the existance of an amino acid residue in the last three residues of the peptide;
  • 20 features for the existance of an amino acid residue in the protein right after the peptide;
  • 2 features: if the peptide is at N- or C- terminal of the protein;
  • 2 features: if the peptide is within 50 residues from the N- or C- terminal of the protein;
  • 1 feature: the frequency of the hydrophobic amino acid residues;
  • 1 feature: the frequency of the aromatic amino acid residues;
  • 1 feature: entropy of the peptide
  • 1 feature: mass / length ratio;
  • 1 feature: mass of the peptide;
  • 1 feature: length of the peptide;