Tab-delimited file format

Various programs in Crux report their output and read inputs in tab-delimited text format. Each such file consists of a single line containing the tab-separated names of fields, followed by one or more lines giving the corresponding field values.

 
Name Description comet tide search-for-xlinks barista q-ranker percolator spectral-counts
PSM peptides proteins sub-proteins PSM peptides proteins
1 scan The identifying number of each scan        
2 charge The charge state for this spectrum in this PSM.        
3 spectrum precursor m/z The observed m/z of the spectrum precursor ion.        
4 spectrum neutral mass The computed mass of the spectrum precursor at the given charge state. This is equal to the precursor m/z minus the mass of a proton (1.00727646677 Da), all multiplied by the charge.        
5 peptide mass The mass of the peptide sequence, computed as the sum of the the amino acid masses plus the mass of water (18.010564684 Da or 18.0153 Da, depending on whether we are using monoisotopic or average mass).          
6 delta_cn The normalized difference in xcorr for this PSM relative to other PSMs for this spectrum at this charge state.              
7 sp score The SEQUEST-type preliminary score.              
8 sp rank The rank of this PSM when sorted by Sp score. Note that, in tide-search, the Sp score is only computed for PSMs that are reported to the user. Hence, the rank of the Sp score will be in the range from 1 to n, where the value of n is determined by the --top-match parameter.              
9 xcorr score The SEQUEST-type cross correlation score.            
10 xcorr rank The rank of this PSM when sorted by xcorr.            
11 p-value The p-value computed from the estimated Weibull parameters of the score distribution for this spectrum.                    
12 percolator score The discriminant score assigned by percolator.                  
13 percolator rank The rank of this PSM, peptide or protein when sorted by percolator score.                  
14 percolator q-value The q-value assigned by percolator.                  
15 q-ranker score The discriminant score assigned by q-ranker.                      
16 q-ranker q-value The q-value assigned by q-ranker.                      
17 q-ranker PEP The posterior error probability estimated from the q-ranker score                      
18 barista score The discriminant score assigned by barista.                  
19 barista q-value The q-value assigned by barista.                  
20 barista PEP The PEP assigned by barista                    
21 b/y ions matched The number of b-ions and y-ions in the peptide that have a corresponding peak in the spectrum.                
22 b/y ions total The total number of b- and y-ions predicted for this peptide.            
23 total matches/spectrum The number of candidate peptides in the database found for this spectrum, including duplicates. Note that this is always the number of target candidate peptides, even if the PSM involves a decoy peptide.            
24 distinct matches/spectrum The number of unique candidate peptides in the database found for this spectrum. Note that this is always the number of target candidate peptides, even if the PSM involves a decoy peptide.            
25 sequence The peptide sequence.          
26 cleavage type The cleavage rules for generating this peptide based on the user-specified enzyme specificity.        
27 protein id A comma-separated list of proteins in which this peptide appears. Following the protein name is a number in parentheses giving the start position of the peptide in the protein.    
28 flanking aa The amino acids that precede and follow this peptide in the parent protein ID.        
29 RAW The raw (unnormalized) count of spectra per peptide                      
30 SIN A protein quantification score                      
31 NSAF The normalized spectral abundace factor                      
32 dNSAF The distributed normalized spectral abundace factor                      
33 EMPAI The exponentially modified protein abundance index                      
34 parsimony rank The protein rank based on its spectral-counts score.                      
35 filename The name of the file in which the PSM appears.                  
36 psm_id PSM identifier.                      
37 protein group number Protein group index                      
38 peptides-scan.charge Peptides belonging to each protein in the group, with scan and charge (for example, EAMPK-001285.2)                  
39 proteins Proteins in the protein group. Each protein can be followed by peptides in parenthesis, if they belong only to this particular protein in the group. Since peptides are considered indistinguishable if they have identical amino acid sequences or they differ only by I/L or T/S in the same position in the peptide, the peptides shared by the group may not all be identical.                    
40 parent group numbers The numbers of the protein groups in the barista.target.proteins.txt that are supersets of this group in terms of peptides.                      
41 peptide mass mono Monoisotopic peptide mass                      
42 peptide mass average Average peptide mass                      
43 mass error(ppm)                      
44 protein id(loc) 1                      
45 protein id(loc) 2                      
46 by total                      
47 by observable (0-1200)                      
48 by observable bin (0-1200)                      
49 by observable (0-max)                      
50 by observable bin (0-max)                      
51 by observed bin                      
52 ion current total                      
53 ion current observed                      
54 ions observable bin (0-1200)