Pride Wizard: generation of standards compliant quantitative proteomics data

The introduction of the Proteomics Identifications Database (PRIDE) provided the proteomics community with a standards-compliant repository for proteomics data. PRIDE implements standards put forward by the Proteomics Standards Initiative (PSI), including mzData.

Many commonly used proteomics software packages do not currently support these standards. Consequently, formatting data to adhere to the PRIDE schema requires writing data parsers to perform the conversion. To address this, the Pride Wizard has been introduced to convert more commonly used file formats into documents that adhere to the PRIDE schema. Examples of these formats include .mgf files for peak lists and Mascot .dat files for peptide and protein identifications.
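To illustrate the kind of parsing step such a conversion involves, the following is a minimal Python sketch that reads peak lists from text in the Mascot Generic Format (.mgf). It is not the Pride Wizard's implementation: the `Spectrum` class and `parse_mgf` function are hypothetical names introduced here, and the sketch assumes a simple file containing only `BEGIN IONS` / `END IONS` blocks with `KEY=value` headers and whitespace-separated m/z and intensity pairs. Mapping the parsed spectra onto the PRIDE schema is not shown.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Tuple


@dataclass
class Spectrum:
    """One peak list read from an MGF block (hypothetical helper type)."""
    params: Dict[str, str] = field(default_factory=dict)   # e.g. TITLE, PEPMASS, CHARGE
    peaks: List[Tuple[float, float]] = field(default_factory=list)  # (m/z, intensity)


def parse_mgf(lines: Iterable[str]) -> List[Spectrum]:
    """Collect every BEGIN IONS ... END IONS block into a Spectrum."""
    spectra: List[Spectrum] = []
    current = None
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and comment lines.
        if not line or line[0] in "#;!/":
            continue
        if line == "BEGIN IONS":
            current = Spectrum()
        elif line == "END IONS":
            if current is not None:
                spectra.append(current)
            current = None
        elif current is not None:
            if "=" in line and not line[0].isdigit():
                # Header line inside a block, such as PEPMASS=498.27
                key, value = line.split("=", 1)
                current.params[key.strip().upper()] = value.strip()
            else:
                # Peak line: m/z followed by an optional intensity.
                fields = line.split()
                mz = float(fields[0])
                intensity = float(fields[1]) if len(fields) > 1 else 0.0
                current.peaks.append((mz, intensity))
    return spectra


# Made-up example data, for illustration only.
example = """BEGIN IONS
TITLE=Example spectrum
PEPMASS=498.27 12345.0
CHARGE=2+
112.087 45.0
175.119 230.5
END IONS
""".splitlines()

spectra = parse_mgf(example)
for spectrum in spectra:
    print(spectrum.params["TITLE"], len(spectrum.peaks), "peaks")
```

Keeping the header parameters as an untyped dictionary is a deliberate simplification: a real converter would need to interpret fields such as `PEPMASS` and `CHARGE` before writing them to a standards-compliant document.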

The existing PRIDE schema has no provision for quantitative proteomics data. A new controlled vocabulary is introduced here to allow storage of quantitative iTRAQ data. It is envisaged that this vocabulary can be extended to allow submission of quantitative data from other labelling techniques.

The tool has been used to populate a PRIDE database with mass spectra and the associated protein identifications and quantifications from a comprehensive set of proteomics experiments, performed as part of a systems biology study of growth rate in the eukaryotic cell.

Author information

Correspondence to Neil Swainston.

Keywords

  • Systems Biology
  • Protein Identification
  • Proteomics Data
  • Controlled Vocabulary
  • Peak List