Posters 
Abstract
Implementing Sequest Cluster Protein Database MS/MS Searches Through the Computational Portal and Analysis System (CPAS) for High Throughput Proteomics Experiments
 
Bill Nelson1, Josh Eckels2, Brendan MacLean2, Chee Hong Wong3, Sum Thai Wong3, Jon Klein4, Bert Lynn1

The Computational Portal and Analysis System (CPAS) is an open-source, extensible infrastructure for evaluating and publishing data from high throughput biological experiments. CPAS is a three-tiered web application written with the Sun Microsystems Java Enterprise Edition. Currently CPAS’s most developed module is for MS/MS proteomics experiments.

The original installation of CPAS installed the X!Tandem search engine and the Institute for System Biology’s Trans-Proteomic Pipeline(TPP). From the CPAS user interface, data files from MS/MS analyses of complex protein samples can be automatically processed through a pipeline that searches MS spectra against protein databases, assigns peptide search hits to proteins, and performs relative quantitation of normal vs. other, isotope labeled samples. Simpler experiments are also possible.

X!Tandem is a powerful open-source search engine but there are several other search engines that are in common use; X!Tandem (www.thegpm.org), Sequest (Thermo Scientific), and Mascot (Matrix Science) are probably the most widely used. Each search engine has its own strengths and weaknesses; therefore their complementary results are a powerful tool for validation of identified peptides. The standard data formats implemented by the TPP (mzXML, pepXML) allow the direct comparison of results produced by different search engines and MS instrument manufacturers.

Because of licensing costs and limited IT resources implementing multiple search engines and an analysis pipeline is a serious endeavor. Through open-source development contributions - Mascot integration by the Bioinformatics Institute, Matrix, Singapore (CPAS version 1.7) and Sequest Cluster integration by the University of Kentucky, Lexington, Kentucky (CPAS version 2.0) - CPAS lessens the resources required to implement a complex data analysis pipeline and provides a mechanism for multiple labs to share a common resource.

1University of Kentucky, Lexington, KY, USA
2Labkey Software, Seattle, WA, USA
3Bioinformatics Institute, Singapore
4University of Louisville, Louisville, KY, USA