pFind Studio: a computational solution for mass spectrometry-based proteomics


pFind® is a search engine system for automated peptide and protein identification from tandem mass spectra.

Although many database search tools are available, challenges remain in identification reliability, sensitivity and usability. For example, the target-decoy database strategy is widely used to estimate the false discovery rate (FDR) of peptide identifications. However, users usually perform this estimation manually, and existing tools lack an automated module for it. Another problem is the speed of searching high-throughput spectra against huge peptide and protein databases. Improvements in the sensitivity of mass spectrometers and the rapid expansion of databases have increased the scope and complexity of searching. The traditional software architecture, i.e., running all tasks in a single stand-alone process without any data index, is increasingly inadequate.

The newest version of pFind incorporates several newly developed or improved algorithms, modules and workflows. The system incorporates the target-decoy database search strategy for automated FDR estimation: users only need to specify a required FDR before searching, and the system then calculates a threshold that achieves this FDR and filters the search results automatically. We also developed a toolbox to index protein databases for high-throughput applications and designed all modules under a parallel-processing-oriented architecture that distributes the computational load efficiently across many computers. These developments greatly improve the overall search speed.
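
The threshold calculation described above can be illustrated with a minimal Python sketch. It is not pFind's actual implementation. It assumes that each peptide-spectrum match carries a score where higher is better and a flag marking decoy hits, and that the FDR at a cutoff is estimated as the number of decoy hits divided by the number of target hits at or above that cutoff (one common target-decoy estimator). The function names fdr_score_threshold and filter_psms are hypothetical.

```python
from typing import List, Optional, Tuple

# A peptide-spectrum match as (score, is_decoy); this layout is an assumption
# made for illustration, not pFind's internal data model.
PSM = Tuple[float, bool]


def fdr_score_threshold(psms: List[PSM], target_fdr: float) -> Optional[float]:
    """Return the lowest score cutoff whose estimated FDR is <= target_fdr.

    The FDR at a cutoff is estimated as decoys / targets among all PSMs
    scoring at or above the cutoff. Returns None if no cutoff qualifies.
    """
    ranked = sorted(psms, key=lambda p: p[0], reverse=True)
    targets = 0
    decoys = 0
    threshold = None
    i = 0
    while i < len(ranked):
        score = ranked[i][0]
        # Count all PSMs that share this score before evaluating the cutoff,
        # so that tied scores are accepted or rejected together.
        while i < len(ranked) and ranked[i][0] == score:
            if ranked[i][1]:
                decoys += 1
            else:
                targets += 1
            i += 1
        if targets > 0 and decoys / targets <= target_fdr:
            threshold = score
    return threshold


def filter_psms(psms: List[PSM], target_fdr: float) -> List[PSM]:
    """Keep target PSMs that score at or above the computed threshold."""
    threshold = fdr_score_threshold(psms, target_fdr)
    if threshold is None:
        return []
    return [p for p in psms if not p[1] and p[0] >= threshold]


if __name__ == "__main__":
    example = [(10, False), (9, False), (8, False), (7, True),
               (6, False), (5, False), (4, True), (3, False)]
    print(fdr_score_threshold(example, 0.2))
    print(filter_psms(example, 0.2))
```

In this sketch the user supplies only the required FDR, mirroring the workflow described above; the cutoff is chosen as the most permissive score that still satisfies the requested FDR.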

