91成人精品-国产对白在线-欧美城天堂网地址-精品1区2区3区|www.sanghaoran.com

熱點文獻

Bioinformatics

A Note on the False Discovery Rate of Novel Peptides in Proteogenomics

Zhang, K., Fu, Y., Zeng, W.-F., He, K., Chi, H., Liu, C., Li, Y.-C., Gao, Y., Xu, P., He, S.-M..

Motivation: Proteogenomics has been well accepted as a tool to discover novel genes. In most conventional proteogenomic studies, a global false discovery rate (FDR) is used to filter out false positives for identifying credible novel peptides. However, it has been found that the actual level of false positives in novel peptides is often out of control and behaves differently for different genomes. In order to quantitatively model this problem, we theoretically analyze the sub-group FDRs of annotated and novel peptides. Our analysis shows that the annotation completeness ratio of a genome is the dominant factor influencing the subgroup FDR of novel peptides. Experimental results on two real datasets of E. coli and M. tuberculosis support our conjecture.

Contact: yfu@amss.ac.cn, xupingghy@gmail.com, smhe@ict.ac.cn