Tally-2.0: upgraded validator of tandem repeat detection in protein sequences

Perovic, Leclercq, Sumonja, Richard, Veljkovic, Kajava (2020) Tally-2.0: upgraded validator of tandem repeat detection in protein sequences Bioinformatics (IF: 5.8) 36(10) 3260-3262
Full Text
Full text

Click the PDF icon to view the full text of the paper

Abstract

Proteins containing tandem repeats (TRs) are abundant, frequently fold in elongated non-globular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. A blurred boundary between imperfect TR motifs and non-repetitive sequences gave rise to necessity to validate the detected TRs.Tally-2.0 is a scoring tool based on a machine learning (ML) approach, which allows to validate the results of TR detection. It was upgraded by using improved training datasets and additional ML features. Tally-2.0 performs at a level of 93% sensitivity, 83% specificity and an area under the receiver operating characteristic curve of 95%.Tally-2.0 is available, as a web tool and as a standalone application published under Apache License 2.0, on the URL https://bioinfo.crbm.cnrs.fr/index.php? route=tools&tool=27. It is supported on Linux. Source code is available upon request.Supplementary data are available at Bioinformatics online.© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Links

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214015
http://www.ncbi.nlm.nih.gov/pubmed/32096820
http://dx.doi.org/10.1093/bioinformatics/btaa121

Similar articles

Tools