A knowledge-based distance metric highlights underperformance of variant effect predictors on gain-of-function missense variants

This article has 1 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

Most current variant effect predictors (VEPs) are less effective at identifying non-loss-of-function (non-LOF) disease variants, which arise due to a gain of function (GOF) or dominant-negative (DN) mechanism. Previously, we showed that Extent of Disease Clustering (EDC), a structure-derived metric, can be used to prioritize putative non-LOF genes due to the tendency of their pathogenic missense variants to cluster. We also introduced a structure-derived spatial distance metric, Spatial Proximity to Disease Variants (SPDV), which proved useful for scoring RyR1 variants. Here, we apply EDC and SPDV across thousands of human disease genes. In comparison with 72 variant effect predictors (VEPs), SPDV exceeds or is at least as effective as half of the tested models at identifying dominant disease variants. When applied to GOF variants, SPDV performs similarly to top VEPs. We show that SPDV performance improves with increasing target EDC values and identify an EDC threshold after which SPDV consistently surpasses all other predictors. Its use as an analytical metric enables the identification of a subset of putative GOF disease genes where current VEP methods underperform relative to SPDV, indicating they may be improved by incorporating spatial distance information.

Related articles

Related articles are currently not available for this article.