In vitro and in silico analysis reveals an efficient algorithm to predict the splicing consequences of mutations at the 5′ splice sites

K Sahashi, A Masuda, T Matsuura, J Shinmi… - Nucleic acids …, 2007 - academic.oup.com
K Sahashi, A Masuda, T Matsuura, J Shinmi, Z Zhang, Y Takeshima, M Matsuo, G Sobue…
Nucleic acids research, 2007academic.oup.com
We have found that two previously reported exonic mutations in the PINK1 and PARK7
genes affect pre-mRNA splicing. To develop an algorithm to predict underestimated splicing
consequences of exonic mutations at the 5′ splice site, we constructed and analyzed 31
minigenes carrying exonic splicing mutations and their derivatives. We also examined 189
249 U2-dependent 5′ splice sites of the entire human genome and found that a new
variable, the SD-Score, which represents a common logarithm of the frequency of a specific …
Abstract
We have found that two previously reported exonic mutations in the PINK1 and PARK7 genes affect pre-mRNA splicing. To develop an algorithm to predict underestimated splicing consequences of exonic mutations at the 5′ splice site, we constructed and analyzed 31 minigenes carrying exonic splicing mutations and their derivatives. We also examined 189 249 U2-dependent 5′ splice sites of the entire human genome and found that a new variable, the SD-Score, which represents a common logarithm of the frequency of a specific 5′ splice site, efficiently predicts the splicing consequences of these minigenes. We also employed the information contents (R i ) to improve the prediction accuracy. We validated our algorithm by analyzing 32 additional minigenes as well as 179 previously reported splicing mutations. The SD-Score algorithm predicted aberrant splicings in 198 of 204 sites (sensitivity = 97.1%) and normal splicings in 36 of 38 sites (specificity = 94.7%). Simulation of all possible exonic mutations at positions −3, −2 and −1 of the 189 249 sites predicts that 37.8, 88.8 and 96.8% of these mutations would affect pre-mRNA splicing, respectively. We propose that the SD-Score algorithm is a practical tool to predict splicing consequences of mutations affecting the 5′ splice site.
Oxford University Press