Loading icon

Revolutionizing Genomic Variant Calling: The Rise of DeepVariant and Machine Learning

Post banner image
Share:

DeepVariant, a variant calling tool developed by Google using deep learning techniques, has shown remarkable effectiveness in identifying single nucleotide polymorphisms (SNPs) and small insertions and deletions in genomic data. This tool stands out for its use of a deep neural network, which processes images of aligned reads to predict the presence and type of genetic variants. Unlike traditional variant calling methods that require specific adaptations or parameter adjustments, DeepVariant improves its performance through retraining with different types of sequencing data. A significant advantage of DeepVariant is its accuracy, which has been confirmed by various benchmarking studies. For instance, it has been shown to outperform other tools in calling single base variants and short insertions and deletions. Even when earlier versions of the software showed limitations with certain types of data (such as exome and PCR-amplified sequencing data), the inclusion of these data types in the training suite significantly enhanced its performance. Despite its impressive accuracy, DeepVariant does come with a computational cost. However, the tool is continually updated to leverage the rapidly evolving efficiencies in deep learning software and hardware. Furthermore, an efficient implementation is available on the Google Cloud Platform, which helps mitigate some of these computational challenges. In terms of future development, there's potential for expanding DeepVariant to call structural variants, a more complex and diverse category of genetic variation. The ongoing advancements in deep learning hardware and software are likely to further enhance the capabilities of DeepVariant. In summary, DeepVariant represents a significant advancement in the field of genomic variant calling. Its deep learning-based approach not only yields high accuracy but also demonstrates the potential to adapt to various types of genomic data. This makes it a valuable tool for both research and clinical applications in genomics. For more detailed insights into DeepVariant and its capabilities, you might want to explore the original articles published in Nature Biotechnology, Nature Methods, and Nature Communications. These articles provide comprehensive information about the development, performance, and future potential of DeepVariant in the field of genomics​​​​​​​​. Reference:

Poplin, R., Chang, P. C., Alexander, D., Schwartz, S., Colthurst, T., Ku, A., ... & DePristo, M. A. (2018). A universal SNP and small-indel variant caller using deep neural networks. Nature biotechnology, 36(10), 983-987.
Nawy, T. (2018). Variants from the deep. Nature Methods, 15(11), 861-861.
Luo, R., Sedlazeck, F. J., Lam, T. W., & Schatz, M. C. (2019). A multi-task convolutional deep neural network for variant calling in single molecule sequencing. Nature communications, 10(1), 998.
Supernat, A., Vidarsson, O. V., Steen, V. M., & Stokowy, T. (2018). Comparison of three variant callers for human whole genome sequencing. Scientific reports, 8(1), 17851.