Artificial intelligence Biology and medicine
Gene study in a DNA chain. Mutations and genetic diseases. Gene therapy modification of cells to produce a therapeutic effect. Family tree and pedigree. Disease propensity. Paternity confirmation. Mostra meno
Di Andrii Yalanskyi

Researchers at Google DeepMind have used artificial intelligence to predict whether mutations in human genes are likely to be harmful.

The AlphaMissense algorithm for predicting the effects of missense mutations in human DNA has been developed by artificial intelligence giant Google DeepMind.

The new tool could be very useful in the diagnosis and development of therapies, in particular for the treatment of rare diseases.

Amino acids and DNA mutations

The amino acids – the building blocks of proteins – are determined by the triplets of nucleotides – the building blocks of the genome.

If a genetic mutation alters these triplets (by replacing one base on the DNA strand with another), a different amino acid is inserted into the protein. This is called a missense mutation: the resulting protein may function differently than expected, or not at all.

Real implications

Such mutations are unpredictable and, until now, it has been very difficult to predict how they are going to turn out.

Most are harmless. However, some can be dangerous because they predispose to the development of diseases. These mutations cause genetic diseases such as cystic fibrosis and sickle cell anaemia.

Of more than 4 million observed missense variants, only an estimated 2% have been clinically classified as pathogenic or benign, while the vast majority of them are of unknown clinical significance.

A typical human has 9,000 such mutations throughout their genome. Fortunately, most of these are harmless.

The artificial intelligence

DeepMind researchers, led by Jun Cheng, developed the AlphaMissense algorithm using artificial intelligence.

Using knowledge from Alphafold (an AI that can quickly predict any structure a protein will take on from its amino acid sequence), it is able to ‘guess’ whether a mutated protein looks ‘natural’ or not; and in the latter case, it determines how harmful the mutation may be by assigning it a risk score.

«If we substitute a word from an English sentence, a person that is familiar with English can immediately see whether this word substitution will change the meaning of the sentence or not.» explains Jun Cheng of Google DeepMind.

AI training

AlphaMissense has been trained using DNA data from humans and closely related primates to identify common missense mutations, which are likely to be benign, and distinguish them from rarer, potentially harmful ones. The algorithm also studied millions of protein sequences to understand the ‘language’ of healthy proteins.

Once trained, the algorithm could evaluate missense mutations by generating a score indicating their disease-causing potential.

Results and future prospects

AlphaMissense was able to analyse and classify 89% of the 71 million missense mutations, predicting that 57% of them could be harmless and 32% could cause disease (with the remainder uncertain).

Experts are quick to point out that the system is still a work in progress and that it will be some time before it is clinically reliable. But the use of artificial intelligence could have a significant impact on the diagnosis and development of therapies for genetic diseases, especially rare ones.

Sources and Data