Multiple sequence alignment (MSA) lies at the heart of comparative genomics, structural biology and evolutionary inference. By arranging three or more nucleotide or amino acid sequences in a matrix, ...
Self-supervised neural language models with attention have recently been applied to biological sequence data, advancing structure, function and mutational effect prediction. Some protein language ...
Historical introduction and overview -- Collecting and storing sequences in the laboratory -- Alignment of pairs of sequences -- Introduction to probability and statistical analysis of sequence ...