| 1 | 8/25 | Introduction slides | | |
| 8/27 | Sequence Modelling and Generation with Transformer slides | | |
| 2 | 9/1 | Modeling Structure with Graph Neural Networks slides | | |
| 9/3 | Single-cell biology Research Overview slides | | |
| 3 | 9/8 | Diffusion Models slides; Small molecule drug design via MARS slides | MARS: Markov Molecular Sampling for Multi-objective Drug Discovery; Multi-Objective Molecule Generation using Interpretable Substructures | |
| 9/10 | Multi-domain Distribution Learning for De Novo Drug Design | A 3D Generative Model for Structure-Based Drug Design; Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets; Structure-based drug design with equivariant diffusion models; Reinforced genetic algorithm for structure-based drug design | |
| 4 | 9/15 | Simulating 500 million years of evolution with a language model | ESM2 and ProGen2 | |
| 9/17 | Importance Weighted Expectation-Maximization for Protein Sequence Design | Proximal Exploration for Model-guided Protein Sequence Design | |
| 5 | 9/22 | Robust deep learning based protein sequence design using ProteinMPNN | A Deep SE(3)-Equivariant Model for Learning Inverse Protein Folding | |
| 9/24 | Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates | Scaffolding protein functional sites using deep learning | |
| 6 | 9/29 | Highly accurate protein structure prediction with AlphaFold (AlphaFold2) | Protein Structure Prediction: AlphaFold/AlphaFold2 | |
| 10/1 | De novo design of protein structure and function with RFdiffusion | Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem; PPDiff: Diffusing in Hybrid Sequence-Structure Space for Protein-Protein Complex Design | |
| 7 | 10/6 | De novo design of luciferases using deep learning | | |
| 10/8 | Accurate structure prediction of biomolecular interactions with AlphaFold 3 | RosettaFold3, RosettaFoldAllAtom | |
| 8 | 10/13, 10/15 | Fall break, no class | | |
| 9 | 10/20 | Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling | DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome; Nucleotide Transformer; Hyenadna: Long-range genomic sequence modeling at single nucleotide resolution; Dnabert-2: Efficient foundation model and benchmark for multi-species genome | |
| 10/22 | Predicting DNA structure using a deep learning method | | |
| 10 | 10/27 | CodonBERT: Large Language Models for mRNA design and optimization | | |
| 10/29 | AlphaGenome: advancing regulatory variant effect prediction with a unified DNA sequence model | A sequence-based global map of regulatory activity for deciphering human genetics | |
| 11 | 11/3 | no class | | |
| 11/5 | Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences | | |
| 12 | 11/10 | A programmable reaction-diffusion system for spatiotemporal cell signaling circuit design | | |
| 11/12 | Transfer learning enables predictions in network biology | | |
| 13 | 11/17 | scGPT: toward building a foundation model for single-cell multi-omics using generative AI | scGPT: toward building a foundation model for single-cell multi-omics using generative AI; Universal Cell Embeddings: A Foundation Model for Cell Biology | |
| 11/19 | FlowMol3: Flow Matching for 3D De Novo Small-Molecule Generation | Autonomous, multiproperty-driven molecular discovery: From predictions to measurements and back | |
| 14 | 11/24 | A small-molecule TNIK inhibitor targets fibrosis in preclinical and clinical models | | |
| 11/26 | thanksgiving, no class | | |
| 15 | 12/1 | Conditional Antibody Design as 3D Equivariant Graph Translation | End-to-End Full-Atom Antibody Design; Conditional Antibody Design as 3D Equivariant Graph Translation; Atomically accurate de novo design of single-domain antibodies | |
| 12/3 | A generative deep learning approach to de novo antibiotic design | | |
| Optional | | | |
| | Base-resolution models of transcription-factor binding reveal soft motif syntax | | |