Slide 6
Slide 6 text
6
The Current State of Evaluation Metrics for Generative Models
Distribution metrics (From MOS )
- FCD (Fréchet ChemNet Distances)
- SNN (Similarity to Nearest Neighbor)
- Scaffold Similarity
- Validity
- Uniqueness
- Filters (% passing MOSES defined smarts filters)
- Novelty
- IntDiv (Can detect mode collapse of generative models)
- Fragment similarity
Oracle based metrics
(From TDC and GuacaMol)
Molecule generation with a desired property
in mind.
- Docking score
- ML based score (DRD2, JNK3, GSK3B)
- Similarity to another molecules
- Rediscovery
- Isomer identification
- Property optimization (LogP, QED, SA)
- Scaffold hops
- …
Generate
Feedback
Oracle
Training
Learned distribution
[1] Polykosvkiy et al. “Molecular Sets (MOSES): A Benchmarking Platform for
Molecular Generation Models”
https://arxiv.org/abs/1811.12823
[2] Huang et al. “Therapeutics Data Commons: Machine Learning Datasets and
Tasks for Drug Discovery and Development”
https://arxiv.org/abs/2102.09548
[3] Brown et al. “GuacaMol: Benchmarking Models for De Novo
Molecular Design”
https://arxiv.org/abs/1811.09621
[1]
[2] [3]