Justin S. Smith,
Roman Zubatyuk,
Benjamin Nebgen,
Nicholas Lubbers,
Kipton Barros,
Adrian E. Roitberg,
Olexandr Isayev,
Sergei Tretiak
Version 0 of Colab Fit Dataset published 2023 via ColabFit
ANI-1x contains DFT calculations for approximately 5 million molecular conformations. Starting from an initial training set, an active learning method was used to iteratively add conformations in regions where the existing data showed insufficient diversity. Additional conformations were sampled from existing databases of molecules, such as GDB-11 and ChEMBL. On each of these configurations, one of four sampling methods was applied: molecular dynamics sampling, normal mode sampling, dimer sampling, or torsion sampling.
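The iterative selection described above can be illustrated with a toy sketch. This is not the authors' pipeline. It assumes a one-dimensional "conformation" coordinate, a hypothetical `reference_energy` function standing in for a DFT calculation, a committee of polynomial fits standing in for an ensemble of learned potentials, and an arbitrary disagreement `threshold`. Committee disagreement is used here as one possible proxy for "insufficient diversity"; the description above does not specify the criterion.

```python
import numpy as np


def reference_energy(x):
    """Hypothetical stand-in for an expensive DFT single-point calculation."""
    return np.sin(3.0 * x) + 0.5 * x**2


def committee_predict(x_train, y_train, x_query, degrees=(2, 3, 4, 5)):
    """Predict with a committee of polynomial fits of different degrees.

    Returns an array of shape (n_models, n_query).
    """
    preds = [np.polyval(np.polyfit(x_train, y_train, d), x_query) for d in degrees]
    return np.array(preds)


def active_learning(x_pool, x_init, n_rounds=5, batch=3, threshold=0.05):
    """Iteratively label the pool points where the committee disagrees most."""
    x_train = np.array(x_init, dtype=float)
    y_train = reference_energy(x_train)
    pool = np.array(x_pool, dtype=float)

    for _ in range(n_rounds):
        if pool.size == 0:
            break
        # Standard deviation across committee members, one value per pool point.
        disagreement = committee_predict(x_train, y_train, pool).std(axis=0)
        # Take up to `batch` points with the largest disagreement ...
        order = np.argsort(disagreement)[::-1][:batch]
        # ... but only those above the (assumed) threshold.
        chosen = order[disagreement[order] > threshold]
        if chosen.size == 0:
            break  # the committee agrees everywhere; nothing new to add
        x_new = pool[chosen]
        x_train = np.concatenate([x_train, x_new])
        y_train = np.concatenate([y_train, reference_energy(x_new)])
        pool = np.delete(pool, chosen)

    return x_train, y_train, pool


# Usage: start from a narrow initial set and let the loop extend coverage.
x_pool = np.linspace(-2.0, 2.0, 41)
x_init = np.linspace(-0.5, 0.5, 8)
x_train, y_train, remaining = active_learning(x_pool, x_init)
print(len(x_init), "->", len(x_train), "labeled points")
```

In a real workflow the pool would hold candidate conformations produced by the sampling methods listed above, and each selected point would trigger a new reference calculation before the models are refit.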