UMAP Visualization of SARS-CoV-2 Data in ChEMBL

Uniform Manifold Approximation and Projection (UMAP) uses the nearest neighbour graph to perform dimesion reduction for visualization. I have been recently using UMAP to visualize the chemical space coverage of compounds. This piece of code computes UMAP for SARS-CoV-2 Database in ChEMBL consisting of 6900 compounds (downloded in November 28, 2020) as an example. Please modify the path used to suit your own directory when using the same csv file, which could be downloaded from https://www.ebi.ac.uk/chembl/g/#browse/compounds/filter/_metadata.compound_records.src_id%3A52.