UCSF

Shards Now

Type I binding sites

General Information

Created By
jji at cgl.ucsf.edu
Critieria
p.mwt < 190
Subset ID
27

ZINC Subset Overview

Last Updated
2014-09-24
Subset Size
424,775
Benign functionality only?
No filtering done

Quick Links

Browse
Sample molecules
Detailed view
Annotations view
Files
Properties
Purchasing
Unique Substances
Unix download
MOL2
SDF
Flexibase [Scripts to download database files to Linux/MacOS]
Windows download
MOL2
SDF
Flexibase [Scripts to download database files to Windows]

Chemical Diversity and Clustering

We assess the chemical diversity of a subset by clustering the molecules. First, we sort ligands by increasing molecular weight. Then, we use the SUBSET 1.0 algorithm ( Voigt JH, Bienfait B, Wang S, Nicklaus MC. JCICS, 2001, 41, 702-12) to progressively select compounds that differ from those previously selected by at least the Tanimoto cutoff, using ChemAxon default fingerprints. The resulting representatives have two interesting properties:

  • 1) Each representative differs from all the others by at least the Tanitmoto cutoff and
  • 2) All the molecules in the subset are within the Tanimoto cutoff of at least one representative.
Thus the representatives can be said to "cover" the chemical space of the subset at a given Tanimoto level. N/A indicates that clustering is pending.

Tanimoto Cutoff Level 60% 70% 80% 90% 100%
Number of Representatives 5,172 12,325 0 0 424,775

Physical Property Distributions

We compute the physical properties of each molecule in the subset, and graph them below.   Download Calculated Physical Properties