TY - JOUR
T1 - Molecular similarity based estimation of properties
T2 - a comparison of structure spaces and property spaces.
AU - Gute, Brian D
AU - Grunwald, G. D.
AU - Mills, D.
AU - Basak, Subhash C
PY - 2001/2
Y1 - 2001/2
N2 - Molecular similarity methods have emerged as powerful tools in analog selection, chemical classification based on toxic modes of action, and property estimation. The basic assumption of structure-activity relationships (SAR) is that similar structures usually have similar properties. Therefore, similarity methods can be used for the selection of analogs and estimation of properties of chemicals from their structural analogs in property spaces. Each similarity method is user defined. Its efficacy depends on the set of descriptors used to define the intermolecular similarity of chemicals as well as on the mathematical function used to quantify similarity. Also, similarity methods can be based on experimental data or computed molecular descriptors. We have carried out a comparative study of similarity spaces derived from experimental data vis-a-vis computed structural parameters for two sets of chemicals: (a) a diverse set of 76 chemicals derived from the TSCA Inventory and (b) the 166 structurally distinct constituents of JP-8 identified by GC/MS. Property spaces for these two sets of chemicals were created using experimental and calculated physicochemical properties. Atom pairs (APs) and topological indices calculated by POLLY v2.3 were used to create theoretical structure spaces. These spaces were used for the KNN-based estimation of properties with K=1--10, 15, 20, 25. The results will be presented with a comparative analysis of the effectiveness of property spaces and structure spaces in analog selection and property estimation.
AB - Molecular similarity methods have emerged as powerful tools in analog selection, chemical classification based on toxic modes of action, and property estimation. The basic assumption of structure-activity relationships (SAR) is that similar structures usually have similar properties. Therefore, similarity methods can be used for the selection of analogs and estimation of properties of chemicals from their structural analogs in property spaces. Each similarity method is user defined. Its efficacy depends on the set of descriptors used to define the intermolecular similarity of chemicals as well as on the mathematical function used to quantify similarity. Also, similarity methods can be based on experimental data or computed molecular descriptors. We have carried out a comparative study of similarity spaces derived from experimental data vis-a-vis computed structural parameters for two sets of chemicals: (a) a diverse set of 76 chemicals derived from the TSCA Inventory and (b) the 166 structurally distinct constituents of JP-8 identified by GC/MS. Property spaces for these two sets of chemicals were created using experimental and calculated physicochemical properties. Atom pairs (APs) and topological indices calculated by POLLY v2.3 were used to create theoretical structure spaces. These spaces were used for the KNN-based estimation of properties with K=1--10, 15, 20, 25. The results will be presented with a comparative analysis of the effectiveness of property spaces and structure spaces in analog selection and property estimation.
UR - http://www.scopus.com/inward/record.url?scp=0035258059&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0035258059&partnerID=8YFLogxK
U2 - 10.1080/10629360108035359
DO - 10.1080/10629360108035359
M3 - Article
C2 - 11328710
AN - SCOPUS:0035258059
SN - 1062-936X
VL - 11
SP - 363
EP - 382
JO - SAR and QSAR in environmental research
JF - SAR and QSAR in environmental research
IS - 5-6
ER -