This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages)
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Applicability domain" – news · newspapers · books · scholar · JSTOR(November 2009) (Learn how and when to remove this message)
This article may be too technical for most readers to understand. Please help improve it to make it understandable to non-experts, without removing the technical details.(November 2009) (Learn how and when to remove this message)
(Learn how and when to remove this message)
The applicability domain (AD) (for both chemistry and machine learning) of a QSAR model is the physico-chemical, structural or biological space, knowledge or information on which the training set of the model has been developed, and for which it is applicable to make predictions for new compounds.
The purpose of AD is to state whether the model's assumptions are met, and for which chemicals the model can be reliably applicable. In general, this is the case for interpolation rather than for extrapolation. Up to now there is no single generally accepted algorithm for determining the AD: a comprehensive survey can be found in a Report and Recommendations of ECVAM Workshop 52.[1] There exists a rather systematic approach for defining interpolation regions.[2] The process involves the removal of outliers and a probability density distribution method using kernel-weighted sampling.
Another widely used approach for the structural AD of the regression QSAR models is based on the leverage calculated from the diagonal values of the hat matrix of the modeling molecular descriptors.[3][4][5]
A recent rigorous benchmarking study of several AD algorithms identified standard-deviation of model predictions as the most reliable approach.[6]
To investigate the AD of a training set of chemicals one can directly analyse properties of the multivariate descriptor space of the training compounds or more indirectly via distance (or similarity) metrics. When using distance metrics care should be taken to use an orthogonal and significant vector space. This can be achieved by different means of feature selection and successive principal components analysis.
^Netzeva T, Worth A, Aldenberg T, Benigni R, Cronin M, Gramatica P, Jaworska J, Kahn S, Klopman G, Marchant C, Myatt G, Nikolova-Jeliazkova N, Patlewicz G, Perkins R, Roberts D, Schultz T, Stanton D, van de Sandt J, Tong W, Veith G, Yang C: Current status of methods for defining the applicability domain of (Quantitative) Structure–Activity Relationships. Altern Lab Anim 2005, 33: 1-19
^Jaworska J, Nikolova-Jeliazkova N, Aldenberg T: QSAR applicability domain estimation by projection of the training set descriptor space: a review. Altern Lab Anim 2005, 33(5):445-459
^Tropsha A, Gramatica P, Gombar VK, The importance of being Earnest: Validation is the absolute essential for successful application and interpretation of QSPR models. QSAR Comb.Sci. 2003, 22: 69-77
^Gramatica P, Principles of QSAR models validation: internal and external QSAR Comb.Sci. 2007, 26(5): 694-701
^Tetko IV, Sushko I, Pandey AK, Zhu H, Tropsha A, Papa E, Oberg T, Todeschini R, Fourches D, Varnek A. Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: focusing on applicability domain and overfitting by variable selection. J Chem Inf Model. 2008 Sep;48(9):1733-46.
and 17 Related for: Applicability domain information
The applicabilitydomain (AD) (for both chemistry and machine learning) of a QSAR model is the physico-chemical, structural or biological space, knowledge...
Applicability may refer to: Direct applicability, concept of European Union constitutional law that regulations require no implementing legislation within...
The public domain (PD) consists of all the creative work to which no exclusive intellectual property rights apply. Those rights may have expired, been...
In the Internet, a domain name is a string that identifies a realm of administrative autonomy, authority or control. Domain names are often used to identify...
in Predictive Modeling. A Transparent and Flexible Alternative to ApplicabilityDomain Determination". Journal of Chemical Information and Modeling. 54...
generally, convolution in one domain (e.g., time domain) equals point-wise multiplication in the other domain (e.g., frequency domain). Other versions of the...
can be included in different domain knowledge. Knowledge which may be applicable across a number of domains is called domain-independent knowledge, for...
unknown or not applicable. The data domain for the marital status column is: "M", "S". In a normalized data model, the reference domain is typically specified...
to works previously in the public domain. Initially, Judge Babcock held that the First Amendment was not applicable to resurrecting foreign copyright...
field constant. However this is not applicable to ferromagnets due to the variation of magnetization from domain to domain. In this case, the interaction field...
in the generation, applicabilitydomain and ring systems are detected based on inverse QSPR/QSAR analysis. The applicabilitydomain, or target area, is...
in the time domain) to a function of a complex variable s {\displaystyle s} (in the complex-valued frequency domain, also known as s-domain, or s-plane)...
database. A relation is in first normal form if and only if no attribute domain has relations as elements. Or more informally, that no table column can...
Domain drop catching, also known as domain sniping, is the practice of registering a domain name once registration has lapsed, immediately after expiry...
disputes. .ZA domain name disputes types: Abusive registration Offensive registration Alternative Dispute Resolution (ADR) only applicable to un-moderated...
Domain engineering, is the entire process of reusing domain knowledge in the production of new software systems. It is a key concept in systematic software...