1.06
/ December 15, 2020; 3 years ago (2020-12-15)
Repository
github.com/IUPAC-InChI/InChI
Operating system
Microsoft Windows and Unix-like
Platform
IA-32 and x86-64
Available in
English
License
IUPAC / InChI Trust Licence
Website
www.inchi-trust.org
The International Chemical Identifier (InChI/ˈɪntʃiː/IN-chee or /ˈɪŋkiː/ING-kee) is a textual identifier for chemical substances, designed to provide a standard way to encode molecular information and to facilitate the search for such information in databases and on the web. Initially developed by the International Union of Pure and Applied Chemistry (IUPAC) and National Institute of Standards and Technology (NIST) from 2000 to 2005, the format and algorithms are non-proprietary. Since May 2009, it has been developed by the InChI Trust, a nonprofit charity from the United Kingdom which works to implement and promote the use of InChI.[3]
The identifiers describe chemical substances in terms of layers of information — the atoms and their bond connectivity, tautomeric information, isotope information, stereochemistry, and electronic charge information.[4]
Not all layers have to be provided; for instance, the tautomer layer can be omitted if that type of information is not relevant to the particular application. The InChI algorithm converts input structural information into a unique InChI identifier in a three-step process: normalization (to remove redundant information), canonicalization (to generate a unique number label for each atom), and serialization (to give a string of characters).
InChIs differ from the widely used CAS registry numbers in three respects: firstly, they are freely usable and non-proprietary; secondly, they can be computed from structural information and do not have to be assigned by some organization; and thirdly, most of the information in an InChI is human readable (with practice). InChIs can thus be seen as akin to a general and extremely formalized version of IUPAC names. They can express more information than the simpler SMILES notation and, in contrast to SMILES strings, every structure has a unique InChI string, which is important in database applications. Information about the 3-dimensional coordinates of atoms is not represented in InChI; for this purpose a format such as PDB can be used.
The InChIKey, sometimes referred to as a hashed InChI, is a fixed length (27 character) condensed digital representation of the InChI that is not human-understandable. The InChIKey specification was released in September 2007 in order to facilitate web searches for chemical compounds, since these were problematic with the full-length InChI.[5] Unlike the InChI, the InChIKey is not unique: though collisions are expected to be extremely rare, there are known collisions.[6]
In January 2009 the 1.02 version of the InChI software was released. This provided a means to generate so called standard InChI, which does not allow for user selectable options in dealing with the stereochemistry and tautomeric layers of the InChI string. The standard InChIKey is then the hashed version of the standard InChI string. The standard InChI will simplify comparison of InChI strings and keys generated by different groups, and subsequently accessed via diverse sources such as databases and web resources.
The continuing development of the standard has been supported since 2010 by the not-for-profit InChI Trust, of which IUPAC is a member. The current software version is 1.06 and was released in December 2020.[7] Prior to 1.04, the software was freely available under the open-source LGPL license,[8]
but it now uses a custom license called IUPAC-InChI Trust License.[9]
^"IUPAC International Chemical Identifier Project Page". IUPAC. Archived from the original on 27 May 2012. Retrieved 2012-12-05.
^Heller, S.; McNaught, A.; Stein, S.; Tchekhovskoi, D.; Pletnev, I. (2013). "InChI - the worldwide chemical structure identifier standard". Journal of Cheminformatics. 5 (1): 7. doi:10.1186/1758-2946-5-7. PMC 3599061. PMID 23343401.
^"The InChI Trust and IUPAC". InChI Trust. Retrieved August 22, 2022.
^Heller, S.R.; McNaught, A.; Pletnev, I.; Stein, S.; Tchekhovskoi, D. (2015). "InChI, the IUPAC International Chemical Identifier". Journal of Cheminformatics. 7: 23. doi:10.1186/s13321-015-0068-4. PMC 4486400. PMID 26136848.
^"The IUPAC International Chemical Identifier (InChI)". IUPAC. 5 September 2007. Archived from the original on October 30, 2007. Retrieved 2007-09-18.
^E.L. Willighagen (17 September 2011). "InChIKey collision: the DIY copy/pastables". Retrieved 2012-11-06.
^Goodman, Jonathan M.; Pletnev, Igor; Thiessen, Paul; Bolton, Evan; Heller, Stephen R. (December 2021). "InChI version 1.06: now more than 99.99% reliable". Journal of Cheminformatics. 13 (1): 40. doi:10.1186/s13321-021-00517-z. PMC 8147039. PMID 34030732.
^McNaught, Alan (2006). "The IUPAC International Chemical Identifier:InChl". Chemistry International. Vol. 28, no. 6. IUPAC. Retrieved 2007-09-18.
^"IUPAC/InChI-Trust Licence for the International Chemical Identifier (InChI) Software" (PDF). IUPAC/InChI-Trust. 2020. Retrieved 2022-08-09.
and 30 Related for: International Chemical Identifier information
The InternationalChemicalIdentifier (InChI /ˈɪntʃiː/ IN-chee or /ˈɪŋkiː/ ING-kee) is a textual identifier for chemical substances, designed to provide...
identification (the process of identifying), or an identifier (that is, an instance of identification). An identifier may be a word, number, letter, symbol...
husband of Rachael Dadd IChI (IUPAC chemicalidentifier), the original name for the InternationalChemicalIdentifier Ichi the Killer (disambiguation) Ichiban...
provides a numerical identifier, known as CAS registry number to each chemical substance that has been reported in the chemical literature (such as chemistry...
compound. This is achieved by the InternationalChemicalIdentifier (InChI) nomenclature. However, the American Chemical Society's CAS numbers nomenclature...
taken from all letters of the alphabet. In 1975, the International CODEN Service located at Chemical Abstracts Service (CAS) became responsible for further...
Measurements (IRMM) InternationalChemicalIdentifier (InChI) International Union of Biochemistry and Molecular Biology (IUBMB) International Union of Pure...
together has been greatly facilitated by the creation of the InternationalChemicalIdentifier (InChI) and associated software. Thus the standard InChI for...
The Publisher Item Identifier (PII) is a unique identifier used by a number of scientific journal publishers to identify documents. It uses the pre-existing...
The International Programme on Chemical Safety (IPCS) was formed in 1980 and is a collaboration between three United Nations bodies, the World Health...
The Unique Ingredient Identifier (UNII) is an alphanumeric identifier linked to a substance's molecular structure or descriptive information and is generated...
process extracts identifiers from the article abstract and puts those in a field called Secondary Identifier (SI). The secondary identifier field is to store...
OELib Chemistry Development Kit Chemical Markup Language Software for molecular modeling NCI/CADD ChemicalIdentifier Resolver Chen, V.B.; et al. (2009)...
for chemical nomenclature. Cell notation for representation of an electrochemical cell Dyson / IUPAC (1944) Hayward (1961) InternationalChemical Identifier...
reactions. The basic particle that constitutes a chemical element is the atom. Chemical elements are identified by the number of protons in the nuclei of their...
like PubChem3D, the Resource Description Framework, and the InternationalChemicalIdentifier. In June 2021 Willighagen announced his intention to step...
It is a physical science within the natural sciences that studies the chemical elements that make up matter and compounds made of atoms, molecules and...
receive different UN numbers. Associated with each UN number is a hazard identifier, which encodes the general hazard class and subdivision (and, in the case...
the sky by high-flying aircraft are actually "chemtrails" consisting of chemical or biological agents, sprayed for nefarious purposes undisclosed to the...
InternationalChemical Safety Cards (ICSC) are data sheets intended to provide essential safety and health information on chemicals in a clear and concise...
Chemistry WebBook NCI/CADD ChemicalIdentifier Resolver ChemSub Online (Multilingual chemical names) NIOSH Pocket Guide to Chemical Hazards, index of CAS numbers...
conventional weapons. The use of chemical weapons in international armed conflicts is prohibited under international humanitarian law by the 1925 Geneva...
The Chemical Weapons Convention (CWC), officially the Convention on the Prohibition of the Development, Production, Stockpiling and Use of Chemical Weapons...
involved. It is thus a vast and highly interdisciplinary field. Chemical ecologists seek to identify the specific molecules (i.e. semiochemicals) that function...
dual classification which use the Colour Index Generic Name (the prime identifier) and Colour Index Constitution Numbers. These numbers are prefixed with...
Chemical Abstracts Service (CAS) is a division of the American Chemical Society. It is a source of chemical information and is located in Columbus, Ohio...
the named object with an extra sequence number to make it into a unique identifier. Systematic names often co-exist with earlier common names assigned before...
liquid product listed in chapter 17 of the International Bulk Chemical Code. As well as industrial chemicals and clean petroleum products, such ships also...