Database of protein families, domains and functional sites
InterPro
Content
Description
InterPro functionally analyzes protein sequences and classifies them into protein families while predicting the presence of domains and functional sites.
Contact
Research center
EMBL
Laboratory
European Bioinformatics Institute
Primary citation
The InterPro protein families and domains database:
20 years on[1]
Release date
1999
Access
Website
www.ebi.ac.uk/interpro/
Download URL
ftp.ebi.ac.uk/pub/databases/interpro/
Miscellaneous
Data release frequency
8-weekly
Version
97.0 (9 November 2023; 7 months ago (2023-11-09))
InterPro is a database of protein families, protein domains and functional sites in which identifiable features found in known proteins can be applied to new protein sequences[2] in order to functionally characterise them.[3][4]
The contents of InterPro consist of diagnostic signatures and the proteins that they significantly match. The signatures consist of models (simple types, such as regular expressions or more complex ones, such as Hidden Markov models) which describe protein families, domains or sites. Models are built from the amino acid sequences of known families or domains and they are subsequently used to search unknown sequences (such as those arising from novel genome sequencing) in order to classify them. Each of the member databases of InterPro contributes towards a different niche, from very high-level, structure-based classifications (SUPERFAMILY and CATH-Gene3D) through to quite specific sub-family classifications (PRINTS and PANTHER).
InterPro's intention is to provide a one-stop-shop for protein classification, where all the signatures produced by the different member databases are placed into entries within the InterPro database. Signatures which represent equivalent domains, sites or families are put into the same entry and entries can also be related to one another. Additional information such as a description, consistent names and Gene Ontology (GO) terms are associated with each entry, where possible.
^Blum M, Chang HY, Chuguransky S, Grego T, Kandasaamy S, Mitchell A, et al. (November 2020). "The InterPro protein families and domains database: 20 years on". Nucleic Acids Research. 49 (D1): D344–D354. doi:10.1093/nar/gkaa977. PMC 7778928. PMID 33156333.
^Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, et al. (January 2012). "InterPro in 2011: new developments in the family and domain prediction database". Nucleic Acids Research. 40 (Database issue): D306-12. doi:10.1093/nar/gkr948. PMC 3245097. PMID 22096229.
^Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, et al. (January 2001). "The InterPro database, an integrated documentation resource for protein families, domains and functional sites". Nucleic Acids Research. 29 (1): 37–40. doi:10.1093/nar/29.1.37. PMC 29841. PMID 11125043.
^Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, et al. (December 2000). "InterPro--an integrated documentation resource for protein families, domains and functional sites". Bioinformatics. 16 (12): 1145–50. doi:10.1093/bioinformatics/16.12.1145. PMID 11159333.
from the public domain Pfam and InterPro: IPR018804 This article incorporates text from the public domain Pfam and InterPro: IPR007959 This article incorporates...
Both chains share an all alpha-helical structure. Proteins matching the InterPro family signature for Fel d 1 parts is widespread among Theria, a subclass...
GTP hydrolysis by the α-subunit (InterPro: IPR001019), which can then re-bind the βγ-dimer (InterPro: IPR001632 InterPro: IPR001770) and the receptor. RGS...
hdl:2433/175269. PMID 23370115. Chang HY (31 March 2015). "The Sweetest Thing". InterPro Protein Focus. Higginbotham JD (1986). Gelardi RC, Nabors LO (eds.). Alternative...
doi:10.1016/S0021-9258(19)85334-6. PMID 8349613. "Laminin G domain". InterPro. European Bioinformatics Institute. Retrieved 22 February 2016. Tisi D...
InterPro: IPR002327 Cytochrome c, class IC InterPro: IPR008168 Cytochrome c, class ID InterPro: IPR002324 Cytochrome c, class IE InterPro: IPR002323 The heme group in...
Glucose polymer used as energy store in animals Glucose 6-phosphatase – InterPro FamilyPages displaying wikidata descriptions as a fallback Hexose phosphate...
Annexin, type I InterPro: IPR002388 Annexin, type II InterPro: IPR002389 Annexin, type III InterPro: IPR002390 Annexin, type IV InterPro: IPR002391 Annexin...
Oxidoreductase This article incorporates text from the public domain Pfam and InterPro: IPR015409 Holmes RS, Goldberg E (October 2009). "Computational analyses...
three distinct structural domains: an N-terminal helical bundle domain (InterPro: IPR005639) involved in membrane insertion and pore formation; a beta-sheet...
from the public domain Pfam and InterPro: IPR022406 This article incorporates text from the public domain Pfam and InterPro: IPR022405 This article incorporates...
from the public domain Pfam and InterPro: IPR013096 This article incorporates text from the public domain Pfam and InterPro: IPR006045 This article incorporates...
gangrene This article incorporates text from the public domain Pfam and InterPro: IPR013510 Gerard J. Tortora, Berdell R. Funke, Cristine L. Case (2007)...
zostericola (sipunculid worm) IPR002063 – InterPro entry for hemerythrin This article incorporates text from the public domain Pfam and InterPro: IPR012312...
from the public domain Pfam and InterPro: IPR005479 This article incorporates text from the public domain Pfam and InterPro: IPR005480 This article incorporates...
factor function. By sequence similarity, most sigma factors are σ70-like (InterPro: IPR000943). They have four main regions (domains) that are generally conserved:...
from the public domain Pfam and InterPro: IPR010472 This article incorporates text from the public domain Pfam and InterPro: IPR015425 This article incorporates...
mitochrondrial Twinkle primase/helicase. Some DnaG-like (bacteria-like; InterPro: IPR020607) primases have been found in archaeal genomes. Eukaryote and...
from the public domain Pfam and InterPro: IPR002939 This article incorporates text from the public domain Pfam and InterPro: IPR001623 This article incorporates...