Expasy

Last updated June 12, 2022

Expasy is an online bioinformatics resource operated by the SIB Swiss Institute of Bioinformatics. It is an extensible and integrative portal which provides access to over 160 databases and software tools and supports a range of life science and clinical research areas, from genomics, proteomics and structural biology, to evolution and phylogeny, systems biology and medical chemistry. The individual resources (databases, web-based and downloadable software tools) are hosted in a decentralised way by different groups of the SIB Swiss Institute of Bioinformatics and partner institutions.

Search engine

A query on Expasy provides:

parallel searches of SIB databases through a single query
aggregated search results from the complete set of more than 160 resources accessible from the portal.^[1]

Expasy provides up-to-date information from the most recent release of each resource.

The terms used in Expasy are based on the EDAM comprehensive ontology.^[2]
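
The parallel search and result aggregation described above can be illustrated with a minimal sketch. It is not Expasy's implementation: the resource names, the records, and the functions `search_resource` and `federated_search` are all hypothetical, and the independently hosted resources are replaced by in-memory lists so the example runs without network access.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins for independently hosted resources.
# Names and records are illustrative only, not real Expasy data.
RESOURCES = {
    "resource_a": ["insulin precursor", "insulin receptor", "glucagon"],
    "resource_b": ["insulin signalling pathway", "glycolysis"],
    "resource_c": ["haemoglobin alpha"],
}


def search_resource(name, query):
    """Return (resource name, matching records) for a single resource."""
    records = RESOURCES[name]
    hits = [r for r in records if query.lower() in r.lower()]
    return name, hits


def federated_search(query):
    """Send one query to every resource in parallel and aggregate the hits."""
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(search_resource, name, query) for name in RESOURCES]
        results = dict(f.result() for f in futures)
    # Keep only the resources that returned at least one hit.
    return {name: hits for name, hits in results.items() if hits}


if __name__ == "__main__":
    for name, hits in federated_search("insulin").items():
        print(f"{name}: {len(hits)} hit(s) -> {hits}")
```

In this sketch each resource is searched in its own worker thread, and the caller receives one combined result grouped by resource, which mirrors the idea of a single query fanning out to many decentralised resources.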

History

Expasy was created in August 1993. It was originally called ExPASy (Expert Protein Analysis System) and acted as a proteomics server for analyzing protein sequences and structures and two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) data.^[3] Among other resources, ExPASy hosted the protein sequence knowledgebase UniProtKB/Swiss-Prot and its computer-annotated supplement, UniProtKB/TrEMBL, before these moved to the UniProt website.^[citation needed]

ExPASy was the first life sciences website and among the first 150 websites in the world. As of 5 April 2007, ExPASy had been consulted 1 billion times since its installation on 1 August 1993.^[4]

In June 2011, it became the SIB ExPASy Bioinformatics Resources Portal: a diverse catalogue of bioinformatics resources developed by SIB groups.^[5] The current version of Expasy was released in October 2020.^[6]^[7]

Notes and references

[1] "SIB Resources". sib.swiss. Retrieved 27 October 2020.
[2] Ison J, Kalas M, Jonassen I, Bolser D, Uludag M, McWilliam H, et al. (May 2013). "EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats". Bioinformatics. 29 (10): 1325–32. doi:10.1093/bioinformatics/btt113. PMC 3654706. PMID 23479348.
[3] Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A (July 2003). "ExPASy: The proteomics server for in-depth protein knowledge and analysis". Nucleic Acids Research. 31 (13): 3784–8. doi:10.1093/nar/gkg563. PMC 168970. PMID 12824418.
[4] ExPASy: SIB Bioinformatics Resource Portal
[5] Artimo P, Jonnalagedda M, Arnold K, Baratin D, Csardi G, de Castro E, et al. (July 2012). "ExPASy: SIB bioinformatics resource portal". Nucleic Acids Research. 40 (Web Server issue): W597–603. doi:10.1093/nar/gks400. PMC 3394269. PMID 22661580.
[6] Duvaud S, Gabella C, Lisacek F, Stockinger H, Ioannidis V, Durinx C (13 April 2021). "Expasy, the Swiss Bioinformatics Resource Portal, as designed by its users". Nucleic Acids Research. 49 (W1): W216–W227. doi:10.1093/nar/gkab225. ISSN 0305-1048. PMC 8265094. PMID 33849055.
[7] "Discover the new Expasy.org, the Swiss Bioinformatics Resource Portal". sib.swiss. Retrieved 26 October 2020.
