GO Subset Guide

GO subsets (also known as GO slims) are cut-down versions of the GO ontologies containing a subset of the terms in GO. They give a broad overview of the ontology content without the detail of the specific fine-grained terms.

How are GO subsets used?

GO subsets are particularly useful for giving a summary of the results of GO annotation of a genome, microarray, or cDNA collection when broad classification of gene product function is required. Some groups annotate to GO subsets that are relevant to their domain of interest, rather than using the full GO.

Who creates and maintains GO subsets?

GO subsets are created by users according to their needs, and may be specific to species or to particular areas of the ontologies. GO provides a generic GO subsets which, like the GO itself, is not species-specific, and which should be suitable for most purposes. Alternatively, users can create their own GO subsets or use one of the model organism-specific subsets integrated into GO. Please email the GO helpdesk for more information about creating and submitting your GO subsets.

GO subsets available

Maintained GO subsets

The GO subsets in this list are maintained as part of the GO flat file. The files available below for download are generated by script from that file.

Maintained GO subsets for download

Organism or Usage OBO format OWL format json format yaml format
GO slim AGR Developed by GO Consortium for the Alliance of Genomes Resources  goslim_agr.obo goslim_agr.owl goslim_agr.json goslim_agr.yaml
Generic GO subset Developed by GO Consortium goslim_generic.obo goslim_generic.owl goslim_generic.json goslim_generic.yaml
Aspergillus subset Developed by Aspergillus Genome Data goslim_aspergillus.obo goslim_aspergillus.owl goslim_aspergillus.json goslim_aspergillus.yaml
Candida albicans Developed by Candida Genome Database goslim_candida.obo goslim_candida.owl goslim_candida.json goslim_candida.yaml
Chembl Drug Target subset developed by Prudence Mutowo and Jane Lomax goslim_chembl.obo goslim_chembl.owl goslim_chembl.json goslim_chembl.yaml
Metagenomics subset Developed by Jane Lomax and the InterPro group goslim_metagenomics.obo goslim_metagenomics.owl goslim_metagenomics.json goslim_metagenomics.yaml
Mouse GO slim Developed by Mouse Genome Informatics goslim_mouse.obo goslim_mouse.owl goslim_mouse.json goslim_mouse.yaml
Plant subset Developed by The Arabidopsis Information Resource  goslim_plant.obo goslim_plant.owl goslim_plant.json goslim_plant.yaml
Protein Information Resource subset Developed by Darren Natale, PIR goslim_pir.obo goslim_pir.owl goslim_pir.json goslim_pir.yaml
Schizosaccharomyces pombe subset Developed by Val Wood, PomBase goslim_pombe.obo goslim_pombe.owl goslim_pombe.json goslim_pombe.yaml
Yeast subset Developed by Saccharomyces Genome Database goslim_yeast.obo goslim_yeast.owl goslim_yeast.json goslim_yeast.yaml

For internal checking purposes we also provide two “anti-slims”

Usage OBO format OWL format json format yaml format
Do not annotate: the set of high level terms that are useful for grouping, but should have no direct annotations goslim_gocheck_do_not_annotate.obo goslim_gocheck_do_not_annotate.owl goslim_gocheck_do_not_annotate.json goslim_gocheck_do_not_annotate.yaml
Do not manually annotate: the set of high level terms that are useful for grouping, but should have no direct annotations except from automated tools goslim_gocheck_do_not_manually_annotate.obo goslim_gocheck_do_not_manually_annotate.owl goslim_gocheck_do_not_manually_annotate.json goslim_gocheck_do_not_manually_annotate.yaml

Map2Slim option in OWLTools

Given a GO subset file, and a current ontology (in one or more files), the Map2Slim script will map a gene association file (containing annotations to the full GO) to the terms in the GO subset. This script is an option of OWLTools, and it can be used to either create a new gene association file, which contains the most pertinent GO slim accessions, or in count-mode, in which case it will give distinct gene product counts for each subset term.

Background information and details on how to download, install, and implement OWLTools, as well as instructions on how to run the Map2Slim script are available from the OWLTools Wiki at https://github.com/owlcollab/owltools/wiki/Map2Slim.

On the web

Similarly, there are a couple of online tools that may be of use. The first is the Princeton slimming tool, the second is the legacy AmiGO slimmer. It should be noted that online tools do often contain limitations and timeouts.


Improve this page