Skip to contents

STRING cross-reference the proteins with several databases (see "Details" section). By providing your input set o proteins (and optionally background or universe protein set), you can use this function to retrieve full set of terms (annotations) pertinent to your input proteins in each database, among with information for each term.

Usage

rba_string_annotations(
  ids,
  species = NULL,
  allow_pubmed = FALSE,
  split_df = TRUE,
  ...
)

Arguments

ids

Your protein ID(s). It is strongly recommended to supply STRING IDs. See rba_string_map_ids for more information.

species

Numeric: NCBI Taxonomy identifier; Human Taxonomy ID is 9606. (Recommended, but optional if your input is less than 100 IDs.)

allow_pubmed

logical: (default = FALSE) PubMed usually assigns a large number of reference publications to each protein. In order to reduce the output size, PubMed's results will be excluded from the results, unless stated otherwise (By setting this argument to TRUE).

split_df

(logical, default = TRUE), If TRUE, instead of one data frame, results from different categories will be split into multiple data frames based on their 'category'.

...

rbioapi option(s). See rba_options's arguments manual for more information on available options.

Value

A data frame which every row is an assigned terms and the columns are the terms category, description, number of genes, and other pertinent information.

Details

STRING currently maps to and retrieve enrichment results based on Gene Ontology (GO), KEGG pathways, UniProt Keywords, PubMed publications, Pfam domains, InterPro domains, and SMART domains.
Note that this function will return a full list of the terms containing your supplied proteins. To perform enrichment and only retrieve a enriched subset of the terms, use rba_string_enrichment.

Corresponding API Resources

"POST https://string-db.org/api/[output_format]/functional_annotation? identifiers=[your_identifiers]&[optional_parameters]"

References

  • Damian Szklarczyk, Annika L Gable, Katerina C Nastou, David Lyon, Rebecca Kirsch, Sampo Pyysalo, Nadezhda T Doncheva, Marc Legeay, Tao Fang, Peer Bork, Lars J Jensen, Christian von Mering, The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets, Nucleic Acids Research, Volume 49, Issue D1, 8 January 2021, Pages D605–D612, https://doi.org/10.1093/nar/gkaa1074

  • STRING API Documentation

  • Citations note on STRING website

Examples

# \donttest{
rba_string_annotations(ids = "TP53", species = 9606)
# }