637 lines
37 KiB
HTML
637 lines
37 KiB
HTML
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
|
|
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
|
|
<head>
|
|
<title>Querying GEO DataSets and GEO Profiles - GEO - NCBI</title>
|
|
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
|
|
<meta name="author" content="geo" />
|
|
<meta name="keywords" content="NCBI, national institutes of health, nih, database, archive, central, bioinformatics, biomedicine, geo, gene, expression, omnibus, chips, microarrays, oligonucleotide, array, sage, CGH" />
|
|
<meta name="description" content="Gene Expression Omnibus (GEO) is a database repository of high throughput gene expression data and hybridization arrays, chips, microarrays." />
|
|
<meta name="ncbiaccordion" content="collapsible: true, active: false" />
|
|
<meta name="ncbi_app" content="geo" />
|
|
<meta name="ncbi_pdid" content="documentation" />
|
|
<meta name="ncbi_page" content="Querying GEO DataSets and GEO Profiles" />
|
|
<link rel="shortcut icon" href="/geo/img/OmixIconBare.ico" />
|
|
<link rel="stylesheet" type="text/css" href="/geo/css/reset.css" />
|
|
<link rel="stylesheet" type="text/css" href="/geo/css/nav.css" />
|
|
<link rel="stylesheet" type="text/css" href="/geo/css/info.css" />
|
|
<script type="text/javascript" src="/core/jig/1.15.10/js/jig.min.js"></script>
|
|
<script type="text/javascript" src="/geo/js/dd_menu.js"></script>
|
|
<script type="text/javascript" src="/geo/js/info.js"></script>
|
|
<script type="text/javascript">
|
|
jQuery.getScript("/core/alerts/alerts.js", function () {
|
|
galert(['#crumbs_login_bar', 'body > *:nth-child(1)'])
|
|
});
|
|
</script>
|
|
<script type="text/javascript">
|
|
var ncbi_startTime = new Date();
|
|
</script>
|
|
</head>
|
|
<body id="info" class="qqtutorial">
|
|
<div id="all">
|
|
<div id="page">
|
|
<div id="header">
|
|
<div id="ncbi_logo">
|
|
<a href="/">
|
|
<img src="/geo/img/ncbi_logo.gif" alt="NCBI Logo" />
|
|
</a>
|
|
</div>
|
|
<div id="geo_logo">
|
|
<a href="/geo/"><img src="/geo/img/geo_main.gif" alt="GEO Logo" /></a>
|
|
</div>
|
|
</div>
|
|
<div id="nav_bar">
|
|
<ul id="geo_nav_bar">
|
|
<li><a href="#">GEO Publications</a>
|
|
<ul class="sublist">
|
|
<li><a href="/geo/info/GEOHandoutFinal.pdf">Handout</a></li>
|
|
<li><a href="/pmc/articles/PMC10767856/">NAR 2024 (latest)</a></li>
|
|
<li><a href="/pmc/articles/PMC99122/">NAR 2002 (original)</a></li>
|
|
<li><a href="/pmc/?term=10767856,4944384,3531084,3341798,3013736,2686538,2270403,1669752,1619900,1619899,539976,99122">All publications</a></li>
|
|
</ul>
|
|
</li>
|
|
<li><a href="/geo/info/faq.html">FAQ</a></li>
|
|
<li><a href="/geo/info/MIAME.html" title="Minimum Information About a Microarray Experiment">MIAME</a></li>
|
|
<li><a href="mailto:geo@ncbi.nlm.nih.gov">Email GEO</a></li>
|
|
</ul>
|
|
</div>
|
|
<div id="crumbs_login_bar"><a title="NCBI home page" href="/">NCBI</a> »
|
|
<a id="curr_page" title="GEO home page" href="/geo/">GEO</a> »
|
|
<a title="GEO documentation guide" href="/geo/info/">Info</a> »
|
|
<span>Querying GEO DataSets and GEO Profiles</span><span id="login_status"><a href="/geo/submitter/" title="Click here to login. You need to do this only if you want to edit the contact information, submit data, see your unreleased data, or work with data already submitted by you. You do not need to login if you are here just to browse through public holdings">Login</a></span></div>
|
|
<div id="content">
|
|
|
|
<a name="top" id="top"></a>
|
|
<h1>Querying GEO DataSets and GEO Profiles</h1>
|
|
|
|
<ul class="page_menu">
|
|
<li><a href="#Qex">Quick examples</a></li>
|
|
<li><a href="#conq">How to construct queries</a></li>
|
|
<li><a href="#fields">Tables of query fields and examples</a></li>
|
|
</ul>
|
|
|
|
<div class="tabs" id="qex">
|
|
<a name="Qex" id="Qex"></a>
|
|
<h2>Quick examples</h2>
|
|
|
|
<div class="jig-ncbitabs">
|
|
|
|
<ul>
|
|
<li><a href="#datasets" id="datasets_tab">GEO DataSets</a></li>
|
|
<li><a href="#profiles" id="profiles_tab">GEO Profiles</a></li>
|
|
</ul>
|
|
|
|
<div id="datasets">
|
|
<div class="search-examples">
|
|
|
|
<p class="intro">
|
|
This database stores original submitter-supplied study descriptions, as well as curated gene expression DataSets.
|
|
DataSets form the basis of GEO's advanced data display and analysis tools, including gene expression profile charts and clusters.
|
|
</p>
|
|
|
|
<h3>Search Examples:</h3>
|
|
|
|
<table>
|
|
<thead>
|
|
<tr>
|
|
<th>Search by...</th>
|
|
<th>Search text</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody>
|
|
<tr>
|
|
<td>Free text</td>
|
|
<td><a href="/gds?term=smoking+cancer">smoking cancer</a></td>
|
|
</tr>
|
|
<tr>
|
|
<td>Keywords and species</td>
|
|
<td>
|
|
<a href="/gds?term=(smok*+OR+diet)+AND+(mammals[organism]+NOT+human[organism])">
|
|
(smok* OR diet) AND (mammals[organism] NOT human[organism])
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Studies in the
|
|
<a href="/geo/roadmap/epigenomics/">
|
|
NIH Roadmap Epigenomics project
|
|
</a>
|
|
</td>
|
|
<td>
|
|
<a href="/gds?term="roadmap epigenomics"[Project]">
|
|
"roadmap epigenomics"[Project]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Study type</td>
|
|
<td>
|
|
<a href="/gds?term=expression+profiling+by+high+throughput+sequencing[DataSet+Type]">
|
|
"expression profiling by high throughput sequencing"[DataSet Type]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Studies with between 100 and 500 samples</td>
|
|
<td>
|
|
<a href="/gds?term=100:500[Number+of+Samples]">
|
|
100:500[Number of Samples]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Studies with CEL files</td>
|
|
<td>
|
|
<a href="/gds?term=cel[Supplementary+Files]">
|
|
"cel"[Supplementary Files]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>DataSets that have 'age' as an experimental variable</td>
|
|
<td>
|
|
<a href="/gds?term=age[Subset+Variable+Type]">
|
|
"age"[Subset Variable Type]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Author</td>
|
|
<td>
|
|
<a href="/gds?term=smith+a[Author]">
|
|
smith a[Author]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Published between January and June 2007</td>
|
|
<td>
|
|
<a href="/gds?term=2007/01:2007/06[Publication+Date]">
|
|
2007/01:2007/06[Publication Date]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Platform accession</td>
|
|
<td>
|
|
<a href="/gds?term=GPL570">
|
|
GPL570
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Studies with PubMed identifiers</td>
|
|
<td>
|
|
<a href="/gds?term=gds+pubmed[Filter]">
|
|
"gds pubmed"[Filter]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
|
|
<!--p>
|
|
There is a tool under the Preview/Index tab to help construct fielded queries.
|
|
</p-->
|
|
</div>
|
|
|
|
</div>
|
|
<div id="profiles">
|
|
<div class="search-examples">
|
|
<p class="intro">
|
|
This database stores individual gene expression profiles from curated DataSets.
|
|
Search for profiles of interest based on gene annotation or pre-computed profile characteristics.
|
|
</p>
|
|
|
|
<h3>Search Examples:</h3>
|
|
|
|
<table>
|
|
<thead>
|
|
<tr>
|
|
<th>Search by...</th>
|
|
<th>Search text</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody>
|
|
<tr>
|
|
<td>Free text</td>
|
|
<td><a href="/geoprofiles?term=smoking+P450">smoking P450</a></td>
|
|
</tr>
|
|
<tr>
|
|
<td>Gene symbol</td>
|
|
<td><a href="/geoprofiles?term=CYP1A1[Gene+Symbol]">CYP1A1[Gene Symbol]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<td>Gene symbols in DataSets that contain specific keywords</td>
|
|
<td>
|
|
<a href="/geoprofiles?term=(CYP1A1[Gene+Symbol]+OR+ME1[Gene+Symbol])+AND+(smok*+OR+diet)">
|
|
(CYP1A1[Gene Symbol] OR ME1[Gene Symbol]) AND (smok* OR diet)
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Partial gene name in a specific DataSet</td>
|
|
<td>
|
|
<a href="/geoprofiles?term=kinase[Gene+Description]+AND+GDS182">
|
|
kinase[Gene Description] AND GDS182
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>GenBank accession</td>
|
|
<td>
|
|
<a href="/geoprofiles?term=NM_014033">
|
|
NM_014033
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Gene Ontology(GO) term in a specific DataSet</td>
|
|
<td>
|
|
<a href="/geoprofiles?term=apoptosis[Gene+Ontology]+AND+GDS182">
|
|
apoptosis[Gene Ontology] AND GDS182
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Chromosome region and species</td>
|
|
<td>
|
|
<a href="/geoprofiles?term=(8[Chromosome]+AND+10000:3000000[Base+Position])+AND+mouse[organism]">
|
|
(8[Chromosome] AND 10000:3000000[Base Position]) AND mouse[organism]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Genes that show subset effects in DataSets that examine the effect of an agent</td>
|
|
<td>
|
|
<a href="/geoprofiles?term=agent[Flag+Information]+AND+"value+subset+effect"[Flag+Type]">
|
|
agent[Flag Information] AND "value subset effect"[Flag Type]
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Platform accession</td>
|
|
<td>
|
|
<a href="/geoprofiles?term=GPL570">
|
|
GPL570
|
|
</a>
|
|
</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
|
|
</div>
|
|
|
|
</div>
|
|
</div>
|
|
|
|
<a name="conq" id="conq"></a>
|
|
|
|
<h2 class="conq">How to construct queries <a title="Back to top" class="arrow" href="#top" style="margin-top: -4px;"></a></h2>
|
|
|
|
<p class="conq">
|
|
<a href="/gds/">GEO DataSets</a> and
|
|
<a href="/geoprofiles/">GEO Profiles</a>
|
|
are part of NCBI's
|
|
<a href="/gquery/">network of Entrez databases</a>.
|
|
As with these other databases, data of interest may be located simply by entering keywords into the
|
|
<a href="/gds/">GEO DataSets</a>
|
|
or <a href="/geoprofiles/">GEO Profiles</a> search boxes.
|
|
The Advanced Search and Limits pages, linked at the head of the GEO DataSets and GEO Profiles pages,
|
|
assist greatly in the construction of complex queries.
|
|
To construct a complex query, specify the search terms, their fields, and the Boolean operations
|
|
to perform on the terms using the following syntax:
|
|
|
|
<code>term [field] OPERATOR term [field]</code>
|
|
|
|
where <span class="code">term</span> is the search terms, <span class="code">field</span> is the search field, and <span class="code">OPERATOR</span>
|
|
is the Boolean operator ('AND', 'OR', 'NOT' must be capitalized). <br />
|
|
Additional query construction notes and features are provided in the following table:
|
|
</p>
|
|
|
|
<div class="search-examples">
|
|
<table>
|
|
<thead>
|
|
<tr>
|
|
<th>Notes and features</th><th>Example</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody>
|
|
<tr>
|
|
<td>Complete listings and descriptions of all supported fields are provided in the <a href="#gdsfields">tables below</a>.</td>
|
|
<td>a search example for each field is provided within the tables</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Fields may be specified either by their full name or an alias. Full names and aliases are listed in the <a href="#gdsfields">tables below</a>.</td><td><a href="/gds?term=gds[Entry+Type]">gds[Entry Type]</a> and <a href="/gds?term=gds[ETYP]">gds[ETYP]</a> perform the same search </td>
|
|
</tr>
|
|
<tr>
|
|
<td>
|
|
Some fields have a fixed list of allowed search terms, others are free text. The <a href="#gdsfields">tables below</a> indicate which fields have fixed lists.
|
|
Lists of allowed terms may be browsed on the Advanced Search page by selecting the relevant field from the drop-down menu and clicking 'Show Index'.
|
|
</td>
|
|
<td>
|
|
'age' is a fixed term for the Subset Variable Type field<br /> <a href="/gds?term=age[Subset+Variable+Type]">age[Subset Variable Type]</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Use quotes to indicate a phrase.</td><td><a href="/gds?term=salt+stress">salt stress</a> <br />retrieves studies that mention both salt and stress anywhere in the description, whereas <br /><a href="/gds?term="salt+stress"">"salt stress"</a> <br />retrieves studies where the words exist as a phrase </td>
|
|
</tr>
|
|
<tr>
|
|
<td>Use parentheses to properly combine multiple search criteria. The terms inside the parentheses are processed as a unit and then incorporated into the overall search.</td><td> <a href="/gds?term=human[organism]+AND+(smok*+OR+diet)">human[organism] AND (smok* OR diet)</a>
|
|
<br />specifically retrieves human studies that mention either smoking or diet, whereas<br />
|
|
<a href="/gds?term=human[organism]+AND+smok*+OR+diet">human[organism] AND smok* OR diet</a><br />
|
|
also returns all studies that mention diet, regardless of organism</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Use an asterisk to expand your search with a wildcard. Wildcards can be placed at the beginning or end of a text string, but not in the middle.</td><td> <a href="/gds?term=smok*">smok*</a> will retrieve documents that contain words like smoke, smoking or smoker </td>
|
|
</tr>
|
|
<tr>
|
|
<td>Use a colon to indicate a range. </td><td><a href="/gds?term=2007/01:2007/06[Publication+Date]">2007/01:2007/06[Publication Date]</a><br /> retrieves studies published between January and June 2007</td>
|
|
</tr>
|
|
<tr>
|
|
<td>Use the 'History' section at the foot of the Advanced Search pages to combine previous queries or find the intersection of multiple queries. Each query you have performed recently is assigned a specific number which can be included within the search statement.</td><td> #1 NOT #2<br /> (#1 OR #2) AND human[organism]</td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
|
|
<a name="fields" id="fields"></a>
|
|
<a name="gdsfields" id="gdsfields"></a>
|
|
<a name="geopfields" id="geopfields"></a>
|
|
<a title="Back to top" class="arrow" href="#top"></a>
|
|
<h2>Query fields and examples</h2>
|
|
|
|
<div class="tabs">
|
|
<div class="jig-ncbitabs">
|
|
|
|
<ul>
|
|
<li><a href="#datasets-table" id="datasets_tab-table">GEO DataSets</a></li>
|
|
<li><a href="#profiles-table" id="profiles_tab-table">GEO Profiles</a></li>
|
|
</ul>
|
|
|
|
<div id="datasets-table">
|
|
<table class="query_fields">
|
|
<thead>
|
|
<tr>
|
|
<th>Field full name</th><th>Field aliases</th><th>Description</th><th>Search term values and rules</th><th class="example">Example</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody>
|
|
<tr>
|
|
<th>All Fields</th>
|
|
<td>ALL, *</td><td>All terms from all searchable fields. Default field.</td><td>free text, wildcard (*) supported </td><td>Find any record that contains the word 'cancer'<br /><a href="/gds?term=cancer[All+fields]">cancer[All fields]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Author</th>
|
|
<td>AUTH, AU, AUTHOR NAME</td><td>Contributors or authors associated with the study</td><td>free text, wildcard (*) supported, author initials are optional</td><td>Find records authored by A Smith<br /><a href="/gds?term=smith+a[Author]">smith a[Author]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>DataSet Type</th>
|
|
<td>GTYP, gdsType</td><td>DataSet or Series type</td>
|
|
<td>
|
|
fixed list, check <a href="/gds/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find all studies that examine gene expression by high throughput sequencing<br /><a href="/gds?term=expression+profiling+by+high+throughput+sequencing[DataSet Type]">expression profiling by high throughput sequencing[DataSet Type]</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<th>Description</th>
|
|
<td>DESC, DSC, DESCR</td><td>Text provided in the DataSet, Series or Sample description, summary and other metadata fields</td><td>free text, wildcard (*) supported </td><td>Find studies that contain smoking-related terms in their descriptions<br /><a href="/gds?term=smok*[DESC]">smok*[DESC]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Entry Type</th>
|
|
<td>ETYP, entryType</td><td>Record type</td><td>fixed list, use gds (DataSet), gse (Series) or gpl (Platform)</td><td>Find only DataSet records<br /><a href="/gds?term=gds[Entry+Type]">gds[Entry Type]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Filter</th>
|
|
<td>FILT, FLTR, SUBSET, SB, FIL</td><td>Filters for records that have links to other NCBI databases</td>
|
|
<td>
|
|
fixed list, check <a href="/gds/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find records that have PubMed links<br /><a href="/gds?term=gds+pubmed[Filter]">gds pubmed[Filter]</a>
|
|
</td>
|
|
</tr>
|
|
<tr>
|
|
<th>GEO Accession</th>
|
|
<td>ACCN, accession</td><td>GEO accession number</td><td>valid DataSet (GDS), Platform (GPL), Sample (GSM) or Series (GSE) accession</td><td>Find all studies performed on Platform GPL570<br /><a href="/gds?term=GPL570[GEO+Accession]">GPL570[GEO Accession]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>MeSH Terms</th>
|
|
<td>MESH, MH, SUBH, SH, Subheading</td><td>Medical Subject Headings (MeSH) terms</td><td><a href="https://www.nlm.nih.gov/mesh/meshhome.html">Medical Subject Headings</a> (MeSH) terms, wildcard (*) supported </td><td>Find records that have MeSH term methylation<br /><a href="/gds?term=methylation[MeSH+Terms]">methylation[MeSH Terms]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Number of Platform Probes</th>
|
|
<td>NPRO, n_probes</td><td>Number of Platform probe IDs</td><td>integer, range function supported</td><td>Find Platforms that have over 1 million probes<br /><a href="/gds?term=1000000:100000000[Number+of+Platform+Probes]">1000000:100000000[Number of Platform Probes]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Number of Samples</th>
|
|
<td>NSAM, n_samples</td><td>Number of Samples in the DataSet or Series</td><td>integer, range function supported</td><td>Find studies with between 100 and 500 samples<br /><a href="/gds?term=100:500[Number+of+Samples]">100:500[Number of Samples]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Organism</th>
|
|
<td>ORGN, PORGN, primary organism</td><td>Name of the organism</td><td><a href="/Taxonomy/">NCBI taxonomy</a> terms, wildcard (*) supported, all levels in the taxonomy lineage and common names are indexed</td><td>Find studies performed on mouse<br /><a href="/gds?term=Mus+musculus[Organism]">Mus musculus[Organism]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Platform Technology Type</th>
|
|
<td>PTYP, ptechType</td><td>Platform type</td><td>fixed list, check <a href="/gds/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find all studies performed with next-generation sequencing technology<br /><a href="/gds?term=high+throughput+sequencing[Platform+Technology+Type]">high throughput sequencing[Platform Technology Type]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Project</th>
|
|
<td>PROJ</td><td>Featured project data</td><td>fixed list, use roadmap epigenomics, encode, pilot encode, or modencode</td><td>Find studies in the NIH Roadmap Epigenomics project <br /><a href="/gds?term=roadmap+epigenomics[Project]">roadmap epigenomics[Project]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Publication Date</th>
|
|
<td>PDAT, DP</td><td>Date on which record was released</td><td>format YYYY/MM, range function supported</td><td>Find studies published between January and June 2007<br /> <a href="/gds?term=2007/01:2007/06[Publication+Date]">2007/01:2007/06[Publication Date]</a> </td>
|
|
</tr>
|
|
<tr>
|
|
<th>Related Platform</th>
|
|
<td>RGPL, relatedGPL</td><td>Retrieves the Plaform(s) for a specified DataSet or Series</td><td>valid DataSet (GDS) or Series (GSE) accession</td><td>Find Platforms related to GSE22474<br /><a href="/gds?term=GSE22474[Related+Platform]">GSE22474[Related Platform]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Related Series</th>
|
|
<td>RGSE, relatedGSE</td><td>Retrieves the Series for a specified DataSet or Platform</td><td>valid DataSet (GDS) or Platform (GPL) accession</td><td>Find Series related to GPL570<br /><a href="/gds?term=GPL570[Related+Series]">GPL570[Related Series]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Reporter Identifier</th>
|
|
<td>GEID, seqacc, clone, orf, unigene, Gene Identifier</td><td>Name or identifier of Platform probe; pertains only to Platforms that have been subjected to re-annotation pipeline</td><td>free text, wildcard (*) supported </td><td>Find DataSets that include a probe corresponding to Arg1<br /><a href="/gds?term=Arg1[Reporter+Identifier]">Arg1[Reporter Identifier]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Sample Source</th>
|
|
<td>SRC, source</td><td>The source of the biological material of the Sample; warning: submitter-supplied field, not curated</td><td>free text, wildcard (*) supported </td><td>Find studies with samples from brain<br /><a href="/gds?term=brain[Sample+Source]">brain[Sample Source]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Sample Type</th>
|
|
<td>STYP, sampType</td><td>Sample type or molecule</td><td>fixed list, check <a href="/gds/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find studies that use protein samples<br /><a href="/gds?term=protein[Sample+Type]">protein[Sample Type]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Sample Value Type</th>
|
|
<td>VTYP, valType</td><td>Sample value type; pertains only to curated DataSets</td><td>fixed list, check <a href="/gds/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find DataSets with log ratio sample values<br /><a href="/gds?term=log+ratio[Sample+Value+Type]">log ratio[Sample Value Type]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Submitter Institute</th>
|
|
<td>INST, institute</td><td>Institute or organization as given in submitter account</td><td>free text</td><td>Find data submitted by the Broad Institute<br /><a href="/gds?term=Broad+Institute[Institute]">Broad Institute[Institute]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Subset Description</th>
|
|
<td>SSDE, SSDESC</td><td>DataSet subset descriptions</td><td>free text, wildcard (*) supported </td><td>Find DataSets that include the term 'male' in subset description<br /><a href="/gds?term=male[Subset+Description]">male[Subset Description]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Subset Variable Type</th>
|
|
<td>SSTP, SSTYPE</td><td>Name of DataSet experimental variable</td><td>fixed list, check <a href="/gds/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find DataSets that have 'age' as an experimental variable<br /> <a href="/gds?term=age[Subset+Variable+Type]">age[Subset Variable Type]</a> </td>
|
|
</tr>
|
|
<tr>
|
|
<th>Supplementary Files</th>
|
|
<td>SFIL, SFILE, suppFile</td><td>Supplementary file type names</td><td>free text, wildcard (*) supported </td><td>Find studies that have Affymetrix CEL files<br /><a href="/gds?term=cel[Supplementary+Files]">cel[Supplementary Files]</a> </td>
|
|
</tr>
|
|
<tr>
|
|
<th>Tag Length</th>
|
|
<td>TAGL, taglength</td><td>SAGE or MPSS tag length in base pairs</td><td>integer</td><td>Find 10 base pair SAGE data<br /><a href="/gds?term=10[Tag+Length]">10[Tag Length]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Title</th>
|
|
<td>TITL, TITLE, TI</td><td>Text from titles of DataSets, Series, Platforms, and Samples</td><td>free text, wildcard (*) supported </td><td>Find records where 'Affymetrix' appears in a title<br /><a href="/gds?term=Affymetrix[Title]">Affymetrix[Title]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Update Date</th>
|
|
<td>UDAT</td><td>Date on which record was last updated</td><td>format YYYY/MM, range function supported</td><td>Find records updated during June 2010<br /><a href="/gds?term=2010/06[Update+Date]">2010/06[Update Date]</a></td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
|
|
<div id="profiles-table">
|
|
<table class="query_fields">
|
|
<thead>
|
|
<tr>
|
|
<th>Field full name</th><th>Field aliases</th><th>Description</th><th>Search term values and rules</th><th class="example">Example</th>
|
|
</tr>
|
|
</thead>
|
|
<tbody>
|
|
<tr>
|
|
<th>All Fields</th>
|
|
<td>ALL, *</td><td>All terms from all searchable fields. Default field.</td><td>free text, wildcard (*) supported </td><td>Find P450 genes in DataSets that investigate smoking <br /><a href="/geoprofiles?term=smok*+AND+P450">smok* AND P450</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Annotation Type</th>
|
|
<td>ATYP, annot_type</td><td>Source of annotation</td><td>fixed list, use gene, nucleotide, unigene or protein</td><td>Find profiles with Gene-based annotation<br /><a href="/geoprofiles?term=gene[Annotation+Type]">gene[Annotation Type]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Base Position</th>
|
|
<td>CPOS, CPOSITION, CHRPOS</td><td>Base pair position on chromosome</td><td>integer, range function supported, must be used in conjuction with Chromosome field</td><td>Find profiles that lie between base positions 10000 to 3000000 on chromosome 8 in mouse<br /><a href="/geoprofiles?term=(8[Chromosome]+AND+10000:3000000[Base+Position])+AND+mouse[organism]">(8[Chromosome] AND 10000:3000000[Base Position]) AND mouse[organism]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Chromosome</th>
|
|
<td>CHR, CHROMOSOME, CH, CHROM</td><td>Chromosome number or name</td><td>chromosome number or name</td><td>Find profiles that lie between base positions 10000 to 3000000 on chromosome 8 in mouse<br /><a href="/geoprofiles?term=(8[Chromosome]+AND+10000:3000000[Base+Position])+AND+mouse[organism]">(8[Chromosome] AND 10000:3000000[Base Position]) AND mouse[organism]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>DataSet Type</th>
|
|
<td>GTYP, gdsType</td><td>DataSet type</td><td>fixed list, check <a href="/geoprofiles/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find MPSS profiles<br /><a href="/geoprofiles?term=expression+profiling+by+mpss[DataSet+Type]">expression profiling by mpss[DataSet Type]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Filter</th>
|
|
<td>FILT, FLTR, SUBSET, SB, FIL</td><td>Filters for records that have links to other NCBI databases</td><td>fixed list, check <a href="/geoprofiles/advanced/">Advanced Search</a> Preview/Index page for list of indexed terms</td><td>Find profiles that have links to NCBI's Gene database<br /><a href="/geoprofiles?term=geo+gene[Filter]">geo gene[Filter]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Flag Information</th>
|
|
<td>FINF, FLAG_INFO, NOTE</td><td>Profiles of specific <i>subset types</i> and for which a <i>subset effect</i> is found. GEO DataSets are partitioned into subsets that reflect
|
|
experimental design. Profiles are flagged as having subset effects if they display differential expression across
|
|
experimental variables. CAUTION: The subset effect scoring method is ad hoc, taking into
|
|
account group medians, means, deviation inside the groups, penalties and
|
|
arbitrary cutoff thresholds. This flag is simply an attempt to give
|
|
potentially differentially-regulated genes higher visibility, and is not
|
|
intended to provide an absolute determination of significance.</td><td>fixed list, check <a href="/geoprofiles/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find profiles that exhibit subset effects with respect to age or development stage<br /><a href="/geoprofiles?term=age[Flag+Information]+OR+development+stage[Flag+Information]">age[Flag Information] OR development stage[Flag Information]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Flag Type</th>
|
|
<td>FTYP, FLAG_TYPE</td><td>Profiles that exhibit specific types of <i>subset effects</i>.
|
|
GEO DataSets are partitioned into subsets that reflect
|
|
experimental design. Profiles are flagged as having subset effects if they display differential expression across
|
|
experimental variables. CAUTION: The subset effect scoring method is ad hoc, taking into
|
|
account group medians, means, deviation inside the groups, penalties and
|
|
arbitrary cutoff thresholds. This flag is simply an attempt to give
|
|
potentially differentially-regulated genes higher visibility, and is not
|
|
intended to provide an absolute determination of significance.</td><td>fixed list, check <a href="/geoprofiles/advanced/">Advanced Search</a> for list of indexed terms</td><td>Find profiles that exhibit rank subset effects <br /><a href="/geoprofiles?term=rank+subset+effect[Flag+Type]">rank subset effect[Flag Type]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>GDS Text</th>
|
|
<td>GDST, GDStxt</td><td>Text from DataSet title and summary</td><td>free text, wildcard (*) supported </td><td>Find profiles for Datasets that investigate muscular dystrophy<br /><a href="/geoprofiles?term=muscular+dystrophy[GDS+Text]">muscular dystrophy[GDS Text]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>GEO Accession</th>
|
|
<td>ACCN, accession</td><td>GEO accession number</td><td>valid DataSet (GDS), Platform (GPL), Sample (GSM) or Series (GSE) accession</td><td>Find profiles for Platform GPL570<br /><a href="/geoprofiles?term=GPL570[GEO+Accession]">GPL570[GEO Accession]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>GEO Description/Title Text</th>
|
|
<td>GEOT, TI, GEOtxt</td><td>Text provided in the DataSet or Series description, title and other metadata fields</td><td>free text, wildcard (*) supported </td><td>Find profiles from studies that examine aspirin<br /><a href="/geoprofiles?term=aspirin[GEO Description/Title+Text]">aspirin[GEO Description/Title Text]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>GI</th>
|
|
<td>GI</td><td>Mapped GenBank Identifier</td><td>integer</td><td>Find profiles for GenBank Identifier 89145416<br /><a href="/geoprofiles?term=89145416[GI]">89145416[GI]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Gene Description</th>
|
|
<td>GDSC, GEND, aliases, GENE, GeneDesc</td><td>Gene description and aliases from Gene, title from UniGene. </td><td>free text, wildcard (*) supported </td><td>Find kinase genes in GDS182<br /><a href="/geoprofiles?term=kinase[Gene+Description]+AND+GDS182">kinase[Gene Description] AND GDS182</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Gene Ontology</th>
|
|
<td>GO</td><td>Gene Ontology terms</td><td><a href="http://www.ebi.ac.uk/GOA/">Gene Ontology</a> (GO) terms, wildcard (*) supported </td><td>Find apoptosis genes in GDS182<br /><a href="/geoprofiles?term=apoptosis[Gene Ontology]+AND+GDS182">apoptosis[Gene Ontology] AND GDS182</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Gene Symbol</th>
|
|
<td>SYMB, GeneSymbol</td><td>Gene Symbol from Gene or UniGene</td><td>free text, wildcard (*) supported </td><td>Find CYP1A1 gene<br /><a href="/geoprofiles?term=CYP1A1[Gene Symbol]">CYP1A1[Gene Symbol]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>ID_REF</th>
|
|
<td>ID, ID_REF</td><td>ID from GEO Platform, SAGE tag, Affy ProbeSet ID</td><td>free text, wildcard (*) supported </td><td>Find profiles for Affymetrix probeset ID 218973_at<br /><a href="/geoprofiles?term=218973_at[ID_REF]">218973_at[ID_REF]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Max Value Rank</th>
|
|
<td>RMAX, RNKMX</td><td>The maximum value percentile rank for any Sample within DataSet</td><td>integer, 0-100, range function supported</td><td>Find profiles where the maximum rank percentile is in the 1st percentile (ie, genes with low expression)<br /><a href="/geoprofiles?term=1[Max+Value+Rank]">1[Max Value Rank]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Min Value Rank</th>
|
|
<td>RMIN, RNKMN</td><td>The minimum value percentile rank for any Sample within DataSet</td><td>integer, 0-100, range function supported</td><td>Find profiles where the minimum rank percentile is in the 100th percentile (ie, highly expressed genes)<br /><a href="/geoprofiles?term=100[Min+Value+Rank]">100[Min Value Rank]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Number of Samples</th>
|
|
<td>NSAM, n_samples</td><td>Number of Samples in the DataSet</td><td>integer, range function supported</td><td>Find profiles with between 100 and 200 samples<br /><a href="/geoprofiles?term=100:200[Number+of+Samples]">100:200[Number of Samples]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Organism</th>
|
|
<td>ORGN</td><td>Name of the organism</td><td><a href="/Taxonomy/">NCBI taxonomy</a> terms, wildcard (*) supported, all levels in the taxonomy lineage and common names are indexed</td><td>Find mouse profiles<br /><a href="/geoprofiles?term=Mus+musculus[Organism]">Mus musculus[Organism]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Platform Reporter Type</th>
|
|
<td>RTYP, rep_type</td><td>Platform reporter type used for annotation</td><td>fixed list, check <a href="/geoprofiles/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find profiles where a CLONE ID is the basis for annotation<br /><a href="/geoprofiles?term=Mus+musculus[Organism]">Mus musculus[Organism]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Ranked Standard Deviation</th>
|
|
<td>RSTD, RNSTD</td><td>Percentile rank of profile standard deviation compared to all other profiles in a DataSet</td><td>integer, 0-100, range function supported</td><td>Find profiles with a high level of standard deviation<br /><a href="/geoprofiles?term=100[Ranked+Standard+Deviation]">100[Ranked Standard Deviation]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Reporter Identifier</th>
|
|
<td>NAME, identifier, Gene Identifier</td><td>Name or identifier of Platform probe</td><td>free text, wildcard (*) supported </td><td>Find profiles that include a probe corresponding to Arg1<br /><a href="/geoprofiles?term=D00636[Reporter+Identifier]">D00636[Reporter Identifier]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Sample Source</th>
|
|
<td>SRC, source</td><td>The source of the biological material of the Sample; warning: submitter-supplied field, not curated</td><td>free text, wildcard (*) supported </td><td>Find profiles with samples from brain<br /><a href="/geoprofiles?term=brain[Sample+Source]">brain[Sample Source]</a></td>
|
|
</tr>
|
|
<tr>
|
|
<th>Sample Value Type</th>
|
|
<td>VTYP, value_type</td><td>Sample value type</td><td>fixed list, check <a href="/geoprofiles/advanced/">Advanced Search</a> page for list of indexed terms</td><td>Find profiles with log ratio sample values<br /><a href="/geoprofiles?term=log+ratio[Sample+Value+Type]">log ratio[Sample Value Type]</a></td>
|
|
</tr>
|
|
</tbody>
|
|
</table>
|
|
</div>
|
|
|
|
</div>
|
|
</div>
|
|
|
|
</div>
|
|
</div>
|
|
<div id="last_mod">
|
|
Last modified: July 16, 2024</div>
|
|
<div id="footer">
|
|
<span class="helpbar">|<a href="https://www.nlm.nih.gov"> NLM </a>|<a href="https://www.nih.gov"> NIH </a>|<a href="mailto:geo@ncbi.nlm.nih.gov"> Email GEO </a>|<a href="/geo/info/disclaimer.html"> Disclaimer </a>|<a href="https://www.nlm.nih.gov/accessibility.html"> Accessibility </a>|<a href="https://www.hhs.gov/vulnerability-disclosure-policy/index.html"> HHS Vulnerability Disclosure </a>|
|
|
</span>
|
|
</div>
|
|
</div>
|
|
<script type="text/javascript" src="https://www.ncbi.nlm.nih.gov/portal/portal3rc.fcgi/rlib/js/InstrumentOmnitureBaseJS/InstrumentNCBIBaseJS/InstrumentPageStarterJS.js"></script>
|
|
</body>
|
|
</html>
|