NCBI Datasets

How-to guides


Genes
  • Get gene metadata
  • Download a gene data package
  • Download a gene ortholog data package
  • Get representative protein sequences from Ortholog sets
Genomes
  • Get genome metadata
  • Download a genome data package
  • Download large genome data packages
Virus
  • Download SARS-CoV-2 genomes
  • Download SARS-CoV-2 protein sequences
Working with JSON
  • Working with JSON Lines data reports
Generated March 11, 2025

National Library of Medicine
8600 Rockville Pike
Bethesda, MD 20894
