Skip to main page content Skip to main page content

The PubMed Computed Authors RESTful API and FTP Download

The PubMed Computed Authors provides the most up-to-date large scale data of disambiguated author names in PubMed as an open resource freely available for API query and FTP download. It uses advanced machine learning algorithms to disambiguate individual authors for all PubMed articles and is updated regularly in a weekly basis.

Query via API

The API provides retrieval of an author,s publications by query of a PMID of any of his/her publications and the author's name in the form of "last name and first name initial".

https://www.ncbi.nlm.nih.gov/research/litsense-api/api/author/?query=[PMID] [Last Name] [First Name Initial]
For example:
https://www.ncbi.nlm.nih.gov/research/litsense-api/api/author/?query=37268776 Lu Z

For authors with multiple name forms, concatenate queries of different names using a comma "," as delimiter to retrieve all publications published in different names. For example:
https://www.ncbi.nlm.nih.gov/research/litsense-api/api/author/?query=20157491 Islamaj Dogan R, 20819859 Dogan RI, 37878810 Islamaj R

Publications can also be retrieved by the author's ORCID when the ORCID is available. For example:
https://www.ncbi.nlm.nih.gov/research/litsense-api/api/author/?orcid=0000-0001-9998-916X

FTP Download

The dataset of all disambiguated authors in PubMed articles is accessible for FTP download at:
https://ftp.ncbi.nlm.nih.gov/pub/lu/ComputedAuthors/

Each line in the dataset contains data for an disambiguated author in JSON format, for example:,

{"name": "LU+Z", "names": ["Lu,Zhiyong,Z"], "orcid": ["0000-0001-9998-916X"], "pmids": [17094225, 17094227, 17990498, 18237434, 18547432, ...]}