Pharos is the user interface to the Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) program funded by the National Institutes of Health (NIH) Common Fund. (Grant No. 5U54CA189205-02). The goal of KMC is to develop a comprehensive, integrated knowledge-base for the Druggable Genome (DG) to illuminate the uncharacterized and/or poorly annotated portion of the DG, focusing on four of the most commonly drug-targeted protein families:  G-protein-coupled receptors (GPCRs); nuclear receptors (NRs); ion channels (ICs); and kinases. For more information on opportunities in the druggable human genome see this poster

Based on modern web design principles the interface provides facile access to all data types collected by the KMC. Given the complexity of the data surrounding any target, efficient and intuitive visualization has been a high priority, to enable users to quickly navigate & summarize search results and rapidly identify patterns. A critical feature of the interface is the ability to perform flexible search and subsequent drill down of search results. Underlying the interface is a RESTful API that provides programmatic access to all KMC data, allowing for easy consumption in user applications.

The Help page provides more information on various topics.

For feedback and comments, please contact us at

Who's involved?

Pharos is developed at NCATS, together with collaborators from the University of New Mexico, Icahn School of Medicine, Mount Sinai, EMBL-EBI, the Novo Nordisk Foundation Center for Protein Research (U. Copenhagen) and the University of Miami.

The IDG TechDev network is the experimental side of IDG. TechDev research aims to develop and utilize technologies and streamlined experimental workflows for functional studies of poorly characterized and/or un-annotated genes and proteins, thereby providing high value data and knowledge for KMC. KMC in turn is tasked with providing guidance as to research priorities based on knowledge gaps and druggability likelihood analyses.

To find more details on our collaborators visit

What data is available?

The data available in Pharos is obtained from the Target Central Resource Database which integrates data from a variey of data sources including the Harmonizome, Jensen Lab datasets, EBI data sets (such as ChEMBL) and the Drug Target Ontology (DTO) from U. Miami. TCRD integration methodology involves importation by value or by reference to external sources, as informed by performance, provenance, and other design criteria. See here for a full listing of datasets incorporated in to TCRD.

The key data types represented in Pharos are listed below:

Small molecule data
including approved drug data, bioassay data
Protein data
including protein-protein interaction data
Disease data
from OMIM and Disease Ontology
Genomic data
including expression (protein, RNA), transcription factors and epigenomic associations
Phenotypic data
including mouse phenotypes, mouse/human orthologs and GWAS results
Funding data
Text data
including GeneRIF's and text-mined publications
including the Drug Target Ontology, Disease Ontology, PANTHER and GO.

Where can I download all the data from?

The current and past versions of TCRD are available from here

What license does Pharos use?

All data accessed from Pharos is made available under the CC BY-SA 4.0 license.

Where can I get the code?

The sources for the Pharos web interface are available from The repository provides instructions on building and installation

How can I cite Pharos?

If you use Pharos, please consider citing it as

Nguyen, D.-T., Mathias, S. et al, "Pharos: Collating Protein Information to Shed Light on the Druggable Genome", Nucl. Acids Res.i>, 2017, 45(D1), D995-D1002. DOI: 10.1093/nar/gkw1072