What tools are in NMPDR?
NMPDR provides tools for comparative analysis of proteins, comparative analysis of genetic and functional contexts of proteins, and comparative analysis of whole genomes. NMPDR provides access to well-known tools written by other institutions, as well as tools developed specifically for NMPDR and SEED.
Tools for comparative analysis of protein sequences include BLAST analyses of orthologs in hundreds of genomes, a protein sequence alignment tool, and several tools for finding amino acid patterns.
Pre-computed BLAST results are available on each protein page via a button that will open a table of bidirectional best hits. For any protein focused on, the best BLAST hit from every subject genome in the database is listed in the table, if the focus protein is the best hit when its genome is queried with that subject protein. From the table of bidirectional best hits, it is possible to select orthologs and align them with clustalW in one click.
One-click links to tools for finding motifs and signal sequences are provided at the bottom of each protein page. These tools include TMpred, PSORT, and PPSearch, which locate transmembrane regions, predict cellular localization, and find signature amino acid patterns. There is also a link to NCBI's Position Specific Iterated BLAST (Psi-BLAST), which may discover protein family relationships among proteins with fairly low overall sequence similarity.
Tools for comparative analysis of protein context will allow you to examine both the genetic and functional context of your chosen protein.
Two environments are offered for examining the genetic context of any protein. One familiar environment is GBrowse, the generic genome browser tool developed for the GMOD project and implemented in many model organism databases. The NMPDR protein page provides a rich environment for comparative analysis. Every protein in NMPDR is displayed in a table and a graphic centered on the focus peg flanked by its neighbors within about 8 kb upstream and downstream on the chromosome. Moreover, the colors of the flanking proteins in the graphic correspond to their functional clustering score. Red proteins are simply near by, while the relative proximity of the blue proteins is conserved in other genomes, which implies a functional relationship.
To compare the genetic context of the focus protein with the five most homologous genomes, simply click on "Show Compare Regions." The colors of the genes in this display are shared among homologs in the different genomes, and the numbers are the rank-ordered frequency of co-localization with the focus peg, which is red. The number of genomes and the size of the regions displayed may be adjusted in this tool. To view the maximum number of homologous regions, click the PINS button.
Finally, the functional context of the focus gene may be explored by comparing protein families and by looking at subsystems. If a curator has added the focus protein to a subsystem, a link to the populated subsystem spreadsheet is provided.
Comparative analysis of all proteins in whole genomes is made possible by the Signature genes tool, linked in the left navigation pane. This tool will allow you to find proteins in common to two sets of genomes, or those that discriminate one genome from another. Keywords may also be included to narrow the search, for example, to all kinases shared by a group of genomes. The output may be sorted by protein function, and is downloadable as a tab-deliminated table for use with Excel.
|