Contents
- Introduction to Protein Analysis
- Obtain a sequence of interest.
- Identify ORF's and translate into protein
- Identify Similar Proteins from the Databases
- Align your sequence vs similar sequences and look for Gene Families
- Determine the putative function of your protein
- Determine the putative structure of your protein
- Protein Structure Visualization Tools
- Other Interesting Things You Can Do With Proteins
IV. Identify Similar Proteins from the Databases
- Search against plain sequence databases
- Search the databases (GenBank, etc.) using BLAST 2.0 from NCBI
Other variations of the basic BLAST program are available from NCBI. They are listed below. They all use the same basic parameters that we have already discussed for BLAST 2.0.
- PSI-BLAST (Position-Specific Iterated BLAST)
long & horrible URL
Example
- Used for detecting weak similarities between protein sequences
- Program first does a gapped BLAST
- Information from the resulting hits is used to build a position specific scoring matrix
- The scoring matrix is used instead of the query for the next round of searching
- PSI-BLAST may be iterated until no new significant hits are returned
- PHI-BLAST (Pattern Hit Initiated BLAST)
same form as above
- Input is protein sequence and a motif that is found in that sequence
- Input patterns must be in Pro-Site format
- Statistical significance is still reported as an E value, but the calculation method is different
- PHI-BLAST is integrated with PSI-BLAST therefore results can be used to initiate a second round of searching
- Search against species-specific sequence databases
- Advanced BLAST
- 22 species available
- Copy and paste your protein or DNA sequence of interest
- Choose from the available organism menu.
- submit the sequence for analysis
- Organism Specific Databases
http://restools.sdsc.edu/biotools/biotools10.html
- One of the most complete listings available of all the various organism specific database projects around the world
- Search against the PDB database
- PDB's FASTA interface
http://www.rcsb.org/pdb/cgi/queryForm.cgi?PDBId=1&TypeSelection=1&ExperimentalTechnique=1&Fasta=1
- Advanced BLAST
- GenQuest Q Server
http://www.bis.med.jhmi.edu/Dan/gq/gq.form_rm.html
- Perform BLAST, FASTA, Smith-Waterman against
- PDB - Protein Databank sequences of proteins with solved structures
- Swissprot - a protein sequence database
- Prosite - a library of protein motifs
- The Genome Sequence Database (GSDB)
|
|