Introduction to Bioinformatics - Final Exam Answers

No. Question Answer
1. The following sequence has a problem with it. Using the tools available at NCBI, identify what the problem is. The sequence contains both vector sequence and gene sequence.
2. What is the UniGene cluster ID number? Hs.154210
3. What is the LocusLink locus for this gene? 1901
4. What is the open reading frame for this gene? +2 reading frame, 251 to 1396 bp
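As a sanity check on answer 4, the frame and length can be recomputed from the coordinates alone. This is a minimal sketch, assuming the coordinates are 1-based and inclusive on the forward strand; the helper name orf_summary is hypothetical and is not part of any NCBI tool.

```python
def orf_summary(start, end):
    """Summarise an ORF given 1-based, inclusive forward-strand coordinates."""
    length = end - start + 1
    # Frame +1 starts at base 1, +2 at base 2, +3 at base 3, then the cycle repeats.
    frame = (start - 1) % 3 + 1
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return {"frame": frame, "length_bp": length, "codons": length // 3}


summary = orf_summary(251, 1396)
print(summary)  # {'frame': 2, 'length_bp': 1146, 'codons': 382}
```

A start at base 251 falls in frame +2, and 1146 bp divides evenly into 382 codons. Whether the stop codon is counted in that range depends on the ORF tool used.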
5. What are some characteristics of the 3D structure of this protein? It is a seven-transmembrane receptor
6. List 4 potential motifs this protein may have, based on Prosite classification.
  1. N-glycosylation site
  2. cAMP- and cGMP-dependent protein kinase phosphorylation site
  3. Protein kinase C phosphorylation site
  4. Casein kinase II phosphorylation site
  5. N-myristoylation site
  6. Leucine zipper pattern
  7. G-protein coupled receptors signature
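Several of the motifs listed above are short PROSITE patterns that can be written as regular expressions. The sketch below translates three of them by hand: N-{P}-[ST]-{P} for N-glycosylation, [ST]-x-[RK] for protein kinase C, and [ST]-x(2)-[DE] for casein kinase II. The sequence toy is a made-up peptide for illustration, not the exam protein, and find_motifs is a hypothetical helper name.

```python
import re

# PROSITE patterns translated by hand into Python regular expressions.
PATTERNS = {
    "N-glycosylation site": r"N[^P][ST][^P]",                  # N-{P}-[ST]-{P}
    "Protein kinase C phosphorylation site": r"[ST].[RK]",     # [ST]-x-[RK]
    "Casein kinase II phosphorylation site": r"[ST].{2}[DE]",  # [ST]-x(2)-[DE]
}


def find_motifs(sequence):
    """Return 1-based start positions of every match, including overlapping ones."""
    hits = {}
    for name, regex in PATTERNS.items():
        # A lookahead is used so that overlapping matches are all reported.
        lookahead = re.compile("(?=(" + regex + "))")
        hits[name] = [m.start() + 1 for m in lookahead.finditer(sequence)]
    return hits


toy = "MNGSAKSTRDE"  # hypothetical peptide, for illustration only
hits = find_motifs(toy)
for name, positions in hits.items():
    print(name, positions)
```

Patterns this short match by chance in most proteins, which is why the question asks for potential motifs rather than confirmed sites.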
7. Does this gene have any close homologs in other species? Rat and mouse have close homologs. Chicken, dog, and Drosophila have distant homologs.
8. What are the top 3 fingerPRINTS associated with this protein? Which ones have a high confidence level? GPCRHODOPSN, EDG1ORPHANR, CANABINOIDR
GPCRHODOPSN and EDG1ORPHANR are high-confidence matches.
9. Using the Molecules R Us website, find the structure for the molecule 2VUB. What is this molecule? This molecule is CcdB, a topoisomerase poison from E. coli.
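The identity of a structure in answer 9 can also be read from the TITLE records of its PDB-format file. This is a rough sketch under two assumptions: the file is fetched from the RCSB download address rather than from Molecules R Us, and the title text sits in columns 11 to 80 of each TITLE line. The sample records are invented for illustration and are not the contents of 2VUB.

```python
def pdb_download_url(pdb_id):
    """Build the RCSB download address for a PDB-format entry (assumed source)."""
    return "https://files.rcsb.org/download/" + pdb_id.upper() + ".pdb"


def read_title(pdb_text):
    """Join the TITLE records of a PDB-format file into one string."""
    parts = []
    for line in pdb_text.splitlines():
        if line.startswith("TITLE"):
            # Columns 11-80 hold the title text; columns 9-10 hold a continuation number.
            parts.append(line[10:80].strip())
    return " ".join(parts)


# Invented two-line TITLE block, padded so the text starts in the correct column.
sample = "\n".join([
    "TITLE".ljust(10) + "A HYPOTHETICAL TWO-LINE",
    "TITLE".ljust(9) + "2" + " TITLE FOR ILLUSTRATION",
    "REMARK".ljust(10) + "NOT PART OF THE TITLE",
])

print(pdb_download_url("2vub"))
print(read_title(sample))
```

Passing the text of the downloaded file to read_title in place of sample should give the title used to answer the question.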
10. Send me the ribbons-image picture of 2VUB as an attachment.