Final Exam for Introduction to Bioinformatics

Please email the answers to

helgew@sdsc.edu by 5pm Friday, October 12, 2001.

Late submissions will NOT be accepted

Question 1

The following sequence has a problem with it. Using the tools available at NCBI, identify what the problem is.

ATACTCTATAATAACCACAAGTTTACTAACGCAAGTAAAATTATTAAAACAGATTTTGGGAGTCCAGGAGAGCCTCAGATTA
TTTTTTGTAGAAGTGAAGCTGCACATCAAGGAGTAATTACCTGGAATCCCCCTCAAAGATCATTTCATAATTTTACCCTCTG
TTATATAAAAGAGACAGAAAAAGATTGCCTCAATCTGGATAAAAACCTGATCAAATATGATTTGCAAAATTTAAAACCTTAT
ACGAAATATGTTTTATCATTACATGCCTACATCATTGCAAAAGTGCAACGTAATGGAAGTGCTGCAATGTGTCATTTCACAA
CTAAAAGTGCTCCTCCAAGCCAGGTCTGGAACATGACTGTCTCCATGACATCAGATAATAGTATGCATGTCAAGTGTAGGCC
TCCCAGGGACCGTAATGGCCCCCATGAACGTTACCATTTGGAAGTTGAAGCTGGAAATACTCTGGTTAGAAATGAGTCGCAT
AAGAATTGCGATTTCCGTGTAAAAGATCTTCAATATTCAACAGACTACACTTTTAAGGCCTATTTTCACAATGGAGACTATC
CTGGAGAACCCTTTATTTTACATCATTCAACATCTTATAATTCTAAGGCACTGATAGCATTTCTGGCATTTCTGATTATTGT
GACATCAATAGCCCTGCTTGTTGTTCTCTACAAAATCTATGATCTACATAAGAAAAGATCCTGCAATTTAGATGAACAGCAG
GAGCTTGTTGAAAGGGATGATGAAAAACAACTGATGAATGTGGAGCCAATCCATGCAGATATTTTGTTGGAAACTTATAAGA
GGAAGATTGCTGATGAAGGAAGACCTTTTCTGGCTGAATTTCAGAGCATCCCGCGGGTGTTCAGCAAGTTTCCTATAAAGGA
AGCTCGAAAGCCCTTTAACCAGAATAAAAACCGTTATGTTGACATTCTTCCTTATGATTATAACCGTGTTGAACTCTCTGAG
ATAAACGGAGATGCAGGGTCAAACTACATAAATGCCAGCTATATTGATGGTTTCAAAGAACCCAGGAAATACATTGCTGCAC
AAGGTCCCAGGGATGAAACTGTTGATGATTTCTGGAGGATGATTTGGGAACAGAAAGCCACAGTTATTGTCATGGTCACTCG
ATGTGAAGAAGGAAACAGGAACAAGTGTGCAGAATACTGGCCGTCAATGGAAGAGGGCACTCGGGCTTTTGGAGATGTTGTT
GTAAAGATCAACCAGCACAAAAGATGTCCAGATTACATCATTCAGAAATTGAACATTGTAAATAAAAAAGAAAAAGCAACTG
GAAGAGAGGTGACTCACATGTCGTTTACTTTGACCAACAAGAACGTGATTTTCGTTGCCGGTCTGGGAGGCATTGGTCTGGA
CACCAGCAAGGAGCTGCTCAAGCGCGATCCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAAT
CGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCA
GCCTGAATGGCGAATGGCGCTTTGCCTGGTTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGGCTGGAGTGCGATCTTCCTGA
GGCCGATACTGTCGTCGTCCCCTCAAACTGGCAGATGCACGGTTACGATGCGCCCATCTACACCAACGTAACCTATCCCATT
ACGGTCAATCCGCCGTTTGTTCCCACGGAGAATCCGACGGGTTGTTACTCGCTCACATTTAATGTTGATGAAAGCTGGCTAC
AGGAAGGCCAGACGCGAATTATTTTTGATGGCGTTAACTCGGCGTTTCATCTGTGGTGCAACGGGCGCTGGGTCGGTTACGG
CCAGGACAGTCGTTTGCCGTCTGAATTTGACCTGAGCGCATTTTTACGCGCCGGAGAAAACCGCCTCGCGGTGATGGTGCTG
CGTTGGAGTGACGGCAGTTATCTGGAAGATCAGGATATGTGGCGGATGAGCGGCATTTTCCGTGACGTCTCGTTGCTGCATA
AACCGACTACACAAATCAGCGATTTCCATGTTGCCACTCGCTTTAATGATGATTTCAGCCGCGCTGTACTGGAGGCTGAAGT
TCAGATGTGCGGCGAGTTGCGTGACTACCTACGGGTAACAGTTTCTTTATGGCAGGGTGAAACGCAGGTCGCCAGCGGCACC
GCGCCTTTCGGCGGTGAAATTATCGATGAGCGTGGTGGTTATGCC
		

The following questions are regarding a gene called EDG-1

Question 2: What is the Unigene cluster id number?

Question 3: What is the LocusLink locus for this gene?

Question 4: What is the open reading frame for this gene?

Question 5: What are some characteristics of the 3D structure of this protein?

Question 6: List 4 potential motifs this protein may have, based on Prosite classification.

Question 7: Does this gene have any close homologs in other species?

Question 8: What are the top 3 fingerPRINTS associated with this protein? Which ones have a high confidence level?


Question 9: Using the PDB website, find the structure for the molecule 2VUB. What is this molecule?

Question 10: Send me the ribbons-image picture of 2VUB as an attatchment