Identify the open reading frame in the following DNA sequence, the protein that this gene encodes for, its function, and the source.

GACTATATCCGGCGTATGAAGAAGGTTTCTACGCTTGACCTGTTGTTCGTTGCGATCATGGGTGTTTCGC
CGGCCGCTTTTGCCGCCGACCTGATCGACGTGTCCAAACTCCCCAGCAAGGCTGCCCAGGGCGCGCCCGG
CCCGGTCACCTTGCAAGCCGCGGTCGGCGCTGGCGGTGCCGACGAACTGAAAGCGATCCGCAGCACGACC
CTGCCCAACGGCAAGCAGGTCACCCGCTACGAGCAATTCCACAACGGCGTACGGGTGGTCGGCGAAGCCA
TCACCGAAGTCAAGGGTCCCGGCAAGAGCGTGGCGGCGCAGCGCAGCGGCCATTTCGTCGCCAACATCGC
TGCCGACCTGCCGGGCAGCACCACCGCGGCGGTATCCGCCGAGCAGGTGCTGGCCCAGGCCAAGAGCCTG
AAGGCCCAGGGCCGCAAGACCGAGAATGACAAAGTGGAACTGGTGATCCGCCTGGGCGAGAACAACATCG
CCCAACTGGTCTACAACGTCTCCTACCTGATTCCCGGCGAGGGACTGTCGCGGCCGCATTTCGTCATCGA
CGCCAAGACCGGCGAAGTGCTCGATCAGTGGGAAGGCCTGGCCCACGCCGAGGCGGGCGGCCCCGGCGGC
AACCAGAAGATCGGCAAGTACACCTACGGTAGCGACTACGGTCCGCTGATCGTCAACGACCGCTGCGAGA
TGGACGACGGCAACGTCATCACCGTCGACATGAACAGCAGCACCGACGACAGCAAGACCACGCCGTTCCG
CTTCGCCTGCCCGACCAACACCTACAAGCAGGTCAACGGCGCCTATTCGCCGCTGAACGACGCGCATTTC
TTCGGCGGCGTGGTGTTCAAACTGTACCGGGACTGGTTCGGCACCAGCCCGCTGACCCACAAGCTGTACA
TGAAGGTGCACTACGGGCGCAGCGTGGAGAACGCCTACTGGGACGGCACGGCGATGCTCTTCGGCGACGG
CGCCACCATGTTCTATCCGCTGGTGTCGCTGGACGTGGCGGCCCACGAGGTCAGCCACGGCTTCACCGAG
CAGAACTCCGGGCTGATCTACCGCGGGCAATCAGGCGGAATGAACGAAGCGTTCTCCGACATGGCCGGCG
AGGCTGCCGAGTTCTATATGCGCGGCAAGAACGACTTCCTGATCGGCTACGACATCAAGAAGGGCAGCGG
TGCGCTGCGCTACATGGACCAGCCCAGCCGCGACGGGCGATCCATCGACAACGCGTCGCAGTACTACAAC
GGCATCGACGTGCACCACTCCAGCGGCGTGTACAACCGTGCGTTCTACCTGTTGGCCAATTCGCCGGGCT
GGGATACCCGCAAGGCCTTCGAGGTGTTCGTCGACGCCAACCGCTACTACTGGACCGCCACCAGCAACTA
CAACAGCGGCGCCTGCGGGGTGATTCGCTCGGCGCAGAACCGCAACTACTCGGCGGCTGACGTCACCCGG
GCGTTCAGCACCGTCGGCGTGACCTGCCCGAGCGCGTTGTAA

This question involves using a database. You will use a database for storage and mining of genome sequences. The procedure to identify the gene and the protein that it encodes is as follows:

How to identify the start site for transcription?

used EXPASY WEBsite translated this DNA sequence, as the IMAGE shown, there are genetic code is translated. i think I am doing wrong?

2. Click on the DNA sequence from the start site of transcription, select all of the sequence, and copy the sequence

Should I copy the letter which is highlitend RED, which is start site, and then past it to BLAST?

Transcribed Image Text:ttacaacgcgctcgggcaggtcacgccgacggtgctgaacgcccgggtgacgtcagccgc L Q R A R AGH A D G AER PGD VS R cgagtagttgcggttctgcgccgagcgaatcaccccgcaggcgccgctgttgtagttgct RV VA V L R R A N H PA GA A V V V A ggtggcggtccagtagtagcggttggcgtcgacgaacacctcgaaggccttgcgggtatc GG G P V VA V G V DEH LEG LAGI ccagcccggcgaattggccaacaggtagaacgcacggttgtacacgccgctggagtggtg PARR I G Q Q VERT V V HA A G V V cacgtcgatgccgttgtagtactgcgacgcgttgtcgatggatcgcccgtcgcggctggg HV DA V V V L R R V VDG SPVA A G ctggtccatgtagcgcagcgcaccgctgcccttcttgatgtcgtagccgatcaggaagtc LV HVA QRTA A LLD V VAD QE V gttcttgccgcgcatatagaactcggcagcctcgccggccatgtcggagaacgcttcgtt VLA A HI E L GS LA GHV GERF V cattccgcctgattgcccgcggtagatcagcccggagttctgctcggtgaagccgtggct HSA - L P A V D Q P G V L L G E A VA gacctcgtgggccgccacgtccagcgacaccagcggatagaacatggtggcgccgtcgcc DL VG RH V Q R H Q R I E H G G A VA gaagagcatcgccgtgccgtcccagtaggcgttctccacgctgcgcccgtagtgcacctt EEHR RAV P V G V LHA A P V V HL catgtacagcttgtgggtcagcgggctggtgccgaaccagtcccggtacagtttgaacac HV Q L V G Q R A GAEP V P V Q FEH cacgccgccgaagaaatgcgcgtcgttcagcggcgaataggcgccgttgacctgcttgta HA A E E MR V V Q R R I G A V D L L V ggtgttggtcgggcaggcgaagcggaacggcgtggtcttgctgtcgtcggtgctgctgtt GV GRAGE AERR GLA V VGA A V catgtcgacggtgatgacgttgccgtcgtccatctcgcagcggtcgttgacgatcagcgg HV D G D D VA V V HLA A V V D D QR accgtagtcgctaccgtaggtgtacttgccgatcttctggttgccgccggggccgcccgc TV VAT V G VLA D L L VA A G A AR ctcggcgtgggccaggccttcccactgatcgagcacttcgccggtcttggcgtcgatgac LGV G Q AFP LI E H F A G L G V D D gaaatgcggccgcgacagtccctcgccgggaatcaggtaggagacgttgtagaccagttg
Transcribed Image Text:5 https://web.expasy.org/translate/ Translate is a tool which allows the translation of a nucleotide (DNA/RNA) sequence to a protein sequence. DNA or RNA sequence gactatatccggcgtatgaagaaggtttctacgcttgacctgttgttcg ttgcgatcatgggtgtttcgccggccgcttttgccgccgacctgatcg acgtgtccaaactccccagcaaggctgcccagggcgcgcccggc ccggtcaccttgcaagccgcggtcggcgctggcggtgccgacgaa ctgaaagcgatccgcagcacgaccctgcccaacggcaagcaggt cacccgctacgagcaattccacaacggcgtacgggtggtcggcg aagccatcaccgaagtcaagggtcccggcaagagcgtggcggc gcagcgcagcggccatttcgtcgccaacatcgctgccgacctgcc gggcagcaccaccgcggcggtatccgccgagcaggtgctggccc Genetic codes - See NCBI's genetic codes Standard reset Results of translation Translate tool TRANSLATE! . 3'5' Frame 1 Output format O Verbose: Met, Stop, spaces between residues O Compact: M, -, no spaces O Includes nucleotide sequence Includes nucleotide sequence, no spaces Open reading frames are highlighted in red • Select your initiator on one of the following frames to retrieve your amino acid sequence DNA strands O forward ✔reverse Download all the translated frames ttacaacgcgctcgggcaggtcacgccgacggtgctgaacgcccgggtgacgtcagccgc LQRA RAG HAD GAER PGD VS R cgagtagttgcggttctgcgccgagcgaatcaccccgcaggcgccgctgttgtagttgct R V 17 A V T R R A N H PAG A A D V V A

Question

Identify the open reading frame in the following DNA sequence, the protein that this gene encodes for, its function, and the source.

GACTATATCCGGCGTATGAAGAAGGTTTCTACGCTTGACCTGTTGTTCGTTGCGATCATGGGTGTTTCGC
CGGCCGCTTTTGCCGCCGACCTGATCGACGTGTCCAAACTCCCCAGCAAGGCTGCCCAGGGCGCGCCCGG
CCCGGTCACCTTGCAAGCCGCGGTCGGCGCTGGCGGTGCCGACGAACTGAAAGCGATCCGCAGCACGACC
CTGCCCAACGGCAAGCAGGTCACCCGCTACGAGCAATTCCACAACGGCGTACGGGTGGTCGGCGAAGCCA
TCACCGAAGTCAAGGGTCCCGGCAAGAGCGTGGCGGCGCAGCGCAGCGGCCATTTCGTCGCCAACATCGC
TGCCGACCTGCCGGGCAGCACCACCGCGGCGGTATCCGCCGAGCAGGTGCTGGCCCAGGCCAAGAGCCTG
AAGGCCCAGGGCCGCAAGACCGAGAATGACAAAGTGGAACTGGTGATCCGCCTGGGCGAGAACAACATCG
CCCAACTGGTCTACAACGTCTCCTACCTGATTCCCGGCGAGGGACTGTCGCGGCCGCATTTCGTCATCGA
CGCCAAGACCGGCGAAGTGCTCGATCAGTGGGAAGGCCTGGCCCACGCCGAGGCGGGCGGCCCCGGCGGC
AACCAGAAGATCGGCAAGTACACCTACGGTAGCGACTACGGTCCGCTGATCGTCAACGACCGCTGCGAGA
TGGACGACGGCAACGTCATCACCGTCGACATGAACAGCAGCACCGACGACAGCAAGACCACGCCGTTCCG
CTTCGCCTGCCCGACCAACACCTACAAGCAGGTCAACGGCGCCTATTCGCCGCTGAACGACGCGCATTTC
TTCGGCGGCGTGGTGTTCAAACTGTACCGGGACTGGTTCGGCACCAGCCCGCTGACCCACAAGCTGTACA
TGAAGGTGCACTACGGGCGCAGCGTGGAGAACGCCTACTGGGACGGCACGGCGATGCTCTTCGGCGACGG
CGCCACCATGTTCTATCCGCTGGTGTCGCTGGACGTGGCGGCCCACGAGGTCAGCCACGGCTTCACCGAG
CAGAACTCCGGGCTGATCTACCGCGGGCAATCAGGCGGAATGAACGAAGCGTTCTCCGACATGGCCGGCG
AGGCTGCCGAGTTCTATATGCGCGGCAAGAACGACTTCCTGATCGGCTACGACATCAAGAAGGGCAGCGG
TGCGCTGCGCTACATGGACCAGCCCAGCCGCGACGGGCGATCCATCGACAACGCGTCGCAGTACTACAAC
GGCATCGACGTGCACCACTCCAGCGGCGTGTACAACCGTGCGTTCTACCTGTTGGCCAATTCGCCGGGCT
GGGATACCCGCAAGGCCTTCGAGGTGTTCGTCGACGCCAACCGCTACTACTGGACCGCCACCAGCAACTA
CAACAGCGGCGCCTGCGGGGTGATTCGCTCGGCGCAGAACCGCAACTACTCGGCGGCTGACGTCACCCGG
GCGTTCAGCACCGTCGGCGTGACCTGCCCGAGCGCGTTGTAA

This question involves using a database. You will use a database for storage and mining of genome sequences. The procedure to identify the gene and the protein that it encodes is as follows:

How to identify the start site for transcription?

used EXPASY WEBsite translated this DNA sequence, as the IMAGE shown, there are genetic code is translated. i think I am doing wrong?

2. Click on the DNA sequence from the start site of transcription, select all of the sequence, and copy the sequence

Should I copy the letter which is highlitend RED, which is start site, and then past it to BLAST?

ttacaacgcgctcgggcaggtcacgccgacggtgctgaacgcccgggtgacgtcagccgc
L Q R A R AGH A D G AER PGD VS R
cgagtagttgcggttctgcgccgagcgaatcaccccgcaggcgccgctgttgtagttgct
RV VA V L R R A N H PA GA A V V V A
ggtggcggtccagtagtagcggttggcgtcgacgaacacctcgaaggccttgcgggtatc
GG G P V VA V G V DEH LEG LAGI
ccagcccggcgaattggccaacaggtagaacgcacggttgtacacgccgctggagtggtg
PARR I G Q Q VERT V V HA A G V V
cacgtcgatgccgttgtagtactgcgacgcgttgtcgatggatcgcccgtcgcggctggg
HV DA V V V L R R V VDG SPVA A G
ctggtccatgtagcgcagcgcaccgctgcccttcttgatgtcgtagccgatcaggaagtc
LV HVA QRTA A LLD V VAD QE V
gttcttgccgcgcatatagaactcggcagcctcgccggccatgtcggagaacgcttcgtt
VLA A HI E L GS LA GHV GERF V
cattccgcctgattgcccgcggtagatcagcccggagttctgctcggtgaagccgtggct
HSA - L P A V D Q P G V L L G E A VA
gacctcgtgggccgccacgtccagcgacaccagcggatagaacatggtggcgccgtcgcc
DL VG RH V Q R H Q R I E H G G A VA
gaagagcatcgccgtgccgtcccagtaggcgttctccacgctgcgcccgtagtgcacctt
EEHR RAV P V G V LHA A P V V HL
catgtacagcttgtgggtcagcgggctggtgccgaaccagtcccggtacagtttgaacac
HV Q L V G Q R A GAEP V P V Q FEH
cacgccgccgaagaaatgcgcgtcgttcagcggcgaataggcgccgttgacctgcttgta
HA A E E MR V V Q R R I G A V D L L V
ggtgttggtcgggcaggcgaagcggaacggcgtggtcttgctgtcgtcggtgctgctgtt
GV GRAGE AERR GLA V VGA A V
catgtcgacggtgatgacgttgccgtcgtccatctcgcagcggtcgttgacgatcagcgg
HV D G D D VA V V HLA A V V D D QR
accgtagtcgctaccgtaggtgtacttgccgatcttctggttgccgccggggccgcccgc
TV VAT V G VLA D L L VA A G A AR
ctcggcgtgggccaggccttcccactgatcgagcacttcgccggtcttggcgtcgatgac
LGV G Q AFP LI E H F A G L G V D D
gaaatgcggccgcgacagtccctcgccgggaatcaggtaggagacgttgtagaccagttg

5
https://web.expasy.org/translate/
Translate is a tool which allows the translation of a nucleotide (DNA/RNA) sequence to a protein sequence.
DNA or RNA sequence
gactatatccggcgtatgaagaaggtttctacgcttgacctgttgttcg
ttgcgatcatgggtgtttcgccggccgcttttgccgccgacctgatcg
acgtgtccaaactccccagcaaggctgcccagggcgcgcccggc
ccggtcaccttgcaagccgcggtcggcgctggcggtgccgacgaa
ctgaaagcgatccgcagcacgaccctgcccaacggcaagcaggt
cacccgctacgagcaattccacaacggcgtacgggtggtcggcg
aagccatcaccgaagtcaagggtcccggcaagagcgtggcggc
gcagcgcagcggccatttcgtcgccaacatcgctgccgacctgcc
gggcagcaccaccgcggcggtatccgccgagcaggtgctggccc
Genetic codes - See NCBI's genetic codes
Standard
reset
Results of translation
Translate tool
TRANSLATE!
.
3'5' Frame 1
Output format
O Verbose: Met, Stop, spaces between residues
O Compact: M, -, no spaces
O Includes nucleotide sequence
Includes nucleotide sequence, no spaces
Open reading frames are highlighted in red
• Select your initiator on one of the following frames to retrieve your
amino acid sequence
DNA strands
O forward ✔reverse
Download all the translated frames
ttacaacgcgctcgggcaggtcacgccgacggtgctgaacgcccgggtgacgtcagccgc
LQRA RAG HAD GAER PGD VS R
cgagtagttgcggttctgcgccgagcgaatcaccccgcaggcgccgctgttgtagttgct
R V 17 A V T R R A N H PAG A A
D
V V A

Accepted Answer

The genome or genomic data of an organism refers to the complete set of its hereditary material…