Monarch geneset OGS2.0

DPOGS210552
TranscriptDPOGS210552-TA2991 bp
ProteinDPOGS210552-PA996 aa
Genomic positionDPSCF300304 + 94402-106789
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0064277e-9145.30% 
BombyxBGIBMGA013446-TA0.052.66% 
DrosophilaCG10841-PA8e-6926.36% 
EBI UniRef50UniRef50_D6W9472e-10540.19%Putative uncharacterized protein n=3 Tax=Neoptera RepID=D6W947_TRICA
NCBI RefSeqXP_975461.14e-10640.19%PREDICTED: similar to GA10590-PA [Tribolium castaneum]
NCBI nr blastpgi|3287823718e-11435.12%PREDICTED: hypothetical protein LOC551661 [Apis mellifera]
NCBI nr blastxgi|3287823712e-10934.90%PREDICTED: hypothetical protein LOC551661 [Apis mellifera]
Group
Gene OntologyGO:00055092.4e-06calcium ion binding
KEGG pathway 
InterPro domain[46-185] IPR0119922.4e-06EF-hand-like domain
Orthology groupMCL16323 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210552-TA
ATGGTATATCACCCTACTCCACCTCGAGTCTCTGCGCACGCCATGCCACATGTAACATCAAAAGAGATAGTGCGTCTCGCGGATGTTTGGAAAACTTGCAACAAGATCCGCGCGGCGGTGTTCCGCGTGGGACTCAACTTGTGGGACTGGTACCGTCCGCTGGATCCCGAGGGCAACTCACTCATTTCCGAATCAAAGTTCGTGTCTGTGCTGGCGGGCCCGCTGAGGTCGGTGGTGGGTCTGTCGGAAGCGGAGATCGCTCAGCTGGCGGACTACTTCCGAGCGCAAGACGGCCGCGTGCTCTACCACCAGCTGTGCCAGATCATACACGGAGAGGAAATTCTGTATGAGACAAACAAGAATGTCATGGAATTAACCTCCCTGAGTGACCATGTGGCTAAAAACGATGGACGAATAACATTCGCGCACTTCGCCCGCATCCTTGACTACATCGGCGTGATCGTGTCTCCTGAGGACTTCAACCTTCTGGTGCGTCGCTTCATTAAGGATTCCTACACACTCGACTACGTGGAGTTCTTGAAAGCAGTGGAGGCTGTCAAGAAGGAGGGCATACAAGGGCTTGGACCGGCTTACAATAATCCGAAAGCGGTAATCGATACGACGCTGCCCAAGCTATCTCGCCCGGAGATAGAAGCGGGAATATCGCCGACAGCACTCGGCCATACTGATGTGTTCCATCCCGCGCTGGAACCACGGCGACCACCTCGACACTTGCTCGATATCATGATGAGAGTGCAGGAGTTTGTATTGCAGAGAAGGATACGAGTCTCCGAATTTCTTAGGGATTACGATCCACTGAACTGTGGTCGCATCTCCCCGCAACAGTTTCTCCGCGCGATGGACGCCATGGGTCTACAAGCCGTGCTGTCAGAGCGTGAAGCGCGCTGCGTCTGCGCACATTACACCAGCCCCAACGATCCTGTAGCCGTCTGCTGGAGAACGTTTGAGGACGATTGCGACCAGGTTTTCACCATCAAGGAGCTAGAGAAGCACCCTGAGGTGGTGGTGGGTGGTGCCGCCGCAGAGGTGTCGGAGCTACCGGCCCTGGGGTCCGCGGATGACCGCGGGGCGGATAGGCCTGGAGGAGGGGTCGGGGAGCGCGAGCTGGACGCCGCACAGGCTGCCCTGCTGAGAGTGCGCGCTGCTTGCCAGGAACGATCTATTGATCTTAGGCCACTATTTGGAGACCACGACGAACACAACAACGGCCACGTGTCTCGGTCGCAGGTCCGCCGCGTGTTAGCGAGAGCGGGCGTGTTGCCGGCCGCGGCACAGCTCCGGGCTCTGGAGACGCGATACCTAGACGACTGTGGCTTCAAATACGTTGCCCTTCTCGATGAGTTAGAAGAGAAGCCGGTTGAAAGCGCCACCATCTCTCGACCAGTGGCAGCGGGACATAAGGCTAAAACCAGCGTCGTCGATCCGAGAGAGACTGACATAGTGCAGATACTTGCCAAGATCAAAGGAAAAGCGTGTCCGCCGCCGCCCGAGCCCGAGTGTCACAAACTGGACCAGTGGCAGTTGAGACGCCTGTGTTGTCTGCTGGCTACCATCGCTCAGAGAGACCTGCCGCTCCGACCCTACTTCCAGGACTATGAATTGGTGGCTAAAAACGATGGACGAATAACATTCGCGCACTTCGCCCGCATCCTTGACTACATCGGCGTGATCGTGTCTCCTGAGGACTTCAACCTTCTGGTGCGTCGCTTCATTAAGGATTCCTACACACTCGACTACGTGGAGTTCTTGAAAGCAGTGGAGGCTGTCAAGAAGGAGGGCATACAAGGGCTTGGACCGGCTTACAATAATCCGAAAGCGGTAATCGATACGACGCTGCCCAAGCTATCTCGCCCGGAGATAGAAGCGGGAATATCGCCGACAGCACTCGGCCATACTGATGTGTTCCATCCCGCGCTGGAACCACGGCGACCACCTCGACACTTGCTCGATATCATGATGAGAGTGCAGGAGTTTGTATTGCAGAGAAGGATACGAGTCTCCGAATTTCTTAGGGATTACGATCCACTGAACTGTGGTCGCATCTCCCCGCAACAGTTTCTCCGCGCGATGGACGCCATGGGTCTACAAGCCGTGCTGTCAGAGCGTGAAGCGCGCTGCGTCTGCGCACATTACACCAGCCCCAACGATCCTGTAGCCGTCTGCTGGAGAACGTTTGAGGACGATTGCGACCAGGTTTTCACCATCAAGGAGCTAGAGAAGCACCCTGAGGTGGTGGTGGGTGGTGCCGCCGCAGAGGTGTCGGAGCTACCGGCCCTGGGGTCCGCGGATGACCGCGGGGCGGATAGGCCTGGAGGAGGGGTCGGGGAGCGCGAGCTGGACGCCGCACAGGCTGCCCTGCTGAGAATGGTTCGTGAGGGTGTCCGTCCCCGCGAGTTCGTTTCTCAGTTCGATCCCCGTCACGAACGAGTGGTTCCTCGCGCAGACTTCTACCGCGGTCTGGCAGCGGCCGGGCTGGCTCTCACACCGATAGAGATGGACACACTCATGGAGGTTTTCAGTGCGCCGGGTCGCCGGCGGTACGTGGAGTATGAGCGATTTTGCGAGACAGTGGGCGAGAGTCTGGTGCAAGGAGGCCTGGAGCGAGCCCCGCTACTGGCACCGCTCCAACACGTGCCTGCCAGAGACACGCCACTCAATTACCTAAACTACGAAGAACGGGCGCTAGTAGCCGCCGCTCTAGACAAACTATCACACTTCCCCGATCAGCTGTCTAATATTATGGAGGTGTTCAAGGACGCTGACAAGGAGCGCTGCGGTACCATCCCTAGGGTGAGCGTGGAACGCGCGCTCTGCCAGCGTGGCTTGTTGGCGAGGCTGTCTGCCAGGGAGAGAGACCTGCTGTACAAGTGTTTCGGATACAGACGAGGGTGTGGAGACGAGGTGGACTACCGAGCATTGTGCAAGGCGCTCGACGTCCTACACGCAACATCGAGCGCGCAACCTTGCTGA

Protein sequence:

>DPOGS210552-PA
MVYHPTPPRVSAHAMPHVTSKEIVRLADVWKTCNKIRAAVFRVGLNLWDWYRPLDPEGNSLISESKFVSVLAGPLRSVVGLSEAEIAQLADYFRAQDGRVLYHQLCQIIHGEEILYETNKNVMELTSLSDHVAKNDGRITFAHFARILDYIGVIVSPEDFNLLVRRFIKDSYTLDYVEFLKAVEAVKKEGIQGLGPAYNNPKAVIDTTLPKLSRPEIEAGISPTALGHTDVFHPALEPRRPPRHLLDIMMRVQEFVLQRRIRVSEFLRDYDPLNCGRISPQQFLRAMDAMGLQAVLSEREARCVCAHYTSPNDPVAVCWRTFEDDCDQVFTIKELEKHPEVVVGGAAAEVSELPALGSADDRGADRPGGGVGERELDAAQAALLRVRAACQERSIDLRPLFGDHDEHNNGHVSRSQVRRVLARAGVLPAAAQLRALETRYLDDCGFKYVALLDELEEKPVESATISRPVAAGHKAKTSVVDPRETDIVQILAKIKGKACPPPPEPECHKLDQWQLRRLCCLLATIAQRDLPLRPYFQDYELVAKNDGRITFAHFARILDYIGVIVSPEDFNLLVRRFIKDSYTLDYVEFLKAVEAVKKEGIQGLGPAYNNPKAVIDTTLPKLSRPEIEAGISPTALGHTDVFHPALEPRRPPRHLLDIMMRVQEFVLQRRIRVSEFLRDYDPLNCGRISPQQFLRAMDAMGLQAVLSEREARCVCAHYTSPNDPVAVCWRTFEDDCDQVFTIKELEKHPEVVVGGAAAEVSELPALGSADDRGADRPGGGVGERELDAAQAALLRMVREGVRPREFVSQFDPRHERVVPRADFYRGLAAAGLALTPIEMDTLMEVFSAPGRRRYVEYERFCETVGESLVQGGLERAPLLAPLQHVPARDTPLNYLNYEERALVAAALDKLSHFPDQLSNIMEVFKDADKERCGTIPRVSVERALCQRGLLARLSARERDLLYKCFGYRRGCGDEVDYRALCKALDVLHATSSAQPC-