Monarch geneset OGS2.0

DPOGS211079
Transcript  DPOGS211079-TA  2703 bp
Protein  DPOGS211079-PA  900 aa
Genomic position  DPSCF300007 - 1334946-1339172
RNAseq coverage  101x (Rank: top 61%)
Annotation
Heliconius  HMEL002212  0.0  83.13%
Bombyx  BGIBMGA002959-TA  0.0  72.68%
Drosophila  CG4751-PA  2e-131  52.64%
EBI UniRef50  UniRef50_Q170S0  1e-142  56.01%  Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q170S0_AEDAE
NCBI RefSeq  XP_001652940.1  2e-143  56.01%  hypothetical protein AaeL_AAEL007827 [Aedes aegypti]
NCBI nr blastp  gi|157117027  4e-142  56.01%  hypothetical protein AaeL_AAEL007827 [Aedes aegypti]
NCBI nr blastx  gi|157117027  1e-140  43.56%  hypothetical protein AaeL_AAEL007827 [Aedes aegypti]
Group
Gene Ontology  GO:0005515  4.8e-09  protein binding
KEGG pathway 
InterPro domain  [247-340]  IPR000555  4.8e-09  Mov34/MPN/PAD-1
Orthology group  MCL16763  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211079-TA
ATGGATTTACAAACATCGCATTTGGCAGCTGCACTGATGTCTAATAACAATTCATTTAACAATGAACAACCTGGGGCTGTCATAGCTCCTACCATCAACGATAAAGATGAACCTATGCAAAATGATCCGGTACCAGCGGTTGGTGAACCAGCAAAACCAACCACATATGAAGAAAAAGAGGAAATAGGCAGTGGTGAAGAGTTCTCCGATGAGGAGTCAGAAGCACAACCTGAAAACAAATGTATGCCAGGAAGAGGTGTTACTTTGCAAATGCTTTTGGAAGAGAAAATGTTAGAACCCGGACATGCTGCTATGACAATAGAATATTTGGGTCAAAAGTTTGTTGGTGATTTACAAGCCGATGGAAAAATAAAATCACATGAAACAGAAACAATATTCTGTTCACCGTCTGCTTGGGCCATTCACTGCAAGAGAATTATAAACCCCGACAAAAGATCGGGATGCGGGTGGGCATCAGTGAAATACAGAGGCAAGAAATTGGATACCATTAAAGCTACTTATTTAAGAAAGAAACAACTACAGAGGGAGAACATGCATAGTGATGAAGAAACAGAAATGGAGGTGGAGAGTCCTCCTGAGCCTCCCCCGCAGAGGATTGTAATGAAACATAATACTGTACCCAATAGAATGATGCAACATGATGCGAACATGCTGATTGAAGCTGTGTCATTCTCGACAGCGGGAAAGATTCAGCCGTTTTTAGTATCAGTTAACTCCAATGCCTTACTGATACTAGATATACATTGTCATTTGAAAAAGGAAGAAGTTTATGGCTATTTGGCTGGTACATGGGATCTGAATAATCATAATGTCTCAATCACTCATACATTCCCATGCTTGATAAGCAAGAATGACTCGAGACCAAGGGTTTTAGTTGAGTTGGAAATACAAATGGAAATTGAAAAGTTAGGGCTGTCTTTATTAGGCTGGTACCACTCTCACCCAACCAACCCGGCCATGCCCAGTCTTAGGGACTGTGATAATCAACTTGAATACCAGATAAAAATGAGGGGCCCTACAGAAATATCGTACATTCCTTGTATTGGAGTTATTTGTTCACCCTACAATCCGGAAAGTCCTGTAATGGAATCATCATTAACATTCTTCTGGGTTATGCCTCCTCCAGAACAGAGACCCACAGAATACCCGAAACCTCTTCTTTTGCAATACAATATGATTCATGATACACATTTATCAACTCACGCTATGGAACAGATAAAGAAAAGCATCAAATACTATGGCACATTCGCTGACGACTCGCTAGTCAGTTTTAAGGATAACTTTAAGCCTGATATCACTTATTTGGATAAATTAAAATGTACCCTCACTCCGAAATTCCCGCGAGAGCAAAGTGATGGGTTGTTATGGCATTTTATAAGGGATGAGTTAGGTTGTTCGTCAGAAAATGATGATAAGATGGATTTAGATGCGTTATTGGCTGTTCCACAACCGATTCCCGCATCTAAACCGCAATCCACTTCCATACCTAACTTCCCATCAGTTAGCACACTGCAACAGATGGTGAGTAGACCGGCCGGAGTGCCACCAATCAATGTTTCTTCGGCTATAGGTTCAGTTTCCCCTCATAAATTTGAGACGCCACCGCTTAATATACCCGTTTTACCGACATCCTCTAAACTTTCTAAATCGACAACTCCTTCCCTTCCCCCAACATATCCAACCGGTCTGGATATGTTAACTAGTATGGCTTTAGGACTTGGATCTACGAATATGCCATTACCTCTAGGTACATCAGGTCTAGAGAGTCTAGCGGCTGCTAACAGTATGCTAACAGGCTTTAATCCAGCATTATCTTCGAACTTAGCATCCACATTATCTTCAAGCAAACTCCCGGATTTGCCTTCATACGCTGCCTCTTTGCAAAATCTATCGAATAGTATGGTTAATTATGAGAAAACTTCTACTACGACTTCAACAAGTTGTACAACTTCCGTCGCCCCTATACCAGCCAGCATCGC
CTCTAACCTTATGATGAGTTCGGCTGATATAGCCAATGCATTATTTTCAGCTAGTAAATATTCTAGTGCTGGTATATTAGGAATACCAGATCCAATGTCGAAATCCACTCTGGCTGCCAATAACATGTTTTTGTCGCCTTCTTTGCTTAAAATGCAAGAGTCATTGATGAAGCCTTTGTCAAGTAGTAGTCCGATCCCGTCTAAAGTTGGTCTCGATCAGAACATGTTAATGAAAAGTCCCCATGACCTCATTAAACCATCTAAGGACTATCTGCCCCCTGATTTTGGAAGTATTAATAAAACAAAAAGTAGTTCCCACGATCCCATAAAACACCCGGACGTGAGCTCGTCTAAATCAGAGACTTCTAAGTCGGTTTTATCTGAATCACAACTACCAGAATACCCTCAAATTTGCTCCTCGCCAAGGGTTGGGGGTGATCCATTTTTGAATCAAATGTTGGAATTGACTAAGAAGACGACGATTCCTGACTATCCGGCCGACTACAGCCAGCCGCGGAAGATGGATGAAGATGTCCAGAAGCTAACTCCGACAACCCTTTCATACTCAAGCGCCGCTAGCATAGCTGACACTATAGCCCAGGTTGCGATGGGAAATTTTAATAAAATGGAAGATGCCATGGACTATTCTACTGGTCAAGATTATTCAACTACGAAAAATACAAGCAGTGAAACAGAAAATTAA

Protein sequence:

>DPOGS211079-PA
MDLQTSHLAAALMSNNNSFNNEQPGAVIAPTINDKDEPMQNDPVPAVGEPAKPTTYEEKEEIGSGEEFSDEESEAQPENKCMPGRGVTLQMLLEEKMLEPGHAAMTIEYLGQKFVGDLQADGKIKSHETETIFCSPSAWAIHCKRIINPDKRSGCGWASVKYRGKKLDTIKATYLRKKQLQRENMHSDEETEMEVESPPEPPPQRIVMKHNTVPNRMMQHDANMLIEAVSFSTAGKIQPFLVSVNSNALLILDIHCHLKKEEVYGYLAGTWDLNNHNVSITHTFPCLISKNDSRPRVLVELEIQMEIEKLGLSLLGWYHSHPTNPAMPSLRDCDNQLEYQIKMRGPTEISYIPCIGVICSPYNPESPVMESSLTFFWVMPPPEQRPTEYPKPLLLQYNMIHDTHLSTHAMEQIKKSIKYYGTFADDSLVSFKDNFKPDITYLDKLKCTLTPKFPREQSDGLLWHFIRDELGCSSENDDKMDLDALLAVPQPIPASKPQSTSIPNFPSVSTLQQMVSRPAGVPPINVSSAIGSVSPHKFETPPLNIPVLPTSSKLSKSTTPSLPPTYPTGLDMLTSMALGLGSTNMPLPLGTSGLESLAAANSMLTGFNPALSSNLASTLSSSKLPDLPSYAASLQNLSNSMVNYEKTSTTTSTSCTTSVAPIPASIASNLMMSSADIANALFSASKYSSAGILGIPDPMSKSTLAANNMFLSPSLLKMQESLMKPLSSSSPIPSKVGLDQNMLMKSPHDLIKPSKDYLPPDFGSINKTKSSSHDPIKHPDVSSSKSETSKSVLSESQLPEYPQICSSPRVGGDPFLNQMLELTKKTTIPDYPADYSQPRKMDEDVQKLTPTTLSYSSAASIADTIAQVAMGNFNKMEDAMDYSTGQDYSTTKNTSSETEN-
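
The lengths and sequence ends listed in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and is not part of the database record. It assumes the transcript consists solely of coding sequence (start codon through stop codon) and that the trailing "-" in the protein sequence marks the stop codon. The names `expected_cds_length` and `translate` are hypothetical helpers, and the codon table is deliberately partial: it holds only the standard-code codons needed for the first 18 and last 15 nucleotides of DPOGS211079-TA.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2703
PROTEIN_AA = 900
CDS_START = "ATGGATTTACAAACATCG"  # first 18 nt of DPOGS211079-TA
CDS_END = "GAAACAGAAAATTAA"       # last 15 nt of DPOGS211079-TA

# Partial standard codon table: only the codons that occur in the two
# fragments above. "-" stands for the stop codon, as in the record.
CODONS = {
    "ATG": "M", "GAT": "D", "TTA": "L", "CAA": "Q", "ACA": "T",
    "TCG": "S", "GAA": "E", "AAT": "N", "TAA": "-",
}


def expected_cds_length(n_residues):
    """Nucleotides needed to encode n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


def translate(nt):
    """Translate an in-frame nucleotide string codon by codon."""
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, len(nt), 3))


if __name__ == "__main__":
    print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    print(translate(CDS_START))
    print(translate(CDS_END))
```

Under these assumptions the 900-residue protein plus one stop codon accounts for all 2703 bp, and the translated fragments agree with the start (MDLQTS) and end (ETEN-) of DPOGS211079-PA.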