Monarch geneset OGS2.0

DPOGS211550
Transcript: DPOGS211550-TA (2748 bp)
Protein: DPOGS211550-PA (915 aa)
Genomic position: DPSCF300159 + 156413-165127
RNAseq coverage: 420x (Rank: top 29%)
Annotation
Heliconius: HMEL011330  0.0  68.38%
Bombyx: BGIBMGA014363-TA  1e-158  53.34%
Drosophila: crm-PB  8e-54  35.90%
EBI UniRef50: UniRef50_D1ZZL3  4e-56  39.14%  Putative uncharacterized protein GLEAN_07439 n=2 Tax=Tribolium castaneum RepID=D1ZZL3_TRICA
NCBI RefSeq: XP_973783.2  7e-57  39.14%  PREDICTED: similar to cramped [Tribolium castaneum]
NCBI nr blastp: gi|270005389  1e-55  39.14%  hypothetical protein TcasGA2_TC007439 [Tribolium castaneum]
NCBI nr blastx: gi|157106423  4e-65  29.50%  cramped protein [Aedes aegypti]
Group
Gene Ontology: GO:0005515  6.9e-06  protein binding
               GO:0003677  3.6e-05  DNA binding
KEGG pathway:
InterPro domain: [71-138]  IPR009057  6.9e-06  Homeodomain-like
Orthology group: MCL26191  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211550-TA
ATGTCTTCGGGGGTTCAACAGGATGAAGAGCGGACCGAGCTGTTGGGATCGTTAACAACACAACAACAACAGAGGACCAGCGCCAGGGTCATTAAGAAACTGAGACTGGAACCGCAAATTGACAAACGAGACGTCATAGAATGCGAGACTCCAAACAAAATAGACGATAAAGACCCTTTAAAGTTCCCAACGGTCAAACAACGTATGCCCAAGGCGTTGTGGTCGGCAGATGAGAAAAGCCTTTTCTTCGAAGCCTTGAACGAATACGGCAAGGACTTCGACTCAATCACCTCGTACATTTGCGCTAAAATGAAGAAGAAGGGCATGCTAGATGGAAATCTGAAGACTAAGACCCAAGTCAGCCATTTTTATTACAGGACGTGGCATAAGCTGTCCAAACACGTGCGTTTTGATGAAAATGTCAAAAAAGTAGCCCAGGAGCTTTACGCTCTGATAAATTATGGCGAGTTGAGAAGGAAGCTGGTGTCTGTTAATGAGAAGATCTGCGCTCGTTTGGGAGAAATGGTCCGTGGAGGATCCATAGCTGTGAGGACCAAGGGAAAGACGATCAGAGTCAGAACACCAATGTGCAGAGCTCTCAGACGACTCAATCAAATCGCGGAGCGCGCGTACGGCGCTCGTGTGTGCACCCGGGCTCAAGTGATATTGCGCGCTCGGGACGCGGCGTCTTGGACGCGCGTGCAGGCCGCCGCACACAACCCGCGGGCGGTCGTCGCGCTCACTCTCAGGACGAGGCTCGTGGCGCTGCTGTGGGCTCTCGAGAGACGCGTTCAGATAAGTCACGTCATGTTGATGGAAAATGGAAGTGATTTAAAAACCGAGGAATATTCTTCGCCATTTTGTGAGAACGGAGACAAGCTGGACGAACACCTCAACTTGGAAACAGACAGGACCATAATCCAAGAGCGTGCGCCCCCCGGTGTGTCGCTCCACGTCGGTCCTCGGCCTGAGGCCGACGTCCGTCTGCCGGAGCTTCGCCCTCGCGAGCCTCTGTCGAGTCAGAAGATCTGCTTTGCTTCATATCTGGAAAGAATGGGCGCTTTGAGAAGACAGGACGGAGATGTCAAAATTCGTACGCCGAAACGGCACAGAAAAGACAGCGTGTCCGATAAAGATAAGGAAAATGACAAGAAAATTAAAATAGAGGACATAGAAACAAACAAACTGATTAACATAGAAGAAACAGCCATAGACGGCATAGAGCTGATGGCGCATTACAAGAACAACCAGGAAGATGAGGAGAAACCCAGCACTGAGGAAGAGAAGGAGATGAGGTCGGAGGAGATCGTGGAGGAAGATTCAGAGAGGGACAAAGACGTGATAGAGAAAGACAATGACGTGTTGGACAGAGACAGAGACATGCGAGAAAGAGACAAAGATATGTTGGACAGAGAGAAAGAGATGCCAGAGAGAGACAAAGATATGTTAGAGAGAGAGAAAGACAGCTTCTCAGAAATGGAAGACGATGAGAAATATAACAAGAGTGACACAGACAATGAGAGCGACGGGCGAGAGAAGAAACAGATGAAATATAAGAATTTGAAGGTGAAGTTCCGTATACGTCCCAAGAAGAGAGGTGGCTCCGTCTACACCCTGGTGTTGGATCATGACAAGAATAAGGAGGAAGATGCCAAGATAGATGAAAAGGAGGAGAAGGAAGAAGAACAAAAGCCGGATATAGATGTAGACTTCGCGATGAGGCAGGTCAGGAAAGGATGGAGCGTTTGGGACGCGGGGGATCTTACCATAGGAGATTTGTATTTGATGTTCGGTTCCCGCTCGAGATTAGAATTGGACTACTGGTGGGCGGAGCCTACACCCCCACTACCTAAGACGAGGACGGAAGATAGGCTCGATAGGAGAGAGAAAGGGAACAAAACGCCTGACAAGGGAGACAGCTCAGTAGAAGACGAGAGGGAGAGGTGTGACAACGACATACTCTCACCCAAGAACACGTTCAGCCAAGACAGTAACGACGG
CCTGTCCGGCGACGAGAGGAAGAGTGAGCAGCTGTCGTCGCCGGACCACAAGTCCGGGACCTTGAAGCTGGTCAGTAAACTGATCAACCGACCCACGCACGTCTCCACCAACAACGGCTACTCGCTGGTTAGCGACAGACTGCGGCGACTACTGGCGCTGGCGGGGAACAGTCACTTTGGTGGAGGGGCGAGAGGGGTGCACGCGCGGAAACACGGAGCGACACATGTGCAGAAGCCCCCCGCGTGTAGATTGAGTCCGACGAGGTCTCCACCCGCCACGGGCCCCGCGCTCTTCAGACACCCCGCACCCATCGCTCCCAAGCCAGAGCCGGAGTCGTGTCAGTCGCCTATTAGTCTCAATGGTCTGCCTAAGTGGCGCCGCGGGCGACCTTCCACCGACAGACGGGTCGTGGTCCAACGTCTCCTGCCCCTCATGCCCAAACTACCACCACCAAACAATCTGATTCCCGTGAAGATGGTGTCCAACTCCCAGCCGGTCCAACCGAGGCTGGTCCCCAAGCCGCCGCCGTCAAGCTCCTCCGACCTGTCGATGTACTACGTGCTGAGTCAGTCCAACGGACAGTTCTTCTTCCACGACGGCGACAGACGGATCCCCATCCTCACCGACTCCGGGAACACCTCCCAGAACGCCGGTGACTATCACTCATACAGAGCATGTTCATATTTTGATACGACGAAAGCGTTTTTGGGCATTGAACCTGTTTATTTTGTCTTGTGCACATTTTAA

Protein sequence:

>DPOGS211550-PA
MSSGVQQDEERTELLGSLTTQQQQRTSARVIKKLRLEPQIDKRDVIECETPNKIDDKDPLKFPTVKQRMPKALWSADEKSLFFEALNEYGKDFDSITSYICAKMKKKGMLDGNLKTKTQVSHFYYRTWHKLSKHVRFDENVKKVAQELYALINYGELRRKLVSVNEKICARLGEMVRGGSIAVRTKGKTIRVRTPMCRALRRLNQIAERAYGARVCTRAQVILRARDAASWTRVQAAAHNPRAVVALTLRTRLVALLWALERRVQISHVMLMENGSDLKTEEYSSPFCENGDKLDEHLNLETDRTIIQERAPPGVSLHVGPRPEADVRLPELRPREPLSSQKICFASYLERMGALRRQDGDVKIRTPKRHRKDSVSDKDKENDKKIKIEDIETNKLINIEETAIDGIELMAHYKNNQEDEEKPSTEEEKEMRSEEIVEEDSERDKDVIEKDNDVLDRDRDMRERDKDMLDREKEMPERDKDMLEREKDSFSEMEDDEKYNKSDTDNESDGREKKQMKYKNLKVKFRIRPKKRGGSVYTLVLDHDKNKEEDAKIDEKEEKEEEQKPDIDVDFAMRQVRKGWSVWDAGDLTIGDLYLMFGSRSRLELDYWWAEPTPPLPKTRTEDRLDRREKGNKTPDKGDSSVEDERERCDNDILSPKNTFSQDSNDGLSGDERKSEQLSSPDHKSGTLKLVSKLINRPTHVSTNNGYSLVSDRLRRLLALAGNSHFGGGARGVHARKHGATHVQKPPACRLSPTRSPPATGPALFRHPAPIAPKPEPESCQSPISLNGLPKWRRGRPSTDRRVVVQRLLPLMPKLPPPNNLIPVKMVSNSQPVQPRLVPKPPPSSSSDLSMYYVLSQSNGQFFFHDGDRRIPILTDSGNTSQNAGDYHSYRACSYFDTTKAFLGIEPVYFVLCTF-
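
Length consistency check:

The transcript and protein lengths listed in this record can be cross-checked against each other. The sketch below assumes that the 2748 bp transcript is a complete coding sequence running from the start codon through a terminal stop codon, and that the trailing "-" in the protein sequence marks that stop rather than a residue. The function name expected_protein_length is hypothetical and is not part of any database tool.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumption: cds_bp counts only coding bases (no UTRs), so it must be
    a multiple of 3. If the sequence ends in a stop codon, that codon
    does not contribute a residue.
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


# Lengths taken from the record above: 2748 bp transcript, 915 aa protein.
print(expected_protein_length(2748))  # 916 codons minus the stop codon -> 915
```

Under these assumptions, 2748 / 3 = 916 codons, and dropping the stop codon gives the 915 aa reported for DPOGS211550-PA.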