Monarch geneset OGS2.0

DPOGS207522
TranscriptDPOGS207522-TA2340 bp
ProteinDPOGS207522-PA779 aa
Genomic positionDPSCF300177 + 52647-58701
RNAseq coverage259x (Rank: top 41%)
Annotation
HeliconiusHMEL0055233e-13343.49% 
BombyxBGIBMGA001884-TA1e-8445.33% 
DrosophilaCG9305-PA3e-3834.78% 
EBI UniRef50UniRef50_D6WTH84e-4235.49%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WTH8_TRICA
NCBI RefSeqXP_974424.16e-4335.49%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910863551e-4135.49%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|3838529301e-4829.66%PREDICTED: uncharacterized protein LOC100874658 [Megachile rotundata]
Group
Gene OntologyGO:00055152.3e-07protein binding
KEGG pathway 
InterPro domain[458-525] IPR0090572.3e-07Homeodomain-like
Orthology groupMCL26664 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207522-TA
ATGTCTACACGGAGAGCAAGAATTAAAGCTGTTAATTTTTTGCCACTGCGACGAAAAAATACTGAAACATCTGAGAATAAAAATAAAGTCTCTGATTCAAAAGACAATGCTGAAAAAAATTCCAAAGATTCTCAAACTCCTCTGCCATCAGCCTCCGCAAATCAAGAAGAAAATACGAGTGTAATAAAAAATACACCGGAAAACCAAAACTCAAGTACGAACACGCCGTCAAAAGAACAGAGCTTCAATAAATCGCCGGCCACCAACAGTACCACACCCGGCGAAACTCCTCATAACGCAAACAATTCAACGTCATTCAAAGATAAAGCACAATCAATTAATTCAATAAAAGATACCAACATTTTTGTATCCCCTTTGAGACGAAATAGTCCAAAAAAGACATTCGCATCTCCAATAGTCCCTTCCCCTAAAGTCATAAGAAATGTGGAACAAAAATTAACCACAACACTCATTGGTAGGAAGACTCCTGTAGCACAAAAAATCAATGAAAGTAATGAACTCCAAATATCAGGAAGTGTAATTAATATCACTCACAATATTACCAATGACAGCCGAAGGAATGAACAAGCCGGACCTGGCACACCGACCACAGCAAATGATGAAGCGATGGATGGCATAATTCCCTTGCAATCAGCATCAACTGCACCTAAGCCAATAGAGTTGTTGAAGAGTGAAATTATATCGGAAAATGCTGAGGTTCTTTTTGATCCAATCGTCCCTCTCCCCTCACCGAGTAAGGTGAGACCTAAATTGCGTCCAGCTCCAAGATTGGGGCCTCATCGACGGAACAGTGTACAGGGTAGTGCAAGTGAATCCGAGGATGAAAGTCGAAGGAGTCTTCTGTCTGGAGGCAATACACCAGCTCATCAAAGACAAAGGCATGACTCTCAAATGTCTCATAACACACTTGTGTCCTTACCCAACAGGGATGTCAGTCGCGTGAGAAATGATTCTGTGTGCTCAGGTGTTAGTGTCATTACGAAACCACAGCCGACGTCTCCAGTTAAAGAAAAACATACAAAGAGATCCCGTAATGAACGTCGTTTGAACGCCATGAGACGTCGAGAAAACGTTAAACGTGACTCCCTGACGATGTACGACCTCATCTTCTACAATCCAACCTCGAACCCTATTGTTCCGGACGACGACGAAATAAATGCTAAGGAAGCGAACGAGAAGGAGATTCTGGAGAGTAACAAAAAGGAAGAGACCAAAACGGATAACCCGAAGGAGAACGCCGCGCCCGTGCCGCAGATAAAACTTGGACCTAACGGGACAATAATGATCGACGAGGAAAGCTTGGTCATAAAACAGACTGAGTCCGACCGTAAAGTGTCTTCAGTGGTGCACGAGGGGTCCTGGTCAAAGAGCAGCGGGTATAAGGCGAAACATCTCCGCTCTAGAGACTGGAGCTCCGCCGAGACTGTCAGGTTCTATAGAGCCTTGGCTGTTATTGGAACAGATTTCTCGTTGATGGCACCTCTATTCCCTGATAGGACGAGACGAGAACTTAAATTTAAGTTCAAGAAAGAAGAAAGGATGAACGGCGCCCAAGTTGATAAGGCGTTGCGTTCGACCATCGAATGGGATGTGCTTAGACTCAAAGAAGAGTTCAAAGAAGAAAGAGCCCTGGCCGCTAAACAAGCGGAGAGAGAAAGACAGTCACTGATAGAAGAGAAGAAGTTGGAGAGAGAAAGACTGAAGGCTGCCAGAGAAACACGTGTTAGATCTAGCCGAGGTTCAAAAGCTCTGTCATCAAACATGTTGCCGGGGCTGAATAAAGTTCACAACGAAGTGTTCACAGCAGATGGTATTATAGAAAGAGCTAATCGGAGACCACATAAAAACAAACATGGAACAATCCAGTCATCGGATAAAAATAAAGATAATCAATCAAATGTAAGCCAGAACACACAAAATCAAGACGTAGCTGTACTAACAAAATTACCACCAATGCAAACTAAAACGCCCGAAGCCGTCAATACTTTGCCACCTATTCCACCGAATATTGAAACCGGCTCTTTGGTAGTTTTAACGGTCGATGACCCCTCTTCACCCGCCAAAAAAATGCTACAGACATACATCGCTCGTGGCCCGGGTCAGTTGACGCCAGTCGCATTACCCACAACCTTCTTAAACTCCGTGGTAGGGTATATGAAGAAAAACAAAGGTCAAGGGTCACCACAAATCATGTCGCCGGGCAGCGCTGCAAGCTACGACAGCAGATCGAGCGGAACTCCCGGAGTCCCAAACATTTCAGTGTTGCCAAGTCCAGCAAAAAGACAAAGACACAGCTCATTCACCATAACGCAGCTTTGA

Protein sequence:

>DPOGS207522-PA
MSTRRARIKAVNFLPLRRKNTETSENKNKVSDSKDNAEKNSKDSQTPLPSASANQEENTSVIKNTPENQNSSTNTPSKEQSFNKSPATNSTTPGETPHNANNSTSFKDKAQSINSIKDTNIFVSPLRRNSPKKTFASPIVPSPKVIRNVEQKLTTTLIGRKTPVAQKINESNELQISGSVINITHNITNDSRRNEQAGPGTPTTANDEAMDGIIPLQSASTAPKPIELLKSEIISENAEVLFDPIVPLPSPSKVRPKLRPAPRLGPHRRNSVQGSASESEDESRRSLLSGGNTPAHQRQRHDSQMSHNTLVSLPNRDVSRVRNDSVCSGVSVITKPQPTSPVKEKHTKRSRNERRLNAMRRRENVKRDSLTMYDLIFYNPTSNPIVPDDDEINAKEANEKEILESNKKEETKTDNPKENAAPVPQIKLGPNGTIMIDEESLVIKQTESDRKVSSVVHEGSWSKSSGYKAKHLRSRDWSSAETVRFYRALAVIGTDFSLMAPLFPDRTRRELKFKFKKEERMNGAQVDKALRSTIEWDVLRLKEEFKEERALAAKQAERERQSLIEEKKLERERLKAARETRVRSSRGSKALSSNMLPGLNKVHNEVFTADGIIERANRRPHKNKHGTIQSSDKNKDNQSNVSQNTQNQDVAVLTKLPPMQTKTPEAVNTLPPIPPNIETGSLVVLTVDDPSSPAKKMLQTYIARGPGQLTPVALPTTFLNSVVGYMKKNKGQGSPQIMSPGSAASYDSRSSGTPGVPNISVLPSPAKRQRHSSFTITQL-