Monarch geneset OGS2.0

DPOGS211235
Transcript: DPOGS211235-TA, 1941 bp
Protein: DPOGS211235-PA, 646 aa
Genomic position: DPSCF300385 - 18457-21260
RNAseq coverage: 58x (Rank: top 69%)
Annotation
Heliconius: HMEL016484 | 5e-132 | 69.92%
Bombyx: BGIBMGA005171-TA | 2e-68 | 58.57%
Drosophila: CG1113-PA | 3e-06 | 33.33%
EBI UniRef50: UniRef50_E3X1Y7 | 2e-40 | 34.46% | Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3X1Y7_ANODA
NCBI RefSeq: XP_001807138.1 | 1e-58 | 38.65% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|189239175 | 2e-57 | 38.65% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|189239175 | 7e-68 | 33.72% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [265-581] IPR019579 | 5.4e-34 | Uncharacterised protein family UPF0564
Orthology group: MCL21877 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211235-TA
ATGCCTAAAACGGCGTATGAACGAAGTGAAAAGGAAGGTTTAACTGAACAGTGCTCAATTGAATCATCATCTTGTTCAGTTTCGGCTGCTGATGTGGATGCCAACAAACTGAAAGACTTTTACCGCAGCATTCCTGATTATAGTGATATAAACCATCTATCAGAAAGTGAATTTTATTCAACCTTGAAAATCCTTAGAGAAAAGAAGAAATTAATGCTTGGTCTTGCCGTAGAACATATAGATAATTGCAAAGTGAATAGTGATAAACTATTTGAAGATATAGATAGAGAAATTAATATATCACTTTCTTGTACTACTAAAAATAATCCAATCCATCGAAGAAAAAACTCAGTTTCTCAAGTAAATAAAACATCGGATTCTAAAGAGGAGAGTGTGTTTAAATCTCAAAAAGATTCTTGTAAAAGACCAACGTTATGTAACAAAAATTCCAGTCTCGATGTCAGTGCTAAGGAAGTGGATAAAAATTTACAAATTGGTACAACAAAAAATAACTTATTAATGGTTAAAGGAGGCCCTAAGTTGGATAGACCAAAACGAAATCATTCTGCCTGCTCAATTTCTTGGCATGATGATAAGGTCGAGCCAAAAAGTGAGGTCGATGATAAGTTCAAAAAGTTTTTTCATGATAATGACCATAGTCCAACTGTTAATGATGATAGTATATATAAAACCCAAAGTATGCCTTCAAGTCCACTAAGAACAAGACAAACAGGATCATGTTTACGACCTACAAAAATTCTTACTGTACCCCAGCCGTTTAAAATGACGCAAAGAAAACGGTTTCGTGCGAGGCCAGTTCCAATTGAATCTCGTATTCCACTCTATGACAAGATCCTTCAGGATCAGGCAATGAGACGAGCAATTACAAAAATAAACAGTGAAGCAGAACTCCGGGCCCAAATGAAACCCTTCAGTTTCACTAAAAGGGAAGAGAATGGAATAAGTGGAACATGTGATCGTGCGATAAATATTTTACCTAAAAGCAAAAGGAAAAAACGATTTAAAGCTCGACCCGTTCCTAAGAATCTGTTTTCAAATTATTTTTATGATAAAGTAAAAGATGAAAACTTTTTTCGGTCATTAAATAAAAGAATTCGTGCAGAGGAAATGTTAAGACATTCCAGTTTCCCAGGTAGCATAGGCGTCAGAGAACGTAGCCGATTGTCTACTCCCGCTGTACATAGCGACTTACCTACAGATCCTTCTCCAGTAGTTCCATCTATATCTTCTTCTGAGCCAAATAGGTCGTCCAGTCGAACAAAAGAATTAAGAAGAGCAAGGCGTATTAAAGAAGACTTTGTTACAACTAGTCCTCGACCATTTCGTTTTAGCACTGCGGACAGAGCTTCTAAAAAGATTAATGATATTTCTAAAAAAATATGTCAAGATAGTAAAAGTTGCGACAGTATTGAAAGAAATGAAACACCGTTTGATAAAACTACAGCTTACACAGCGTTGGATTTAAAGGCCATTGTTGGAGGAAGATCTAATTTAGCTGCTTTGTTAAGAGCAGAAGCTGTGAGACAGAAATTTGAACTCGAATCTGCCCAACGATTAGCAGAACAACGAAGACGAGTGGAAAACAGACAAAGAGACAGATTACTTCGTTCCAAGCCAGCCTGGCACCTTGTTAAGAATAATTACGAAGAGGATATAGCTATGAGGCTACAAACTCGACGGGATGAAGAGAGATTAAGAAGGGAAGAATTTCTACATGAAATGGAACTAATGTACGGACGAGTTCAGCAACAACCAATGTTATTTGAAAGATACTATGCTCCTAGGTCTGGTGCGTCCACAATAGATTTTGTAAAATTGTCACCAAGAAAAACTATTAAGAAAAAACATGTTCCTAGAAAAAGTGGATCACAGTTATTGGCCAGTCCTTGTTTGTCGTTTGAAGACCAAGGGACTTTATAA

Protein sequence:

>DPOGS211235-PA
MPKTAYERSEKEGLTEQCSIESSSCSVSAADVDANKLKDFYRSIPDYSDINHLSESEFYSTLKILREKKKLMLGLAVEHIDNCKVNSDKLFEDIDREINISLSCTTKNNPIHRRKNSVSQVNKTSDSKEESVFKSQKDSCKRPTLCNKNSSLDVSAKEVDKNLQIGTTKNNLLMVKGGPKLDRPKRNHSACSISWHDDKVEPKSEVDDKFKKFFHDNDHSPTVNDDSIYKTQSMPSSPLRTRQTGSCLRPTKILTVPQPFKMTQRKRFRARPVPIESRIPLYDKILQDQAMRRAITKINSEAELRAQMKPFSFTKREENGISGTCDRAINILPKSKRKKRFKARPVPKNLFSNYFYDKVKDENFFRSLNKRIRAEEMLRHSSFPGSIGVRERSRLSTPAVHSDLPTDPSPVVPSISSSEPNRSSSRTKELRRARRIKEDFVTTSPRPFRFSTADRASKKINDISKKICQDSKSCDSIERNETPFDKTTAYTALDLKAIVGGRSNLAALLRAEAVRQKFELESAQRLAEQRRRVENRQRDRLLRSKPAWHLVKNNYEEDIAMRLQTRRDEERLRREEFLHEMELMYGRVQQQPMLFERYYAPRSGASTIDFVKLSPRKTIKKKHVPRKSGSQLLASPCLSFEDQGTL-