Monarch geneset OGS2.0

DPOGS209613
TranscriptDPOGS209613-TA3366 bp
ProteinDPOGS209613-PA1121 aa
Genomic positionDPSCF300015 + 303832-312522
RNAseq coverage151x (Rank: top 53%)
Annotation
HeliconiusHMEL0164660.093.00% 
BombyxBGIBMGA006678-TA0.088.81% 
Drosophilatio-PA9e-17843.01% 
EBI UniRef50UniRef50_E0V8X90.052.02%Tiptop, putative n=5 Tax=Neoptera RepID=E0V8X9_PEDHC
NCBI RefSeqXP_001809602.10.055.29%PREDICTED: similar to tiptop [Tribolium castaneum]
NCBI nr blastpgi|2700142840.056.02%teashirt-like protein [Tribolium castaneum]
NCBI nr blastxgi|2700142840.057.45%teashirt-like protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL15545 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209613-TA
ATGATTGCGTATGGCCGCACTCCCAGACCACGGGCAATAATTCAAAAGCGTCACGATAACGCCACGCGTCATTTAAATTTGACTTTGGAGAGCGCTCAGAGGCAGGCACTAGCGGATGCACAGGCTTCCTTTGATCGGAATCGAGCGTGTTTATTCGATTGTGAGGAATCAACAAGTCCAGAAAGCGGAGTGAAGGAATTAGGAGGACGCGAACGGGAGGCGCGGGGAGAGGCAGGGGAGTCGCGCTCTCCATCGCCAGCATCCCGTGCCTCCCCCACACCCGAAGATCGGGATATAGAGCACAGCATACCAGCTACCCTCATACAGGATCCCAATGCTGAAAGGGAGAGTCCAAGATGTTTATCGCGGGAGTCGTCCGGCGCGCCGCGATGTCCCTCTAACGACTCGGTGTATTCGGGTCGGAGCGCGCCCAGCCTGCCCTTGCCAGCCGCCCTATCAGCAGCGTTACCGGCAGCCCTGCCCGCAGCGCTGATGCCACCCCACTCCGCTGCCGTCGCAGCCTATCTCGGAGCAGCAGCTGCGGCAGCCCAGCAACGATTACTCATGTCCTACCAGGAAGACATTACGGACGCTGAAAGAGCGGATGCCGTATTAGACTTCAGCACTAAACGAAGTGAATCCCCGGTCGACGATGAGGAGGATGACGCCGTTAATCTCACAAAGAATGAAAATGGTCCATTAGACTTATCTGTAGGTACTAGAAAAAGGGGGCCAGAGGATTCTCCATCTCCCGTCCCTAGTAGAAAAAGTTCTCGTACTTCCGACTTCAAAGCTTTATCGACACCTTGGTCTACACCGGTCGCGCCACATCTTCCTTATTTTGCTGCCGCCGTTGCTGCTGCAAGCTTATCACCAAAAGGTGGAGTTCCAGCTGATTGGAATGGTAAACTTAAACATGGAGCGCCTACACCAAGCGATGCTACTAAAGCACTGGAAAAAATGAGCGAATTGAGTAGATTAGGTGGAGAAGAACTTTTTAGATCTGTTCAAAGTGCAGCTTTGGGTGCAGGTCTTACACCAAATGCAGCTGCACGACATTCAGCTTGGCAATCTCATTGGCTGAATAAAGGAGCAGACCAGACAAAAGATGTCCTAAAATGTGTATGGTGCAAAAAGAGCTTCAATTCACTTGCTGATCTAACTGTTCACATGAAGGAAGCTAAGCATTGTGGAGTTAACGTTCCTGTACCCCCTTCAACTGGAGCTCCGATTCCGCCTTCACTACAACCACCATCAAGTTCGCCTTCCACGCCATCCCATAATTCGTCGTCCTCGAGTGGGTCGTCAAAACCAAATCATAATGATTTAAATATGCTTATAAAAGAAAACATGCCGATTCCTAGAAAATTAGTACGAGGTCAAGATGTTTGGCTAGGAAAGGGTGCAGAGCAAACTAGGCAAATTCTAAAATGCATGTGGTGTGCAGAAAGCTTTCGTTCCTTAGCTGAAATGACGAGTCATATGCAACGCACTCAGCATTATACTAATATTATATCACAGGAACAAATAATTTCCTGGAAATCCTCAGATGAAGCTAAGGGATCTAACTCTAGCACCCCGGGTACAAATAACGCTGTTCCTCCAACAACAGGAACAAGTAGCCATGTTAGCGCGGTATTAACTTGTAAGGTTTGCGACCAAGCGTTTAGTTCCTTAAAAGAGTTAAGCAATCATATGGTAAAGAATTCTCATTATAAAGAACATATTATGCGATCTATTACGGAGAGTGGTGGTAGAAGACGCCAGACACGCGAAAAACGAAAGAAATCGTTACCAGTAAGAAAATTACTTGAACTTGAACGAGCCCAACATGAGTTCAAAAATGGCGAAGGTAACGGTGTTCCCATGGGAAAACCGATCAGGGATTTCGGTGCTGGTAGCCGTATTACTTGCGAAAAATGTGGAGACAAAATAGAGACTGCTGTATTTGTAGAGCATATTCGTCAATGCATTGGATCACCAATGTCAAACACCCAAAGGAATTTTCTAAAAAGTGCTCTTCTTTCTAATAATATTATTCCACCTGATGTACCTGGCCATATCACCCCCACTAGTCGCGATGGTCGAAAAAGCATTAACGAGGAAATTCCATCTCCTGGTTCAGCTCATCACCGTTCCCCTTCTTCGGTTAATGATTCTTCTCCCAGTTCCAAAGATCATAATGCCAGCAACGACAAAAGTTCATCTCCATCGGTGCTTAATGCTATAGAACAATTAATAGAAAAAAGCTTTGATACACGCTCCCGACATTCAGTACCAGGTATACCAGGTGGAGCTTCACATGCTCCAATCGGGTCAAGTATCCTAAAAAGGTTAGGAATAGATGAAAGCGTAGATTATACCAAACCGTTAGTAGATCCTCAGACGATGAATATGCTTAGAAGTTACCACCATCAACAGGGATACGGTCGCCGTGAACGCAGCGGTAGTGAGTCTAGTTCTATGTCAGAAAGGGGTGGTAGTAGGGTTGAATCTCTAACCCCAGACAGGAAGCTGGATTCCTACCACATGACGCCTCGTACTACTCCTGATACTCGTGGCTCTCAAACTCCGGCATCTGAGGAACGGCTCACTGAGGTTAGGATAAAAAAAGAAGTCACAGATGAAGAAGAACGCGAAAACGGTGTAGACTTGAGTAGCCAACCAGTTAGAGTAAAAACTGAAGTTGAGGATGAGGAAGAGCAACAGAGACCAAGCAGTGCAGTTGACGAGGACGTAAAGCCAACTGTTCCAAAACGTGAAAGTGAGGGCCCAAGTCCAGCTGCTAGTCCTCGCAGTCCGGCCAGTGACCGATCAGCGCCAACGCCCGGTACTGACAGGAAACCGGCTTCCAGCCTAGGAGCTCTCTCTTCTATGTTTGATAATCTAACCGGCGGAGGTTCCTCAAACGAGCCAAGTTCTTCTCGTCGCGGAGGCAGTCACCCTTTAGCAGCTTTACAAAAACTTTGCGATAAAACGGAAACGAATTCATCTCGTGCTCCTGCCCCAGCCCCATCTCCCGCTGGTCCACCTAGCATCCTTACTTTTAGCTGGGCCTGCAACGATGCAGTAGTGACTGACTCTATAATGAAATGCGCCTTATGTGATACACCGTTTATATCAAAGGGCGCTTATCGGCATCATTTATCGAAGATGCATTTCGTTAAAGACGGCGCCCTGCCGGAGCCTGTGCCAGTGAAGGCTCCACCGGCGGCACCATCCCCAGGACCTCACAAGAGCAGCGGATCAAACGCGGCCTCACCTCAAGATCCGAGAAGTCCGTCTCAATCTTTCGATGAGAGTCCTCACTCTAAATTCCTCAAGTATACGGAACTGGCTAAACAATTATCCAGCAAGTACGTCTAA

Protein sequence:

>DPOGS209613-PA
MIAYGRTPRPRAIIQKRHDNATRHLNLTLESAQRQALADAQASFDRNRACLFDCEESTSPESGVKELGGREREARGEAGESRSPSPASRASPTPEDRDIEHSIPATLIQDPNAERESPRCLSRESSGAPRCPSNDSVYSGRSAPSLPLPAALSAALPAALPAALMPPHSAAVAAYLGAAAAAAQQRLLMSYQEDITDAERADAVLDFSTKRSESPVDDEEDDAVNLTKNENGPLDLSVGTRKRGPEDSPSPVPSRKSSRTSDFKALSTPWSTPVAPHLPYFAAAVAAASLSPKGGVPADWNGKLKHGAPTPSDATKALEKMSELSRLGGEELFRSVQSAALGAGLTPNAAARHSAWQSHWLNKGADQTKDVLKCVWCKKSFNSLADLTVHMKEAKHCGVNVPVPPSTGAPIPPSLQPPSSSPSTPSHNSSSSSGSSKPNHNDLNMLIKENMPIPRKLVRGQDVWLGKGAEQTRQILKCMWCAESFRSLAEMTSHMQRTQHYTNIISQEQIISWKSSDEAKGSNSSTPGTNNAVPPTTGTSSHVSAVLTCKVCDQAFSSLKELSNHMVKNSHYKEHIMRSITESGGRRRQTREKRKKSLPVRKLLELERAQHEFKNGEGNGVPMGKPIRDFGAGSRITCEKCGDKIETAVFVEHIRQCIGSPMSNTQRNFLKSALLSNNIIPPDVPGHITPTSRDGRKSINEEIPSPGSAHHRSPSSVNDSSPSSKDHNASNDKSSSPSVLNAIEQLIEKSFDTRSRHSVPGIPGGASHAPIGSSILKRLGIDESVDYTKPLVDPQTMNMLRSYHHQQGYGRRERSGSESSSMSERGGSRVESLTPDRKLDSYHMTPRTTPDTRGSQTPASEERLTEVRIKKEVTDEEERENGVDLSSQPVRVKTEVEDEEEQQRPSSAVDEDVKPTVPKRESEGPSPAASPRSPASDRSAPTPGTDRKPASSLGALSSMFDNLTGGGSSNEPSSSRRGGSHPLAALQKLCDKTETNSSRAPAPAPSPAGPPSILTFSWACNDAVVTDSIMKCALCDTPFISKGAYRHHLSKMHFVKDGALPEPVPVKAPPAAPSPGPHKSSGSNAASPQDPRSPSQSFDESPHSKFLKYTELAKQLSSKYV-