Monarch geneset OGS2.0

DPOGS206173
Transcript: DPOGS206173-TA, 3768 bp
Protein: DPOGS206173-PA, 1255 aa
Genomic position: DPSCF300396 + 127045-138134
RNAseq coverage: 99x (Rank: top 61%)
Annotation
Heliconius: HMEL004970, 0.0, 71.51%
Bombyx: BGIBMGA012100-TA, 0.0, 64.20%
Drosophila: CG14215-PA, 3e-53, 24.03%
EBI UniRef50: UniRef50_UPI0000D5634E, 1e-147, 32.41%, UPI0000D5634E related cluster n=1 Tax=unknown RepID=UPI0000D5634E
NCBI RefSeq: XP_972713.1, 2e-148, 32.41%, PREDICTED: similar to suppressor of yeast mitotic catastrophe [Tribolium castaneum]
NCBI nr blastp: gi|91083789, 5e-147, 32.41%, PREDICTED: similar to suppressor of yeast mitotic catastrophe [Tribolium castaneum]
NCBI nr blastx: gi|307177041, 5e-92, 28.10%, AT-hook-containing transcription factor 1 [Camponotus floridanus]
Group
KEGG pathway: (none listed)
Orthology group: MCL17869 Patchy

Nucleotide sequence:

>DPOGS206173-TA
ATGCAAAAGCTACAAGAAAGTGTGTTTTCTATTATAAAAACTACAAATCTATCATCTGCTGTATTTAGTTTTCTACAACCAACTGAAGAATATTCAGACAAAACACCGCTGGGCGGAATTTTGCGGGATACTAAATATGGATGGCTTGCCCTTGGTCCTAAATTTTGTGTTGTCGATTTACGGAGTGGATTGAAAATTGCTGCTCGTACTTTTGGAAATGGTGTGAGCAACAAGATTACTGTCACCAGTGTTGTGGAATTACCCACACCTCTCACTGATAACTCACAACAGCTCATTATAAGCCTTGAGTGTGATGATGCCACTGGATTGATTTGCATATTCCACATTAATGGATCTCAATTGCTGCGTTGCATTCAGACGGATGGTGTCATAACAGAACTGACAGTTTGTGATGCAATACCAAATGGACCACTTATGTGCTTTGATGGCCTCATCATGGCTGGCACTCATAGAGGAGAGATATTTATATTTGATCTCAATAAAGCCTCTTTGATCCAGGCACTCAAAGACTTATCTCAAGGCTATGAAGAGCAAGTTAGAGGTGAACAGAATGTGGCAAATTTGATATTCCTGCCTCTGAAAGCTGTACATAAAATTGAAAGCCAAAGAGATTTGGCCGTTGAGAACGATGATCATTTGGCTTTGCTACTGAATGAGAATTCATTCCATGAAAATCAGTATGTATTTTGTAACCCAGATGGCACTGTTAGAATGAAGGCGAAACAGAGTCACATAAGGACAACCGTTGTTCAATACATTCCGCAATTAGGAAGCTTGGCCGTGGGTTTTAATTTTGGTGCATTCCAATTATGGAACCTTCTGAATTTGGAATTGGAATTCACATCTCAGGTCAATGTTGAATGTCTGCCAGTCACTCATTTTGGTTTCCAGGAACCCTGTGATGATCCGAGAGCATTTTGTTATTTGTGGGTAGTGTTTTCAGTGATAGATCGATTTGAGGAGGAGGAATTTCCTTTGGCTGTGATGTATTCTCTGACTTATCAGGGAAAAAGAATGCTGGCTGAGAACAAATATTTATATCAGGAATTTTCTATGGCGACAATTCGCTTCCAAATAGAGTTGGGTGCCATTGAAGATGTGTCAGTGTATATCGGTGGCAGATGCATCTCCTGTCACACTTATTCCATCAGTTCTGTTTTAGGAGAAGAAGGGGAGGACACGATGCTCAACTTATGCCAGCTAGTGTGGGAATGTTGGGGCGAGAACAACACAGTGACTCAACACTGTATGTTGCTGTTTGATCTTGATCAGTGGTACAAAGATCAAATGCCCGCCACATATCGATTAGAAAACAATGCGTTTATGAGCACCGCGCGGTGTCCTGAGATCAGCCGCGGGTCATGCGTCACGTTAGACTTGAGGTTGGATTCAGAATCGGTCACGCCATACTCACACGCCACGCGACTCGAAGAACATTTCTTTCCTAATTCCTTGCAATACAACTGCGTGGCTCTCAGCACGTGGTCGACGGCGGTGGTTCAGACGGCGGGCGTCCAGCGTCAGCTGGTCCGCGGGATGAGCGCGGCGGGCCCGGCGTGTCTGCTCGCCCCGGCCAGATTGTACGCCGCGTGTAAAACCGCCGGTCTGACACCGCTCTACGCTCCCTCTGGGGACACACCGGAGGATCAACGTCGTTTTCTTCTGTCTGTGGCGCTAGAGGCGAGGCTGTCGTCGTTCCTGAAACGATGTGCTCACGACTGGGCTACGGGGACCCACAGTGGTGTCGGATGTACATTGCCTTTCTTGGTGGATTGGTCGTGGAGTAGAGCTATTGAATTGAAAGAAAACGCTAAAGAACTGACCGCGCCGCTGTTTACTTCCTCTATGATGCCAGACAGGAATGTTATAAGATGTCTGGAGCATTGTGTGCAACAACTGAGTCAACTGACTGGTCTGTTAGACGCCATCCTCACTAAATGTTGTAATCTAGTTGTACCAGACGCGTTAAGCGAAATGGA
AGAAAAATATAAAGGCATAGGCACAGTATCCTTATACTTCCAAGTTGTACAATGGTTTGTTAGGGTCGGACTGTTGCCTGAGAAGAGTTCGGACAGACATTCGCACACGTTACCCTACCCGGTGCATCAGCTATGTGGGATATATAACAAGCGTCGCATCAAGCTCAACCGTCTTCAAGACAAGTCGGACGACGAGTCGAGCAACGAGTCGTGTTCGCTTCTGTACATAGATCAACTCATTGAACACGAGTTCGGGGGTGATAGAGTCCATCAACTATGGATGGTATGTGGATCGAGCGGCGGTCTCTATCCCCCCCCTTCTCTGTTTTCCCTCTTGAGACTATACTTGCTGCCGGATGTTCCCGAGGAACACAAGCACTCTCTCCTACTCTACTTACTCCTGGATTATTCTATGATCTATGATGATATGCGCCACGAGTCCGTGATACGTCGGTTGATGCAGTTCCCAACTATGTTCGGTCTCAGTAACACGGCGATTAAAGCCACCCAAGCCTTCTGGCATCTCGACCACAGAGACTTTGATTTCGCTCTCGACCAACTTCAATGTCTAACTGGCAACACTCTCTCCGACTGGCAACATCACGTAGTCCTGTCTTCGTTACTGGCGCAGAAGAAAACTCAATCAGCGTTACAATACTTACACGTGAGAAAACCGGCTCCGATACACGTTAGTGACAACAATGATTATGACAAACTAGACGATTGGCAAACCTCCTGCAACCTGTACCTGGCCCGGGGCCTGGTGTTCGAAGCTTTGGATGTTATAAGGATGTGCGTAGAAAATGCCGGCTCCAGCGACGATAAGACGCAATTGTTGAACTATTTCTATAAAGGCTGTAGGAACAGCGGTCAACTGGCTAAGGTGCTGCAAGTAACGTTGTTGCCGTTTGAGGAGGAAGTGTTCATCAGATATCTCAAGGAGTGTAACGAATCCCACACATCAGACATCCTCGTTATGTACTACTTGCAACAGGCGAGGTACCTAGAAGCGGAACAGTATAACAGTAAATTGAAGACTCGTCACGAACAGTCTGACCGAGGGTCGGCCCGCGACGCGCTGGTGGCGACGCTGTGTAGAGACCTGCCGGATGTCACAGGAGACGTACTGAGATGTGCTATGAACGAAGCCGAGCCCAGGATTTTCAAACCGAAGCCTATGTCTGTATACGTTCAAGCAAATTCGCCGAAAAACACCTTCACATACAAGTCGTCGTTTATACAGGATACTATTGAAAATGCCAGTGAAACGTGGATAAACAAACCCAAGACCAGGAAGGGCATTAAGCGAGCGCTCAACATAGAAGACACTCCATTCATTTGCACTCCCAAGGCATATAAAAGCAAGAGCGTCCTACAAGAGAAGAGTGACGGTACTCCGGCGAAGCGAGCCAAATTGGATCTCAATAGCTCGGGAAGAACGCCGAAAATGGCTCACAGGGCCAACGACAGCGTCAGCTGCCAAGTGGCCCGTCTACTTCATATGCCAGACTTGGAGAGCCCTCACAACTACCCCGCCAGGACGGACACGCCACAAAGCATACTCAAGTCCCGTCCGTCATCCCGTCGCGGTAACACGCCGTACGACAACGACCACGACCACAGCGACCACAGTGACCACGACCACGATGAACACAGGCACCTAAGGTTCACAATACCGACGTCATCCGAGGCGGGCTCTACGCCGTCACCGGTAGCATTGCCCTTGTACGCGGAAAGTGAAACAGATAATGTATGTATTTCGGAATAA

Protein sequence:

>DPOGS206173-PA
MQKLQESVFSIIKTTNLSSAVFSFLQPTEEYSDKTPLGGILRDTKYGWLALGPKFCVVDLRSGLKIAARTFGNGVSNKITVTSVVELPTPLTDNSQQLIISLECDDATGLICIFHINGSQLLRCIQTDGVITELTVCDAIPNGPLMCFDGLIMAGTHRGEIFIFDLNKASLIQALKDLSQGYEEQVRGEQNVANLIFLPLKAVHKIESQRDLAVENDDHLALLLNENSFHENQYVFCNPDGTVRMKAKQSHIRTTVVQYIPQLGSLAVGFNFGAFQLWNLLNLELEFTSQVNVECLPVTHFGFQEPCDDPRAFCYLWVVFSVIDRFEEEEFPLAVMYSLTYQGKRMLAENKYLYQEFSMATIRFQIELGAIEDVSVYIGGRCISCHTYSISSVLGEEGEDTMLNLCQLVWECWGENNTVTQHCMLLFDLDQWYKDQMPATYRLENNAFMSTARCPEISRGSCVTLDLRLDSESVTPYSHATRLEEHFFPNSLQYNCVALSTWSTAVVQTAGVQRQLVRGMSAAGPACLLAPARLYAACKTAGLTPLYAPSGDTPEDQRRFLLSVALEARLSSFLKRCAHDWATGTHSGVGCTLPFLVDWSWSRAIELKENAKELTAPLFTSSMMPDRNVIRCLEHCVQQLSQLTGLLDAILTKCCNLVVPDALSEMEEKYKGIGTVSLYFQVVQWFVRVGLLPEKSSDRHSHTLPYPVHQLCGIYNKRRIKLNRLQDKSDDESSNESCSLLYIDQLIEHEFGGDRVHQLWMVCGSSGGLYPPPSLFSLLRLYLLPDVPEEHKHSLLLYLLLDYSMIYDDMRHESVIRRLMQFPTMFGLSNTAIKATQAFWHLDHRDFDFALDQLQCLTGNTLSDWQHHVVLSSLLAQKKTQSALQYLHVRKPAPIHVSDNNDYDKLDDWQTSCNLYLARGLVFEALDVIRMCVENAGSSDDKTQLLNYFYKGCRNSGQLAKVLQVTLLPFEEEVFIRYLKECNESHTSDILVMYYLQQARYLEAEQYNSKLKTRHEQSDRGSARDALVATLCRDLPDVTGDVLRCAMNEAEPRIFKPKPMSVYVQANSPKNTFTYKSSFIQDTIENASETWINKPKTRKGIKRALNIEDTPFICTPKAYKSKSVLQEKSDGTPAKRAKLDLNSSGRTPKMAHRANDSVSCQVARLLHMPDLESPHNYPARTDTPQSILKSRPSSRRGNTPYDNDHDHSDHSDHDHDEHRHLRFTIPTSSEAGSTPSPVALPLYAESETDNVCISE-
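
The lengths quoted in this record are mutually consistent if the transcript is a coding sequence with no untranslated regions: 1255 codons plus one stop codon give 3 x 1256 = 3768 bp. A minimal Python sketch of that check follows. The function names are hypothetical, the toy records are not real data, and treating the trailing "-" on the protein sequence as a stop marker is an assumption.

```python
def parse_fasta(text):
    """Return {header: sequence} for a FASTA-formatted string."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            # Sequence lines may be wrapped; join them back together.
            records[header] += line
    return records


def cds_length_for_protein(n_residues):
    """Length in bp of a complete CDS: one codon per residue plus a stop codon."""
    return 3 * (n_residues + 1)


def is_consistent(transcript, protein):
    """Check that a transcript looks like a complete CDS for the given protein.

    Assumption: a trailing '-' on the protein is a stop marker, not a residue.
    Only length, start codon, and stop codon are checked; the sequence is not
    translated.
    """
    residues = protein.rstrip("-")
    stop_codons = {"TAA", "TAG", "TGA"}
    return (
        len(transcript) == cds_length_for_protein(len(residues))
        and transcript.startswith("ATG")
        and transcript[-3:] in stop_codons
    )


# Lengths quoted in the record: 1255 aa and 3768 bp.
print(cds_length_for_protein(1255))

# Toy records (hypothetical, not real data) to show parsing and the check.
toy = """>toy-TA
ATGCAA
AAGTAA
>toy-PA
MQK-
"""
records = parse_fasta(toy)
print(is_consistent(records["toy-TA"], records["toy-PA"]))
```

To apply the same check to this record, pass the DPOGS206173-TA and DPOGS206173-PA sequences through parse_fasta and is_consistent in place of the toy records.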