Monarch geneset OGS2.0

DPOGS204341
TranscriptDPOGS204341-TA4680 bp
ProteinDPOGS204341-PA1559 aa
Genomic positionDPSCF300142 + 39249-59294
RNAseq coverage430x (Rank: top 28%)
Annotation
HeliconiusHMEL0211730.061.31% 
BombyxBGIBMGA007245-TA7e-16848.65% 
Drosophilasnama-PA7e-10743.14% 
EBI UniRef50UniRef50_UPI0002246B681e-12966.15%UPI0002246B68 related cluster n=1 Tax=unknown RepID=UPI0002246B68
NCBI RefSeqXP_001603499.13e-13066.15%PREDICTED: similar to GA16823-PA [Nasonia vitripennis]
NCBI nr blastpgi|3838601609e-13064.91%PREDICTED: uncharacterized protein LOC100877553 [Megachile rotundata]
NCBI nr blastxgi|2700123590.035.16%hypothetical protein TcasGA2_TC006501 [Tribolium castaneum]
Group
Gene OntologyGO:00056349.3e-28nucleus
GO:00082709.3e-28zinc ion binding
GO:00036761.9e-05nucleic acid binding
KEGG pathway 
InterPro domain[3-76] IPR0148919.3e-28DWNN domain
[239-312] IPR0130831.7e-15Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL25327 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204341-TA
ATGTCTGTGCACTACAAATTCAAAAGCGCATTGGATTATGACACCGTTACTTTTGATGGCTTACATATATCTGTAGGCGATTTGAAAGCGGCTATTTCACAGCAGAAGCGAATAGGAAAAACTTCGGATTTTGATCTTCAAATTACTAATGCTCAGACAAAAGAAGTCTATGTGGATGACAACACATTAATTCCAAAAAATACTTCATTGCTCGTGGCCAGAGTACCACTTGCTCAGCAGCCAAAAAAGCAGTGGGAAGGTGCAAGCAGTAGTCAATCAAATCCTCTAAAAGATGTTACCCTCAATAAGGGATTGGCGGACTTGTCTCGCATGGAGGGTAGTGAACAGGATAAAATTAATGCTATGATATCACAGTCCACATTTGATTATGATCCAAGCAATTATCAAAAAATCAGAGGGCAGAATCAACGAGGAGCTGTACCATCTAATTATATTTGTTACAAGTGTCAGAAACGTGGTCATTGGATCAAAGACTGCCCAGCAGCTCTTTCCGGTGATCCAGTAGAGATAAGAAGAAGTACAGGTATTCCCCGTAGTTTTATGGTGCCAGTTGATGGGCCTAAGGCACCAGGAGCCATGATGACCCCAAGTGGAACTTTTGCTGTGCCAGCAGTGGATCATGAGGCATATTTAGCATCAGAGAGTGCTGGAGGAGCTGATACGCCAACAAATGCTCCCGTAGCTCCAGAACCAACAATTCCTGATGAATTAATTTGCAGTCTCTGTAGAGATCTGCTTACTGATGCTGTCATGATACCATGCTGTGGAAATTCCTTTTGTGATGAATGCATAAGAGGAGCATTGTTGGAATCTGAAGACCATGAATGTCCTGACTGTCGCGAGAAAGAAATTGCGCCTACTACATTGATTCCTAATAGATTTTTAAGAAATTCAGTATCGTCATTTCGTAATCAAACCGGCTACAGCCGACGAGCGCCACACCGACCCTCGGCTGCCCCGCCAGTTATTGAACCACCTCCAGTTGCACCGGTCCAACCTATAAATATAAACGGCCCGCCCCAACCAGCGCCCCTCAATAACGATTCGCGCGTGTCCGGGTGTGGCGGCGATAAAGAGACGAAGGGAAGCAAGGCCGAGGAGTCGGACGGATCAGCCGATGATAATATCACCGTGACCGTGCCACCCGCACACGCGCATCACTCTACGAACGAACCACACGGGCCTGTCGGGCCTCAGACTTCTCACGGACCCCACGGGTCTCGAGGGTCTCACGGGTCCAGCGGCGCTCACGGGTCTAAAAGGCCCCGCGGTCCCCAAGGGCCTTACGGTTCCCACTCGTCCCGTCCATCTCACTCGAACCATCGATCTCGCTACGGTCCTCCTGTTAAACCACCTGAGATGGACACGCCGCCGTTACATATGCCTCGAATCGATGAACAACGAGCGAGCACACCGACAATAGACGAACGCAGGGATGTTATTTCTTCTCAGGTGATTTATAATCGCGGTCCACCGCCGTACTTGCCCGCCGTGCCGCCTCCGTCTCAGCATTACCCTAATCCTGCTTTCGGTCGTCGCTACCCTCAAGACGCGTACCAGCCACCACCTCCAGGCATTAAGGACTCCCCCAACGGGTCTCCGTCTCCTCGGCGCCGTCACCGCTCCCCCCTCCGTTACCGCAGCCCCCTCCGCTCCCCGGCCCGCTCCCCTCCGCGCTCCCCCCGCACCCCGCTCCGCTCCCCTCACCGCTCGATGCCGCCACACTCGCCGGTGAGAACGCCGCGACGATCCGCCGCCAGGTCGCCGCCGAGGACTCACTCGCCTCACAGATTCGCCGGCGTTTACGATGAACTGTTGTCGCCACCACGTCGTAGTGTAGAGCCGCCATTCGGAGCCAGGTACGCCTCTCGAAACAATATTCCGCCGCCTGGATATCCTCCTATCACTAGTAAACCAATGCCTTTAATGGCAAATTTAATCTTACCCAACGAGCCCCCTCCGGGTTATCGCAGTCGTTACGATGCGCCACCATTTGAAGATATTCCACCTGGAGTTGAACCGACTGTTCCAGGATTTGAGCCATCTCCATTTGAGAAACCTATTTTTGGACCATCAGGTACTATGGAAAGAGATAGATTACCACCATCTAATTATAGAGATCCCTATAGAGCACCGTACGTCGAGGGTCCGGCTGGATATAGGGATAACCAAATGCCACCACTTCAAAGTGATATACCAGGTCCCATAGCGGAGCATTCCCAACGTTATAGAGATAACTTTAGAGCTACAACACACCCATACCGTGATCAAGGTCCAGTATCTTATAGAGATGGACAACCATACCGAGAAACTGGATCTCAGGCTTCATTCCGCGATAATAACTTTAGAAATGGGCCACCCTCTCTTTTTAGAGACAACAATTTTAGAAACAATCCATATAGGGATCCGAATTTCCGTGAAAATGTCCCACATAATTTCCGTGATGGATCTGCGCCTTTTAGACCGCCTAGCGTGGATCCCCGAGAGCCAAATTCTGTGGTGTATCCTGATCCTAATTACAGAGAAGGTTTTCGTGATGAAAATAACACTAGTAGGGATGTGCGACCTGGGTATCGTGGTAGTGTCCGCAACAGAGGTGGTTCTAGGCGTAATCCAAATGACCATGAAAGACATCGTGAGAGAGACGGACGAGAAAGAGATCGTTTCAATGAAAATCGTGAAGCTCCCGATCGTACAGAAAGACCACGTGATGTTGATAAGAGATCTGGTAATAGAGAAAACCGTGAAGAACGCTCTAGAGAGTATGATAGAAATCGAGATTATGAAAAAGAAAGAGATCGTGGTCACGAGAAACAATCCCCAGATAGAAAACAACGTGTCTCACCCAAACGTAGTCGAGATACCCGTGAAAGAAAACAAAGTGAAACGAGGGCCCGTTCTCGTGATCGGGAAAGTAGGAAAGAAAAGAAAGAAGAACGTACACGTGATAAATCATCAGCGGAAAGAAATAGAGATCATAAAGAGAAGGATAAAAAGGTTAAAGATAGGAAAAAGAAAAAGAGAGAAAAAGAAAAAGAAGTTGAGAAGAAAAAAAAGCGCGATAAAAAGGACAAAAAAGATAAAGATGTAATAAAGAAAGAAGAAGACGAACAGGAAATATCGGATTCTAAACAAGATGCTAATGCAGAGCCTAAAGAAAATACTGATCAGACTCTTGAGCTCAAAGCAGAAAATCCTGAATCATTAACAAAAACTACCGAGAATGATAATCTTGAAAAATCTGATATTCAAGAAAAAACTAATAATGACCTGTACGGTGACGAAGGTACTGAACTACTAGGGAAGGAAGTACAAACCTATAACAAAACCGAAGAAAAGGAGGACGTCAATGAAAACAAGTCTATTGAGACATTAAACAAAGAAGAGCCGTTTGACGGTATTGAACTTCAGGTGCCAACTGATGAATTGGAAGTGGATATCGAAGCTACACCTAAAAATAATAATAACAAAGAAATGTTAGCGCCACTACCCGCTTTATCAAAATGGGAAGTTGAAGATGATAATGTTGAAAAGTCTAAGGAACCAGGTGAAATAACCTCACCAGAAGAAGAAGAAGACGGAGGTAAAGTAACGTCTGAGGTTATAAAACGTGCTGAAAATGCAATTTTTTCCAAAGCCATTAGTACGTTGCGACCTATAGAAATTAAGAAGATTAGTAGCGATCGATTGAAACTGTATAGCGATGACACTCAAATAAAGGGTTCTTTAGATAACATACAAATCACTGTTCCTGTGTTGAATGAGGATCAACAACTAGCAGATCCCAACAAAAAGAAAAGATATTCAAAAACACCTCCTCCTCGACTGTCAGTCAAAGAAAGACTCGGGGGAAAAGTTGAAGAGGTTCGAAAAGTACGAGAACCTCGAGTCGTCCAAAGTACAGTTGAAAGAGTAAAATCCAGGTCAAAAACACCTAAACATGAGCAGATACCTTACCGTCGAGTAACTGTTGAAAGAGATCGAAATCGAAAGCCTGAAATAGTGGCCAGGTTAGACGGATTAAAGGGTGAAAGAAAAATTAGTTCTGACGTACAAAAACCCGACGAAAGTTATCGTTTCAACAATGACAATGATTATAAGAAAAGGCATTACGGAAAAGTTAAAGATGAAATGAAGTCTATAAATGATAGACTTGATACAAACTCACAAAACATACAAAAAAATGACATTACTCAAGAAGTTAAAGTTTTAAATGAAAGAGAACGTAAAAAATCTGTCTTAGACGAAGCACACTTCGAACCCGATTATGATGAAAATGTTGAATCTGATAATGAAGCAAAAGTTGAGCCTGGGAAGAAACGTGAACATTCCCGGGATCCGTTAATCGCTGGAACTAATGAACCAAAAAAGGCAAAATTCGACACTGAAACAATTAAATTAGATCTGACGAACGTCAAAAAGAAACCAGATTCTGACAGCGAATCATCGAGCGATTCTGAATATTCTTATTCCTCCTCATCATCTGACGCTCGCAAGCGTAAAAAGAAAAAGAAGAAAAATAAAAAGAAAAAGAAACGAGCTGCCAGCGACAGCGATAGCGAGTCGGACTCCAGCTCCGACGATCATAAGAAAAAGAAGAAGAAACGTAAACATAAGAAGAAATCGAGTAAGAAGAAAAAGAAGTCTAAACATAAGTAG

Protein sequence:

>DPOGS204341-PA
MSVHYKFKSALDYDTVTFDGLHISVGDLKAAISQQKRIGKTSDFDLQITNAQTKEVYVDDNTLIPKNTSLLVARVPLAQQPKKQWEGASSSQSNPLKDVTLNKGLADLSRMEGSEQDKINAMISQSTFDYDPSNYQKIRGQNQRGAVPSNYICYKCQKRGHWIKDCPAALSGDPVEIRRSTGIPRSFMVPVDGPKAPGAMMTPSGTFAVPAVDHEAYLASESAGGADTPTNAPVAPEPTIPDELICSLCRDLLTDAVMIPCCGNSFCDECIRGALLESEDHECPDCREKEIAPTTLIPNRFLRNSVSSFRNQTGYSRRAPHRPSAAPPVIEPPPVAPVQPININGPPQPAPLNNDSRVSGCGGDKETKGSKAEESDGSADDNITVTVPPAHAHHSTNEPHGPVGPQTSHGPHGSRGSHGSSGAHGSKRPRGPQGPYGSHSSRPSHSNHRSRYGPPVKPPEMDTPPLHMPRIDEQRASTPTIDERRDVISSQVIYNRGPPPYLPAVPPPSQHYPNPAFGRRYPQDAYQPPPPGIKDSPNGSPSPRRRHRSPLRYRSPLRSPARSPPRSPRTPLRSPHRSMPPHSPVRTPRRSAARSPPRTHSPHRFAGVYDELLSPPRRSVEPPFGARYASRNNIPPPGYPPITSKPMPLMANLILPNEPPPGYRSRYDAPPFEDIPPGVEPTVPGFEPSPFEKPIFGPSGTMERDRLPPSNYRDPYRAPYVEGPAGYRDNQMPPLQSDIPGPIAEHSQRYRDNFRATTHPYRDQGPVSYRDGQPYRETGSQASFRDNNFRNGPPSLFRDNNFRNNPYRDPNFRENVPHNFRDGSAPFRPPSVDPREPNSVVYPDPNYREGFRDENNTSRDVRPGYRGSVRNRGGSRRNPNDHERHRERDGRERDRFNENREAPDRTERPRDVDKRSGNRENREERSREYDRNRDYEKERDRGHEKQSPDRKQRVSPKRSRDTRERKQSETRARSRDRESRKEKKEERTRDKSSAERNRDHKEKDKKVKDRKKKKREKEKEVEKKKKRDKKDKKDKDVIKKEEDEQEISDSKQDANAEPKENTDQTLELKAENPESLTKTTENDNLEKSDIQEKTNNDLYGDEGTELLGKEVQTYNKTEEKEDVNENKSIETLNKEEPFDGIELQVPTDELEVDIEATPKNNNNKEMLAPLPALSKWEVEDDNVEKSKEPGEITSPEEEEDGGKVTSEVIKRAENAIFSKAISTLRPIEIKKISSDRLKLYSDDTQIKGSLDNIQITVPVLNEDQQLADPNKKKRYSKTPPPRLSVKERLGGKVEEVRKVREPRVVQSTVERVKSRSKTPKHEQIPYRRVTVERDRNRKPEIVARLDGLKGERKISSDVQKPDESYRFNNDNDYKKRHYGKVKDEMKSINDRLDTNSQNIQKNDITQEVKVLNERERKKSVLDEAHFEPDYDENVESDNEAKVEPGKKREHSRDPLIAGTNEPKKAKFDTETIKLDLTNVKKKPDSDSESSSDSEYSYSSSSSDARKRKKKKKKNKKKKKRAASDSDSESDSSSDDHKKKKKKRKHKKKSSKKKKKSKHK-