Monarch geneset OGS2.0

DPOGS215612
Transcript: DPOGS215612-TA; 5580 bp
Protein: DPOGS215612-PA; 1859 aa
Genomic position: DPSCF300041 - 2218184-2226247
RNAseq coverage: 386x (Rank: top 31%)
Annotation
Heliconius: HMEL005917; E-value 0.0; 74.41%
Bombyx: BGIBMGA003672-TA; E-value 0.0; 65.71%
Drosophila: shn-PD; E-value 2e-173; 41.64%
EBI UniRef50: UniRef50_UPI00021A6877; E-value 0.0; 46.54%; UPI00021A6877 related cluster n=2 Tax=unknown RepID=UPI00021A6877
NCBI RefSeq: XP_001976135.1; E-value 2e-174; 40.51%; GG20167 [Drosophila erecta]
NCBI nr blastp: gi|340712385; E-value 0.0; 46.54%; PREDICTED: hypothetical protein LOC100649920 [Bombus terrestris]
NCBI nr blastx: gi|383851038; E-value 0.0; 36.57%; PREDICTED: uncharacterized protein LOC100882107 [Megachile rotundata]
Group
Gene Ontology: GO:0003676; E-value 2.6e-14; nucleic acid binding
KEGG pathway:
InterPro domain: [200-228] IPR013087; E-value 2.6e-14; Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL15757; Insect specific

Nucleotide sequence:

>DPOGS215612-TA
ATGAAACGAGCCAACGACAAAGTACAACTAAAAACAGTTGAAGACAGGCGCGGCGCGGTATCGTGCGTCGTATGTACAAAAGTCGCGTACAAGATGCCGCTGACACAGGAAAAGCGAATGGACAATTTTCCCAATAGCAATGATCACAGTTACCTTCACAAGAAGTTCAAGAAAATGGCATCGCCAATATCGGTGCTTCCTGTGAACTCTATGAAAAATGAGGAAAGTGTGCATAAGGAATTAGGATTTTCACAGATCACAAGCAGCCCAAACCTAAAAAATGGAACCATTTCAGCCGCTCATCCAGAAATAGAGAAAATTAAAGATAAATACGCGAACAGAGGTATAGAAACAGATAATATTGTACAAACTAATTATAATGATGAAATCTGTAGGTTAAGTGGTGTTAATAACTATGTTCAAATAACTGGTAAGACCAGTTATAATGATGAATTTTTAAATAAGACTGAAAACGGTGAAATCGATGTAACTAAAGACACTTACAGTGGAGGAACTGGCCGTTACATCTGTCCGTATTGTAAATTACCATGTGCCAAGCCATCAGTACTACAAAAACATATCAGGGCTCATACGAATGAGAGACCGTATCCATGTATACCCTGTGGCTTTGCCTTTAAAACTAAATCTAATTTATACAAGCACAAGAGGTCAAGGACGCATGCTCTACGATCGCAAGGAGCAGATGTGTCCGTTGCTATTAACGAAGAGGATTTATCTGGAGGTTCCGAAAGTGATACTTCATCTACACCAACTTTTATGTCGGATCCTACCTCGGATGCTTCTTTGATACGCTTCCTTAATACGAGGCCTAATGATTTTTCCTCCCCCGAGCTAGCTAGCGATGGCAACACACAAATCAGCTCAAATTCTTATAATGACCATAAATCTAAAACAATTTATAAGCCCAAGTTCAGAGCAGCGTTATATCAGGGAAATGATGATAAAGATAAGATAAAAAAGAGTATTTCACACAATGCTGAGTTTCTTACAGAACATATCTCTAAAATTATATCAGATAATGAGGCCATTGTTGATGTTATTGAAACCCCTTTACAAAAAAAATATGGTAAAATTAAACAAATTGCTGAGAGTAAGCAATTCTCAACGGAAATCGACATAAAATCTGAAGTGACGCCTTTAAATTTAACAAAAAACAGTTACGAAACAGAGAGCTTGATAAGAAAAAGATCGCATTCGGAAAGTTTCGCACTGACTTCTGACGATCACAAGCATCCGTTGAATCCTGAGGGATCAATAATAAAGGATCTTTTGCTAAAAACTAAAGCCAATGGATTGAACTCCACCAACAGCTTGACTGGTGAATTGGTAGATGGGCTAGGGCCACTATATGTGTGTTCACAATGTCAAATAGTATACAGAAGTGCAGACAACTTAGAAATTCATAGATCGTACTATTGCAAAGGTGCACTTACAAATAATTGTAACATTTCTAGTAATGTCCCAAAAGAAGCAAAGTATGTCAGACCAGATAATGGTTACGTGAGAAGTAATTCCATTAACGTCCGATTGCCCGAAACAAGTGCGTCATCAACTAAAGTGAATTATTTAATGAATTCACCTCCAATAAAAAACAAGCCAGATAATCTTGTCATATTAAAAACGGAATGTAGCGACGTGATAGCACCGCTACCATCACCAGGGCCGTTACTTGGTAACACAAGACTTGTTGATAGCAGACTACCCTCAGAATGTAATAAAAAAACGGAGGGTTTGAAATTGAAAACAAAAGAATGTAGCCCCAAAAGAAGGTTTGACAGTAGATCGGAAACTAATAGTCCGAGGCTTATAGAAAATATATCTCCTCGTTCAGCTGATTTGTACGTTCATTCGAAAATAAGGTGTGTTGATATTAGCTCTCCGTCATTGAGAACTATTGAAGAAATATCGCCACATATAAGACACAATTCTACATCGCTTCAAATGTTCGGAGGTGAAGTTAAAATAGTCGATCATTCTGG
CAGCACTACCACTCTCCGAATCGAACCTAGTAAAACACAGCTATCACCGATTCTAATCCACCAAAACCTTTCGCCATCTAAATTTGGAAATGACTCTGAAGCAAGTAGTGTCGTTGTAAGATCGGGTCTTCATTCCGGTGGTACTATAGTGCATAATCCGCCAACACCGAAGGAAGCTATTAATAATCATCAAGTTCAAACTCCTAGAATTGCGTCATCAACCCCGAACGCTCAAAACACGAACATGCACGTTCATGATATTCCACATTTCCAGTTTCCGCCGCTGAGTGCTATAACAGCTTATAATCCATTAACATTGCCGCCTCTCAGTCCATCTCCGTCACCAAACGGTGCAACTACTATTATGCACGGCGGCAAACTCATACCTCATGTTCCCGGAATACCAGGACCTAATATACCTGGTTTATTCATGACAAATAGCAATTTACAAATGAAAAACAGTGATAGCCATAAAAGTGTCACGAAAACGACCTCAGATAATAATCAACATTTATTAACTCTTAATGTATCTACGGGAAAGGGAAGTGTTATATCAAACTACGATAGGATTCCTAACAACTCGAGAAGTCCAAACATCAGATCGATGAGTATGGAAGTAGAACATAACATTAGTAATAAAGATACGGATATTCAAAATATGTCTTCAATGCCAATAATTAAAATAAAACACGTCGATGAACCTATATTAAAGAATTTCTCTTCTAGTTGTCTTCTGAAATCGATTAAGAGAAATGCCGATGGAGTACCTAAACATCCTGTATTAAAGGAAAATACGAACACATCATTTAATAAGACCAAAAATAAAGACTCTCACGTTCATCCGTCTCATAAAGTTATCGATATAAAAGCTGAGAATGATATAAAGAATTTCAATTTCGAAAACTTCATCACTAAAGCCGAAATATACAATAATCAAATTCAAAATAAAATCGACATCGAAAGAAACAGTAATGTTTCGGAAACCTTTGTGACTTCAGTGCAAAATGAAAGGTCAGAGACATCTTATTTTCAGAAGAGTTCAATATCAAAGTCTAATTCTGAGGAGAGAAAACCGAAATTTCTTAGACCATCTACTCTACCTCTAAAGCCGGGCACGTTTACCCCAAAAAGACATCACGGCATAACGCCCAATGCGAACACAATGCCATTGGTGTCACCAGAAACCCCACGCCCAGCAAAAGCATACGGACAACTTTATTTAAACGGAAATGCATATACATATCTGGGTTTGAAATGTTCAACAAAAGTATTTTATTGCACTATCAATCGTCCACAGCCCACATACGTACCGAACCAGCATTTCCTATCCATGTACAGTAACTGGCAGTTATTATCTGAGTTGACGCCAGACCCGCTGGGATTGTCAGCCTCGTCTGCTATGTCTCTATATGACTCACGTCACAGACCGCAGAGCTTGGCCGTTTCTGTAATCAAACAGGATCTCATTCTGACTCATTCATCGCAATGGAACAAAAATTCGAAGGACGGCAAACAGGTGATAACTTCTATAGACTCTAAAAAATCGGAAGAGATAAAAAATATTTCCGATAACACAGCTACATCCAAGAAAGAATTAACCGGTGGATTTGAAAGTAACGAGGAATATACATATGTTCGCGGACGTGGCAGGGGACGATATGTTTGTTCGGAATGTGGAATAAGATGCAAAAAACCATCAATGTTGAAAAAACATATCAGGACACACACTGACGTCCGACCGTATACATGCGTCCATTGCGTTTTTAGTTTCAAGACGAAAGGGAACTTAACAAAACATATGAAAAGCAAGGCACATTATAAGAAGTGCTGTGAGCTAGGAATAAATCCAAATGAAGGGAACGATGCCGAAGGCTCTGAAATGGCGCAGTGTTCCGGTGAAACTGATGATGAAACGGATTCAGACGGTGATGAGGGAAATGAGGGTGAAACAGAATCCAGTGATACAGAGGTTTTTAAATCTCGCCTGCCGGAACACGAGG
CTGCCCACTGCTTGCTATCTCTCGGCGGCAGTAGACCTGCCACCTCAGCCACTCCGGGCTTAATAACTAGCGCTAGGCCTACAACGTACCCCTACACTCCTATGTTACTAGAAAATACGTTAGATGTTGATCAAGATAAAGTCGAGAGTGTAAGAACACCCTCTACTGATTCTAGAATAGATACGGACAATGAGCCCATGGATCTCAGTAAAAATGAATTAAGAACTCCAACGAGCGTGATGGAAATTCCCACTGAAAGAGAATCTAGCGTCATGGCCTGTTTGGCTTCCAATACTGCGAAGCTTCCCCATCATCAATCACAGTGGACCAACGGAGAGCCAATGCTGCACACGTATCTAACAGAAAGGGCACTTTTAGATTCTAAGATTAAACAGAGCCAATTAACATGTAATTCTAAAATAAGGAAGATTGATCTCGAAAATTCTTTATACCTTGAAAAAGAAACCGCTGAACAGGAAATTTCGAATCCAAGAAATGTTCTTGATACGATTACAACAACAACAGCATTTTCTAAAGATGAGTCAGTTTCTATTGATAATTCTCAGAATTGTTTGAATCTAACTAGTGAATCCAGAGCTCGTACACCTAACAACTCCAATCCAGAAAATGCTAAACATGTTGTGTCCGAGTATTTAAAACATGCTAGGATAAATCATATGAAAACTCTCGACGATCCTAACCATTTAGATATATCCAGTGACGACAGCAACAGTGGTAAAGTTAACATAGAAGAAAAGACTAAGGTAGATGAGGTCTCAGACTGTGATGGAATGAAATTATCTTCTTCAGAATACGATCCTGTAGCTTCCAAAGTAGTGATCGGAGTCGGGGGAGTATTTAAAGTGACCAAAGGGAAAGAGTTTGACGGATCGGCTTCTTACTCGCCAGGAAAACTCATGGAAGATGGACGTAGAGTTTGTGATTTCTGCAACAAAACATTTACAAAACCTTCACAGTTAAGGTTGCATCTAAACATACACTACATGGAGAGGCCGTTCAGATGTAGTGTTTGTGCTGTTAGTTTTCGTACCAGAGGTCATCTGCAAAAGCATGAGCGTTCTGGGTCCCATCACAATAAAGTGTCAATGACCTCAACTTTCGGGGCAGCGACATCGTGCAATCCTCGACCTTTTCGTTGTTCAGATTGTAATATAGCATTCCGAATACACGGACATCTCGCCAAACATCTCAGAAGCAAGATGCATGTGATGCGTTTGGAGTGCTTATTCAAATTACCGTTTGGAACGTTTACGGAAATAGAACGTGCTGGTCTCAGTCTAACAGATATAGATACGACAGATTGTGCCAGTTCTTTGGCTAGTTTGCAATCTCTCGCCAGAAAATTACATGAAAAGGACCCATCGAAACTTGAGTACCGAGAGCCGAGTGGGGCAGCGCTTAACCTACCTGCAGGGAGGGAGTCTTCTGAAGACGAAGATGCTTTAGTTTATTTAGAAAAGACCTGTGACAGTTTAAAAGATAGTGAGATAAAAACGATTGAAAACAGTGACTGTCAAGAAACAGAAACTCGAGTTAATTATAGTGCCACAGATAATTAG

Protein sequence:

>DPOGS215612-PA
MKRANDKVQLKTVEDRRGAVSCVVCTKVAYKMPLTQEKRMDNFPNSNDHSYLHKKFKKMASPISVLPVNSMKNEESVHKELGFSQITSSPNLKNGTISAAHPEIEKIKDKYANRGIETDNIVQTNYNDEICRLSGVNNYVQITGKTSYNDEFLNKTENGEIDVTKDTYSGGTGRYICPYCKLPCAKPSVLQKHIRAHTNERPYPCIPCGFAFKTKSNLYKHKRSRTHALRSQGADVSVAINEEDLSGGSESDTSSTPTFMSDPTSDASLIRFLNTRPNDFSSPELASDGNTQISSNSYNDHKSKTIYKPKFRAALYQGNDDKDKIKKSISHNAEFLTEHISKIISDNEAIVDVIETPLQKKYGKIKQIAESKQFSTEIDIKSEVTPLNLTKNSYETESLIRKRSHSESFALTSDDHKHPLNPEGSIIKDLLLKTKANGLNSTNSLTGELVDGLGPLYVCSQCQIVYRSADNLEIHRSYYCKGALTNNCNISSNVPKEAKYVRPDNGYVRSNSINVRLPETSASSTKVNYLMNSPPIKNKPDNLVILKTECSDVIAPLPSPGPLLGNTRLVDSRLPSECNKKTEGLKLKTKECSPKRRFDSRSETNSPRLIENISPRSADLYVHSKIRCVDISSPSLRTIEEISPHIRHNSTSLQMFGGEVKIVDHSGSTTTLRIEPSKTQLSPILIHQNLSPSKFGNDSEASSVVVRSGLHSGGTIVHNPPTPKEAINNHQVQTPRIASSTPNAQNTNMHVHDIPHFQFPPLSAITAYNPLTLPPLSPSPSPNGATTIMHGGKLIPHVPGIPGPNIPGLFMTNSNLQMKNSDSHKSVTKTTSDNNQHLLTLNVSTGKGSVISNYDRIPNNSRSPNIRSMSMEVEHNISNKDTDIQNMSSMPIIKIKHVDEPILKNFSSSCLLKSIKRNADGVPKHPVLKENTNTSFNKTKNKDSHVHPSHKVIDIKAENDIKNFNFENFITKAEIYNNQIQNKIDIERNSNVSETFVTSVQNERSETSYFQKSSISKSNSEERKPKFLRPSTLPLKPGTFTPKRHHGITPNANTMPLVSPETPRPAKAYGQLYLNGNAYTYLGLKCSTKVFYCTINRPQPTYVPNQHFLSMYSNWQLLSELTPDPLGLSASSAMSLYDSRHRPQSLAVSVIKQDLILTHSSQWNKNSKDGKQVITSIDSKKSEEIKNISDNTATSKKELTGGFESNEEYTYVRGRGRGRYVCSECGIRCKKPSMLKKHIRTHTDVRPYTCVHCVFSFKTKGNLTKHMKSKAHYKKCCELGINPNEGNDAEGSEMAQCSGETDDETDSDGDEGNEGETESSDTEVFKSRLPEHEAAHCLLSLGGSRPATSATPGLITSARPTTYPYTPMLLENTLDVDQDKVESVRTPSTDSRIDTDNEPMDLSKNELRTPTSVMEIPTERESSVMACLASNTAKLPHHQSQWTNGEPMLHTYLTERALLDSKIKQSQLTCNSKIRKIDLENSLYLEKETAEQEISNPRNVLDTITTTTAFSKDESVSIDNSQNCLNLTSESRARTPNNSNPENAKHVVSEYLKHARINHMKTLDDPNHLDISSDDSNSGKVNIEEKTKVDEVSDCDGMKLSSSEYDPVASKVVIGVGGVFKVTKGKEFDGSASYSPGKLMEDGRRVCDFCNKTFTKPSQLRLHLNIHYMERPFRCSVCAVSFRTRGHLQKHERSGSHHNKVSMTSTFGAATSCNPRPFRCSDCNIAFRIHGHLAKHLRSKMHVMRLECLFKLPFGTFTEIERAGLSLTDIDTTDCASSLASLQSLARKLHEKDPSKLEYREPSGAALNLPAGRESSEDEDALVYLEKTCDSLKDSEIKTIENSDCQETETRVNYSATDN-