Monarch geneset OGS2.0

DPOGS215954
TranscriptDPOGS215954-TA3105 bp
ProteinDPOGS215954-PA1034 aa
Genomic positionDPSCF300078 - 996486-1003922
RNAseq coverage268x (Rank: top 40%)
Annotation
HeliconiusHMEL0164590.053.84% 
BombyxBGIBMGA001069-TA6e-12650.76% 
DrosophilaCG3407-PA1e-5357.23% 
EBI UniRef50UniRef50_E3X4Z02e-5831.87%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3X4Z0_ANODA
NCBI RefSeqXP_395120.23e-6052.68%PREDICTED: similar to CG3407-PA [Apis mellifera]
NCBI nr blastpgi|3838539923e-5961.68%PREDICTED: zinc finger protein 195-like [Megachile rotundata]
NCBI nr blastxgi|3320162894e-6243.98%Zinc finger protein 192 [Acromyrmex echinatior]
Group
Gene OntologyGO:00036762.1e-11nucleic acid binding
KEGG pathway 
InterPro domain[791-819] IPR0130872.1e-11Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26840 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215954-TA
ATGAGGAACGTTGAACCTGTGTTAGAAGATAAAATGGACGAAGCTGTCACTCATTATATAAATTCACATCAACCACCTGAGTCAGTACATATAGATGCAAATTCAATGGCTCTCAATGTGCATGATGATTCCACTGGCAGAGTGGTTACTGTTATGCACCCCCATAATTTTACGACTCAAATACAAGTATCAGTAGGTGACCAAGTTGGAGAAGGACCATGGGTAGAGGAATTCCTAGATCCTGAAGGTAGACTAGCAGCTATAGTGGCACATCTAGCACAACACTCTAGGACGCAGCATCATTTGCATAAAGTTCCTGTTAGGAGTGATCTGGATTCTTCAGTAATTTTAGATGCACCAATGAGACTTTTCAACGGTCAAACTTTGCAGGAACCACACTCTGAACCCATTGTAGTACAAGTTCCACCTCTGCCTCCATTACCACCATTACAAGAGATGAAGAGATGTCCGGAAGCGGACTGGTTTAATAAAGACAAAAATGAGCTTAAGGAACAGCAACTCCTTTCACAGGATACTGTTCTGGAGTCCAGCGGATCTCCAGCACATAAGCAGAGTAAAAAGAGCCTGCCTCATAAGAAGAGGATCTCACGGAAACTAAAACGAACCACCAATAGCACTCCGCAGCAGGATATAGTAGTGATCAATTGTGGATCTGAGGTTCCACAAGAAGAAATTCTACCAGATAGTTTCTCACACGCTCACGAATCCCACGATACCAATAACGGACACGACGCCCATGGAGCTCACGATGCCCACGATCCCCATAATCCTCACAATACTCACAATCTCCACAACCCCCACCATCCTCACAATCCTCATAACCCCCATAATGCACACAATCCTCATGATCCCCATGATCCTCATGACACTGTTTTGCCTGAAACCCATCATATTGGACAGGAGATACGGTCGCCCCTGATTTGTCAACTTTGCGGCGAATACTACGGTCACGAACAGCTGAAGTTCTACCATCATCTCAAACAGCATTACGAGCCACAGCTCGTGTTAGACACTCCCGACCTACCAATTGACAAGATGACAAATACGTGTATAGACAATGTGGCAACTCTTCCAGATTCGATAGTGGAACTATCACTAGAAAATACAGTACCAAGGATAATATACCCTCCGCAGGACAAAACATTTTGCAGCTATAAGATTCCGTTTACTTCAACTAATATGGAAAAGGAACAGGAACAGGGAAGAGTTGATTTATTTGATTCATTGGACAAACTGGAAATGTATTTCTGCACAAAGTGTAATAAATCATTCAGGAAACAGAAACAATGTGAGATCCATATAAAAGAAGCTCATCTGAATCAGAAGATGGATGATATATCCGAGTTCAGTGACCCTGAGGATCTCATGGAGGGCATACACGTGGCTGTAGAAGAGGGAGGAGGAGAGGGGGAAGGCGGTGGAGACTGTAGCAGTGAGCAGTACGACCAGGCTCTACTACCGCATCTGACAGTAGAAAACGGACACGTACATCAAGAACATGTTAGACACTGGTATATGAGGAATGGGTCAAACAGTTCGGTGCCACTTTGCACATGTGGCGGTGCGGGTTACTGTGCTATGTGCACTCATGCACACACGCCCACCATTCAACCCACACAGACGATACACACACAGACCATACATACCCAGAGCTCTGACATGCCCTTGCCCAGTGTGACCAGTGTCAGCCACGTGACAACCAGCCACGTGACCAATGTCAATCATGTGACCAGCGTCACTCACACCAGCAATGTTAACGTCAGTCAGAATGTAGCAGTCAGTCAGAGGACGGAAAAAGATGAGTCGCTACAGAGGATGTTCGAAACTGAGAATCACAGTCAGGAAAACTTCACAGAGAACATTCTAGAACAACAGGAGCTGAACATCAAAGTTGAGCAGCCAGTAGAAGTTAAACAGGAGAAGAAAAAGCCCACTAAGACCTTCGAGTGTTCGCATTGTGATAGAGTGTTCCATCATAGAAACAGCCTTTTCTATCACACGTTGATGCATAGCGAGAAGCAACAAGTCTGTAGAGAGTGTGGGAAGGAATTTTACACCGTCAATGCTCTTAAGATCCACAAGCGAGTTCACAGTGACTACCGGCCTTGTAAGTGTGACGAGTGCGGAAGAGATTTCAGGCAGTGGTCAGATCTGAAGTACCACAAGGCATCCATACACTCCGATAAGAAAAACTTCAAATGCGAATTCTGTGGTAAGGAGTTTGCTCGGCGCTATTCTCTTAACGTACACAGACGTATACACACCGGCGAGAGGAACTACAAGTGCGAGTACTGCAACAAGTCTTTTAGGGCCTCGTCCTACAGGCTGATTCATATGAGGACACACACTGGTACTAAGCCCTATAAATGTACACAATGTGAGAAATGTTTCCGTGTGGCTTACGATCTGCGACGACACATGCTGATACACGACAAAGTGAGAGTCCGAGGTGAAGAACAAAAGAATAAAACAAAGGAAAAGAAACAGAACGACACTAAGGAAGAAACGAAGGAAACAAAAGCCGCTAAGGCAGACGAGAAGAAGGAAACAAAGCTACCAATATTAAAAAGTCTGCTAGATAAGAAACAAACTAAACCACCAAAGAAATCACCTAAAAAGGCACCAAATGTAACAGTTCAGAGTAGATCCAATGAACAATTTAAAATGGACCCAGACTATAACAACGAAGTGTTCGACACGAGACAGGAACAATATAAATTTAAAGTGTACTCAAATGAATTTAAACAAAAAGACTATATAGAAGAAGTCAGACTAGAGGGAAGGTCAGAGGACAGAGAACTGGCAACACTAAGGACATTGAAAAATCAGGAATGTGAAAATATTGAGACGAACAAGCTGCCTTATAGAGAAAACACTGACGGGAAGATGCAGGTTTACACACAAATAGAAAAAACAAAGGAATATAGTGGACCTATAGTTACCAATGCAGTATCATTAAGTGATATGAGGAGTTTAGACCGTGAGTCCAGGGACATGAGGAACGAGGTTCACGGTGAGACCATTGACAGTGCACTCCTGGAGCGTTTGTCTGCTTATTATAATAATAACATTCCGGCCGTATGA

Protein sequence:

>DPOGS215954-PA
MRNVEPVLEDKMDEAVTHYINSHQPPESVHIDANSMALNVHDDSTGRVVTVMHPHNFTTQIQVSVGDQVGEGPWVEEFLDPEGRLAAIVAHLAQHSRTQHHLHKVPVRSDLDSSVILDAPMRLFNGQTLQEPHSEPIVVQVPPLPPLPPLQEMKRCPEADWFNKDKNELKEQQLLSQDTVLESSGSPAHKQSKKSLPHKKRISRKLKRTTNSTPQQDIVVINCGSEVPQEEILPDSFSHAHESHDTNNGHDAHGAHDAHDPHNPHNTHNLHNPHHPHNPHNPHNAHNPHDPHDPHDTVLPETHHIGQEIRSPLICQLCGEYYGHEQLKFYHHLKQHYEPQLVLDTPDLPIDKMTNTCIDNVATLPDSIVELSLENTVPRIIYPPQDKTFCSYKIPFTSTNMEKEQEQGRVDLFDSLDKLEMYFCTKCNKSFRKQKQCEIHIKEAHLNQKMDDISEFSDPEDLMEGIHVAVEEGGGEGEGGGDCSSEQYDQALLPHLTVENGHVHQEHVRHWYMRNGSNSSVPLCTCGGAGYCAMCTHAHTPTIQPTQTIHTQTIHTQSSDMPLPSVTSVSHVTTSHVTNVNHVTSVTHTSNVNVSQNVAVSQRTEKDESLQRMFETENHSQENFTENILEQQELNIKVEQPVEVKQEKKKPTKTFECSHCDRVFHHRNSLFYHTLMHSEKQQVCRECGKEFYTVNALKIHKRVHSDYRPCKCDECGRDFRQWSDLKYHKASIHSDKKNFKCEFCGKEFARRYSLNVHRRIHTGERNYKCEYCNKSFRASSYRLIHMRTHTGTKPYKCTQCEKCFRVAYDLRRHMLIHDKVRVRGEEQKNKTKEKKQNDTKEETKETKAAKADEKKETKLPILKSLLDKKQTKPPKKSPKKAPNVTVQSRSNEQFKMDPDYNNEVFDTRQEQYKFKVYSNEFKQKDYIEEVRLEGRSEDRELATLRTLKNQECENIETNKLPYRENTDGKMQVYTQIEKTKEYSGPIVTNAVSLSDMRSLDRESRDMRNEVHGETIDSALLERLSAYYNNNIPAV-