Monarch geneset OGS2.0

DPOGS205032
TranscriptDPOGS205032-TA1641 bp
ProteinDPOGS205032-PA546 aa
Genomic positionDPSCF300388 - 91828-99302
RNAseq coverage580x (Rank: top 22%)
Annotation
HeliconiusHMEL0225010.074.42% 
BombyxBGIBMGA001656-TA4e-13670.31% 
DrosophilaCG11247-PC4e-2529.81% 
EBI UniRef50UniRef50_UPI000223FD7C4e-2828.08%UPI000223FD7C related cluster n=1 Tax=unknown RepID=UPI000223FD7C
NCBI RefSeqXP_784091.14e-2727.15%PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastpgi|2914132807e-2927.76%PREDICTED: zinc finger protein 26-like [Oryctolagus cuniculus]
NCBI nr blastxgi|3660399615e-3829.00%zinc-finger protein 80-like [Mus musculus]
Group
Gene OntologyGO:00036763.8e-08nucleic acid binding
KEGG pathway 
InterPro domain[474-496] IPR0130873.8e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26585 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205032-TA
ATGGACCACACATATATGTCCCAATCATTGTCGTCATTGGAATGCGTGATCAAGACCGACTTCGACCAGATCTTAGTCGACTACAACTTGCAAGAAAAGGACTGGCCGCAGTACATAGCGATTAATGGAGAGGAGAACGTTAAAAATGAGGGTTACGATGTAGCACAGGCTTTGAGAGACCACATAACATTGAGTATTGATGACCAGGTCAACGTTATGGGCGTTCCGGAACTAGTTTTGGAGAATCCGGTGACCGGTGTGATGTCACACATCGTGGTCAATGCCGGATCTCTGACCGATTACAGCAGTGTCAAGAGGGAAATGCCAGAACTGACCATAGATCCCACAATAAAGCCTGATACGGTGATAATAAGCCAGAATCCAAGAAATAGTCCGAGGAAGGAGAAAGCTGATCTCAAAAGCAAATATACTAAGGAGCTTATGACTGATGAGGAAATGCTCGCATTGAGAGAGGAAGCCAAACAGAAAGTGCAATACGTGAGTTCTGTGTATAAATGTGAACTGTGCATCATAGGATTCTACACGCAGCAGCAGGTGGAAGATCACTTCGTGGCTATGCACAGAGAGAAGCCGGGTTACGTACCGTGTAAAGTATGTTTCGTATATACACCGGAGAACAAGGTGGACGAACACACGGACACGCACTACAGCAGGTACACGTGCAAGATGTGCAGCCGGCGGGAGACCAGCCTCAAGATGATGATGGTGCACCTCAGGGCCCACGAGAACAGGACGCCCAGGGCGCTCATACAGATAGACGGGGAGAAGAAGAGCAGAAAGAGAAAAAACACGAAGTGTGATGAAGAGGAAAAATCACCTCCCAAGCCGGGAGACCTCAGGAAGCTACTGTCCAAGACTACTATAGTCGGCTACAAGTGTTTGGAATGTGACATGTTCTTTAAAAATTCAAGGGCACGCAAGAACCACGTGGATAGATTCCACCGGGAGGGTCTGCAGTGTGATCATTGTAAGAAGAGATTCGTTAACAGGACCACTCTCGCCACACATTTGAGGCTCCACGAAGGCCCGCTGCCCCGCGAGGAGTGCCCCATCTGCCACAAGATGGTCCGCACGATACAGATCAAGTACCACATACAGAGGCATCAGAGCACCACCAAGTACGAGTGCAGGGACTGCAACAAAATCTTCTCCCACCTGGCGACCTATCAGGCGCATCTGAAGTTCTCGAGGGCCCACGCATCCGATCAAGTTTTTAAATTCCCGTGTCCCATGTGCAACAAGGGCTATCCGACAAAACAGGCTATGCAGGACCACTTCAACTATCAGCACCTCAACAAGACAACGCACAAATGTCCCATATGCAGTAAGCCGATAGCATCCAAAGCGAATGTTGAGAAACACATGATGAGGGTCCACGGGGAGAAGAAATCTAAGCCCAGGAAGCATGTGTGTCAGATGTGTGGCAAGGGGTTCACGGACAAGAAAGCCTTAACTCAGCACGAGGTCATCCACTCCGGGGAACGGCCTCTCTCTTGTGATATTTGCCAGCAGACGTTCAAACAGAAGGCATCCCTGTACACACACAAGAAACGAGTCCACAAAGTATTCCCAGCTAAGAGAGTCGTTGAATTTATGGACAACGGTGAAAATAATACCTAG

Protein sequence:

>DPOGS205032-PA
MDHTYMSQSLSSLECVIKTDFDQILVDYNLQEKDWPQYIAINGEENVKNEGYDVAQALRDHITLSIDDQVNVMGVPELVLENPVTGVMSHIVVNAGSLTDYSSVKREMPELTIDPTIKPDTVIISQNPRNSPRKEKADLKSKYTKELMTDEEMLALREEAKQKVQYVSSVYKCELCIIGFYTQQQVEDHFVAMHREKPGYVPCKVCFVYTPENKVDEHTDTHYSRYTCKMCSRRETSLKMMMVHLRAHENRTPRALIQIDGEKKSRKRKNTKCDEEEKSPPKPGDLRKLLSKTTIVGYKCLECDMFFKNSRARKNHVDRFHREGLQCDHCKKRFVNRTTLATHLRLHEGPLPREECPICHKMVRTIQIKYHIQRHQSTTKYECRDCNKIFSHLATYQAHLKFSRAHASDQVFKFPCPMCNKGYPTKQAMQDHFNYQHLNKTTHKCPICSKPIASKANVEKHMMRVHGEKKSKPRKHVCQMCGKGFTDKKALTQHEVIHSGERPLSCDICQQTFKQKASLYTHKKRVHKVFPAKRVVEFMDNGENNT-