Monarch geneset OGS2.0

DPOGS215562
Transcript        DPOGS215562-TA   1482 bp
Protein           DPOGS215562-PA   493 aa
Genomic position  DPSCF300097 - 332392-336613
RNAseq coverage   243x (Rank: top 43%)
Annotation
Heliconius        HMEL016912         0.0     71.31%
Bombyx            BGIBMGA000362-TA   0.0     62.16%
Drosophila        CG6654-PA          6e-31   37.88%
EBI UniRef50      UniRef50_F1R8F3    3e-34   43.04%   Uncharacterized protein n=7 Tax=Danio rerio RepID=F1R8F3_DANRE
NCBI RefSeq       XP_001811801.1     5e-33   38.25%   PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastp    gi|141795779       1e-33   43.04%   LOC100005466 protein [Danio rerio]
NCBI nr blastx    gi|348543315       4e-36   40.00%   PREDICTED: zinc finger protein 665-like [Oreochromis niloticus]
Group
Gene Ontology     GO:0003676   1.3e-10   nucleic acid binding
                  GO:0008270   1.2e-05   zinc ion binding
                  GO:0005622   1.2e-05   intracellular
                  GO:0005634   2.7e-05   nucleus
KEGG pathway
InterPro domain   [375-401]   IPR013087   1.3e-10   Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group   MCL26822   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215562-TA
ATGCTTACAGCTATTGCCGATGTAAAGGTATGTTCAAACGATGGGCTACCTGAAAAGATTTGCCACAAATGTTGCACAGCTCTAGAAAAAGCTTATATTATTAAAACTGTTGCTGAAAGATCAGATCGCAAACTGAGGCAATTAGTTGCAAAGGCAAATGAAAAGTTACCAACAAAAAAGATAGCATTTGACTCCTTCACTGATATTAAAAGTGAAATTGGACCTCTGACTGTGAAATGTGAACCAAAATGGGAAATGGAAATTGATGTTAACAATAAAACCGGAGCTCCAATAATAGAGACTATTGAAGGAAGAGATATTGTTATTGCCGATACCGTCATAAGAAAAGTTGATTCAAATAATGACTTTAGCTCGTCAAACGAAAACAAGAACTTTGATTCTGATACAGATTTCCTTGACGATCACATATTCCGTGATGATGATGATGATGATTATTTACCTCCAGCTGAAAAAACTAAGAGTAAATTTAAGAAGCCGACATTGGCAGATATAAAAGTATTTAAAGCCAAGAAACCGGTTGTTAGAAAAGTGAAGGTGATAAAGCAGTACAAACAAGAACCAATAAATGATGAAGACACAAATATGAAATCAAAGACGACTACAATTACGTGCTATCCAGTTAATAAAAGTGGTGTAAGAGGACCACCTGTACTCCTCAAGAGGCCGAAGGATCTGACAGTTTTAAAAATTCAGAAAAATTATCAGACCAAAATAGAACCGAAAGATAGACCGGCAAGAAGCAATAGTACTTTAAGAAGAGTTAAAAATCCAAAGGGGGTTTTGAAAGAGAGAGAAAAACTTGTTTGTCCCATTTGTGGAATCTTAACATTCAGTTTAGGCAACCATATAGCCACTCATGAAGAAAAGAAAAAATTCACATGTTCAGAATGTCCCCGGTCATTTGTGCAAAAGAGTAATTTATTGGTGCATTTAAAAAAACATAATGGAGTCAAGGATCATATCTGTGAAGTTTGTGGAGCTGGGTTTTATACACAGAAATCATTAGCAAGGCATAATCTGATACATAAAGGAGAAAGACCATTCCCATGTAATTTATGTTCCAAAAAATTTATTGCGCGTTGTGATCTCAACCGCCATCTTCGTATCCATGCTGGTTATAAGCCGTACAAGTGTGGAACATGCGCTATGTCCTTCAACGCCAAACATCAGCTGCAGAACCATGAAAGAATGCATACCGGAGAGAGACCGTATTCATGTCAGATATGCAATGTCGCGTTCAGTTATAAAGTGAACCTTAACAATCATGTATACAAAGTGCACGGTATCAATTTGAAGTTCAAATCCATTCACACTGTAACGGAGGAGGTTTTACGTCGCGAGCTGGGATTAGCAAGTGAGGCTGCGGTCGCACAGATGATGCCACATCTGGACCGAATGACAGATGTACCTACACCAGCTGCACACGAGACTGTTCATCATACTGTTGACTTCACAGTCTAG

Protein sequence:

>DPOGS215562-PA
MLTAIADVKVCSNDGLPEKICHKCCTALEKAYIIKTVAERSDRKLRQLVAKANEKLPTKKIAFDSFTDIKSEIGPLTVKCEPKWEMEIDVNNKTGAPIIETIEGRDIVIADTVIRKVDSNNDFSSSNENKNFDSDTDFLDDHIFRDDDDDDYLPPAEKTKSKFKKPTLADIKVFKAKKPVVRKVKVIKQYKQEPINDEDTNMKSKTTTITCYPVNKSGVRGPPVLLKRPKDLTVLKIQKNYQTKIEPKDRPARSNSTLRRVKNPKGVLKEREKLVCPICGILTFSLGNHIATHEEKKKFTCSECPRSFVQKSNLLVHLKKHNGVKDHICEVCGAGFYTQKSLARHNLIHKGERPFPCNLCSKKFIARCDLNRHLRIHAGYKPYKCGTCAMSFNAKHQLQNHERMHTGERPYSCQICNVAFSYKVNLNNHVYKVHGINLKFKSIHTVTEEVLRRELGLASEAAVAQMMPHLDRMTDVPTPAAHETVHHTVDFTV-