Monarch geneset OGS2.0

DPOGS216089
TranscriptDPOGS216089-TA1650 bp
ProteinDPOGS216089-PA549 aa
Genomic positionDPSCF300415 + 73658-75376
RNAseq coverage76x (Rank: top 65%)
Annotation
HeliconiusHMEL0062380.080.31% 
BombyxBGIBMGA007797-TA0.075.35% 
DrosophilaCG11966-PA5e-1951.25% 
EBI UniRef50UniRef50_D6WNP92e-10344.92%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WNP9_TRICA
NCBI RefSeqXP_970854.14e-10444.92%PREDICTED: similar to CG4374 CG4374-PA [Tribolium castaneum]
NCBI nr blastpgi|910820417e-10344.92%PREDICTED: similar to CG4374 CG4374-PA [Tribolium castaneum]
NCBI nr blastxgi|910820414e-10645.18%PREDICTED: similar to CG4374 CG4374-PA [Tribolium castaneum]
Group
Gene OntologyGO:00036762.7e-10nucleic acid binding
GO:00082703.1e-05zinc ion binding
GO:00056223.1e-05intracellular
KEGG pathway 
InterPro domain[508-535] IPR0130872.7e-10Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL18785 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216089-TA
ATGACGGCCACTGCTCCTGAACTCTCAAAATCGGATTTTTTTGATTTTGTGACATCTAATGAAGTGACGGACGCTCAATATAAACAGCAAATGCGATCAGTCAACGTGTTCATGGAATCACCTGATTCAAGATCCAACCCTTTGTTGTCAGAGGAGCCCAAGGAGCAAAACAACAACAACATACTTGCCGTCAGCGGTGGTCAGTCGACCAGCGCAGGTATGACGACCGCCACCAGTTCGAATCCACTTCAGAGTTTCGACTCTATTTGGAACGTAGATCGGGATCGCGACCGCATCGAGACAGCAATGTTGGAGGATCTCAACAAATACTATTGGAATCAAGAAAACGATATTAACGGAACCCATCCATGTTCTGATACAGCAATATCAAATAAATTAATAAGCAATAATACGGATGGACAAATATACACACTGACAGTTTTAAATCAAGACATTAATGAAACAAACACTAACCGCTATTGGGTAAAAGAGGAAGATGTTTCAATGTCAAGTCCGATAGACGTTGAACAAAATCCCTCCTTAGACCTGGAATCTATACTTAACATGAATGGATTTCCAAATGATTTCAGTCAAGATACACTTAAATTAAGTTCAAACAACTTGGTCAAAATTGAACCATTCAATTATGATGACAGTGAATTTCAAAGTAGTGACAAAAAAGATGATTCCATAATAAGCAACCCCACATTATTAGAAGTGGAATATAATAACAATAACAACGACTGGAAATTGACTGACCAAAACACAGAATCAAACGAATCATTACTACGGAGTGCCTTACAAGGAAAAGCTTTTATTAGATATAGTACAATTCAAAAAAATACTGTCAATAAATTTGACGCAGAATCAGAGTTAAAGAGGGCTATTATTAATAATAACAATAAACCAGATGTACCATCTATGTATCAAGATAACAAAGATACCGATTCTAGCCTTTTGATGGCCATAGCACCACCAAACGGTAATGTAAATATATCAATATTATTAGAAGAACCTTCAGCGACTGTCTTATCGAATGGAGATAATCCTACATCAACACAAAGCATGGACGACATTTTACTTTCCCAACTTGATGCTAGTTATCCCGACGACTATGAAAAATTGAAGCGGATAGCTACTGAACTAGGTGAATCAGTGCAACCATTTTGTACTGTAGAGCCTATTGATTCCGTTCGTAATGTATACAATATTCATCATGTAAATGGCGAATTAGTAACTATGATTCCAGCAGGAGAAGTTCAGTTGCCTCAACACTTACAGATAGTAACGGCATCACCCACGGTGACGAGTACTAAGCCGGGAGGTAAGAAGATCAGAAGAATCCAAAATAAAAATTCACCCCCAACAACACAAACACAGTCGGGGACGGCTGTTCAAGCGGCGACATCTACTTCAAATGGTGTCCGAAAAGAACGGTCGCTACACTACTGCTCTATTTGTTCTAAAGGTTTTAAGGACAAATACTCTGTTAATGTGCATGTGAGAACTCACACCGGAGAGAAACCCTTTACATGTTCTTTATGCGGGAAAAGCTTTCGACAGAAGGCGCATCTCGCAAAACATTACCAGACACACATCGCACAAAAGAGCGCAGCTGCTAACGGTGGCTCTAAACCTTCGAAGAGGTAA

Protein sequence:

>DPOGS216089-PA
MTATAPELSKSDFFDFVTSNEVTDAQYKQQMRSVNVFMESPDSRSNPLLSEEPKEQNNNNILAVSGGQSTSAGMTTATSSNPLQSFDSIWNVDRDRDRIETAMLEDLNKYYWNQENDINGTHPCSDTAISNKLISNNTDGQIYTLTVLNQDINETNTNRYWVKEEDVSMSSPIDVEQNPSLDLESILNMNGFPNDFSQDTLKLSSNNLVKIEPFNYDDSEFQSSDKKDDSIISNPTLLEVEYNNNNNDWKLTDQNTESNESLLRSALQGKAFIRYSTIQKNTVNKFDAESELKRAIINNNNKPDVPSMYQDNKDTDSSLLMAIAPPNGNVNISILLEEPSATVLSNGDNPTSTQSMDDILLSQLDASYPDDYEKLKRIATELGESVQPFCTVEPIDSVRNVYNIHHVNGELVTMIPAGEVQLPQHLQIVTASPTVTSTKPGGKKIRRIQNKNSPPTTQTQSGTAVQAATSTSNGVRKERSLHYCSICSKGFKDKYSVNVHVRTHTGEKPFTCSLCGKSFRQKAHLAKHYQTHIAQKSAAANGGSKPSKR-