Monarch geneset OGS2.0

DPOGS200326
Transcript        DPOGS200326-TA  1596 bp
Protein           DPOGS200326-PA  531 aa
Genomic position  DPSCF300026 + 228188-240673
RNAseq coverage   365x (Rank: top 32%)
Annotation
Heliconius      HMEL000061        6e-108  70.00%
Bombyx          BGIBMGA005575-TA  2e-53   75.00%
Drosophila      CG5245-PA         3e-29   26.46%
EBI UniRef50    UniRef50_D0ABB0   4e-88   68.72%  HM00061 protein n=1 Tax=Heliconius melpomene RepID=D0ABB0_9NEOP
NCBI RefSeq     XP_001602302.1    2e-32   50.00%  PREDICTED: similar to ENSANGP00000015396, partial [Nasonia vitripennis]
NCBI nr blastp  gi|261335925      1e-87   68.72%  HM00061 [Heliconius melpomene]
NCBI nr blastx  gi|261335925      3e-96   68.72%  HM00061 [Heliconius melpomene]
Group
Gene Ontology    GO:0003676           1.8e-11  nucleic acid binding
                 GO:0008270           7.6e-06  zinc ion binding
                 GO:0005622           7.6e-06  intracellular
KEGG pathway
InterPro domain  [66-101] IPR013087   1.8e-11  Zinc finger, C2H2-type/integrase, DNA-binding
                 [433-455] IPR007087  7.6e-06  Zinc finger, C2H2
Orthology group  MCL22097             Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200326-TA
ATGGAGTTAATGATAAAGCAGAGTCTCATAGCTAAGAGCAAGCTGGCTACAATAAAACGCTCTGAGGGGGTGACACACTTCACCTGTGTGGAATGCTCCGCTGAATATGATGATAAAGAGAAGCTTGAATTACACCTGTTCTCACACTATCATACTTATAGATTCATTTGCGGTGTCTGCGGGACAGGGTTGAAGCGGAAGGAACATTTAGACAGACATATGCAGGAACACACAGAGTACCGCCCGCATATATGCCCAGACTGTGGCAAAGGATTCAAGAGGAAAGAGCATCTCAATATACACATGACAATACACAAGGGTGATAAAAATTTGATGTGTTCTCTATGTCAAAAGGCCTTCTATCGCAAGGATCATCTGCAGAAGCACTTGCAGACTCACAACAAGAACTTCGTAAAACAAACTTCAATTCTGGAGCCAGACATAGACATCAAGCAAGAGACGGAGGAATATGAGGAGTTCAATCCGATCATAACAAATGTCATCGGTAATGATGAGGCACAGGCCTTCATGGAGACAGATGATCAGAATTCAAGCGAGTCCACATCCGATCAGAAGGATATAAAGCCAGAGCCTGTGGTCGATGCCTCGAGACCGTTTGTATGTCAGGACTGTGGCAAGAGTTACAAGAGGAAGGACCATCTCAAGATACATAGTTGGACACACATCAAGAGGGAGGTGCTCTGTGGACAGTGTGGAAGAGCATTCCACACGGCCGACCAGATGTTGGTTCACGTGAACCTGGTCCACATCCGCACACACGAGACCACGGGCGGAGTGGCGCAGCTGAGGGCGTTGCTCGGAGACCAGATCGACGTCGAGGTGCTGAACAACAGCTCGAGCCTACTGGTGGAGCGCGAGGCGTCGAGTCCGGAGCGCCGGCCCCACGAGTGTCCGGTCTGCCACCGGAAATTCAAAAGGAAACAACATCTGAAAGTCCACGCGAACGTACACTTCAAACAGACCCACACTGTGTGGTGCTCGCTTTGTAACGAAGGTTTCAGCGATAACACTCAGTTCGAGGGTCACCACTGCCAGTTCACGAACCAGAGCCAGGGTGAGGGGGCGGAGTACGAGGACACGCGGGACAGCCCCCCGCAGGACGCCAAGAAAGAAAACCACCCCATAGACTTCGTGGAGGTGGAGCTGAACGACCCGTCGAGCGAGTCCCGCCTGCCCCTGCCCCGGCGCGTGTATGTGTGCAAGTACTGCGGGAAGCCGTTCAAGAGAAAGGATCACTATAAGATACATCTGCACATACACACCGGCGTCAAGAGCTTCTTCTGCCCCGACTGTGGGAAAGGCTTCTACCGCAAGGATCATCTCCAGAAGCACATGATAGTTCACGCGAAGTTCAAGTTGAAGCCCAAAAACAAGAAGGAGGTGCCCGACCTCGTGCCCATAGACACCCTCAAGAAGGAAGTCAAGCCTGAGATTACTATACATGCTCCTTCGAACACCAAGCTCCGCATGCCGCTCCAGATCAAGGTTCCCTACCAGATGGTGATGAGCTTGGACAACGGGGAGCAGACAGCGGTCACCATAGACCCCACCGACGACACTCACATCGTCATATAG

Protein sequence:

>DPOGS200326-PA
MELMIKQSLIAKSKLATIKRSEGVTHFTCVECSAEYDDKEKLELHLFSHYHTYRFICGVCGTGLKRKEHLDRHMQEHTEYRPHICPDCGKGFKRKEHLNIHMTIHKGDKNLMCSLCQKAFYRKDHLQKHLQTHNKNFVKQTSILEPDIDIKQETEEYEEFNPIITNVIGNDEAQAFMETDDQNSSESTSDQKDIKPEPVVDASRPFVCQDCGKSYKRKDHLKIHSWTHIKREVLCGQCGRAFHTADQMLVHVNLVHIRTHETTGGVAQLRALLGDQIDVEVLNNSSSLLVEREASSPERRPHECPVCHRKFKRKQHLKVHANVHFKQTHTVWCSLCNEGFSDNTQFEGHHCQFTNQSQGEGAEYEDTRDSPPQDAKKENHPIDFVEVELNDPSSESRLPLPRRVYVCKYCGKPFKRKDHYKIHLHIHTGVKSFFCPDCGKGFYRKDHLQKHMIVHAKFKLKPKNKKEVPDLVPIDTLKKEVKPEITIHAPSNTKLRMPLQIKVPYQMVMSLDNGEQTAVTIDPTDDTHIVI-
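
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (the record does not state which translation table was used) and assuming the trailing "-" in the protein sequence marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first 30 and last 15 bases of DPOGS200326-TA are copied in, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed), built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 15 bases of DPOGS200326-TA, copied from the record.
cds_start = "ATGGAGTTAATGATAAAGCAGAGTCTCATA"
cds_end = "CACATCGTCATATAG"

print(translate(cds_start))  # MELMIKQSLI
print(translate(cds_end))    # HIVI*

# Length check: 531 codons for the protein plus one stop codon.
transcript_bp = 1596
protein_aa = 531
print(transcript_bp == 3 * (protein_aa + 1))  # True
```

The translated ends agree with the start (MELMIKQSLI) and end (HIVI) of DPOGS200326-PA, and the stated lengths are consistent: 1596 bp covers 531 codons plus a terminal TAG stop codon.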