Monarch geneset OGS2.0

DPOGS215796
Transcript          DPOGS215796-TA    1401 bp
Protein             DPOGS215796-PA    466 aa
Genomic position    DPSCF300041 + 2097281-2100121
RNAseq coverage     142x (Rank: top 55%)
Annotation
Heliconius        HMEL005929          0.0      67.94%
Bombyx            BGIBMGA007089-TA    5e-31    28.71%
Drosophila        Meics-PA            7e-32    24.55%
EBI UniRef50      UniRef50_F4W481     1e-55    32.79%    Serendipity locus protein H-1 n=7 Tax=Formicidae RepID=F4W481_ACREC
NCBI RefSeq       XP_972099.1         1e-63    37.91%    PREDICTED: similar to CG10366 CG10366-PA [Tribolium castaneum]
NCBI nr blastp    gi|91094305         2e-62    37.91%    PREDICTED: similar to CG10366 CG10366-PA [Tribolium castaneum]
NCBI nr blastx    gi|91094305         1e-65    38.30%    PREDICTED: similar to CG10366 CG10366-PA [Tribolium castaneum]
Group
Gene Ontology       GO:0003676    7.4e-13    nucleic acid binding
                    GO:0005634    2.1e-11    nucleus
                    GO:0008270    2.1e-11    zinc ion binding
                    GO:0005622    2.8e-06    intracellular
KEGG pathway 
InterPro domain     [290-317]    IPR013087    7.4e-13    Zinc finger, C2H2-type/integrase, DNA-binding
                    [11-84]      IPR012934    2.1e-11    Zinc finger, AD-type
                    [271-293]    IPR007087    2.8e-06    Zinc finger, C2H2
Orthology group     MCL18278    Insect specific

Nucleotide sequence:

>DPOGS215796-TA
ATGGTTGAATCGAATTCGGATTTCGAAAGATGTTGTCGGTTGTGTGCAGAAGAGCAAGAAGTTACCATAATGATTTTCAGCAAAGAAGCTGAAGTCATGCTTTTGCAAAATAAATTAAATAAATACTTACTTATAGAGGTCGATGAAGATGATAAATTGCCGAAAAATATATGCATCCAATGCTGTTCAAAGTTACAAATAGTAAGTGAATTCATTGACAATGCACACAGAGCTCAGGAGGTTTTACTTAAGCAAAGTCTGATGTTAGAAGATAATGATGAAAATAAACCTATTTTGATTACTGACATCAAATCTGAAACAGATTTTGACAGTGATACAAAATGCATGGAAGTAAATGTTGATCCAATGATGGTTCTTCAAAACTCTGAAGTGGAGGAGGCACCTAATTTAGAAAAATCTTGTGAATCCGTCGAATATGAAGATGAAATAACTTACCTGAATGGCGCTGATGGTGAAAATGTAACAATCAAATTAATTAAAAAAGGTGATAAATTGATGGAAGACGATAAAAAAGACACCAAGCCCTTTCAATGCATCACCTGTAATAGGGGCTACTATACAGAGTTGGCCCTCAAAAACCATTCATGGATACATTTCAATGAAGATAAAATTGTTAAACCCTTTAAATGTAGCTCTTGCGGTGACCAGTTTGAATATAAAAATGAATTGATATCACATCTTAAAGAACATAGAACACGAGGAATGTGTAATATATGTGGACGACTTTTCAGGAATGAAAACAATTTAGTGGAGCATATGGAAGCACACACTTCAACGAGTAGTAGATCATACACATGCAAAGTTTGTGGCCGCTCCTATAACACAAGTAGCAATTTAAAAACACATATGGTTACCCACAGCAATGAAAGACCTTACAAGTGCATTTATTGTAAGAAGAGTTTTAAGAGAAATCAAGACTTAAAGTTTCATATCAATCAGCACACAGGTGCCAAGCCATACAAATGTCCGTATTGTGACAAAAGTTTTGCTAGTTCTGGTAATTGTTATTCTCACAAGAGTAGAATGCATCCAGAAATTTCAATAGGCGATGGTAAGATGAAGAAGGATATAAGCGAAAAAATAAAGGAAACAAAGAAGCCAGTGATGAAACAACAATTAAAACTTAGGCCGATTGCTCCCAAACCTGTGGTACTGAAAGCTAATTTTAAATATCAATGTACTATATGCCATCATAGCTTTATGAAAAGGGATAATTTTATGTATCACATGTACCAGCATACGGGTGAAAAACCATTTCACTGCTTGTATTGCGATGAAAAGTTTGTAACAAGGAAAGGCCTTTTAATACACCATGATATAGTTCATAATGGGCAAGATCGGCCCTTAGCACTGCTGTCAAAAAACGTATTATTAAAGTAA

Protein sequence:

>DPOGS215796-PA
MVESNSDFERCCRLCAEEQEVTIMIFSKEAEVMLLQNKLNKYLLIEVDEDDKLPKNICIQCCSKLQIVSEFIDNAHRAQEVLLKQSLMLEDNDENKPILITDIKSETDFDSDTKCMEVNVDPMMVLQNSEVEEAPNLEKSCESVEYEDEITYLNGADGENVTIKLIKKGDKLMEDDKKDTKPFQCITCNRGYYTELALKNHSWIHFNEDKIVKPFKCSSCGDQFEYKNELISHLKEHRTRGMCNICGRLFRNENNLVEHMEAHTSTSSRSYTCKVCGRSYNTSSNLKTHMVTHSNERPYKCIYCKKSFKRNQDLKFHINQHTGAKPYKCPYCDKSFASSGNCYSHKSRMHPEISIGDGKMKKDISEKIKETKKPVMKQQLKLRPIAPKPVVLKANFKYQCTICHHSFMKRDNFMYHMYQHTGEKPFHCLYCDEKFVTRKGLLIHHDIVHNGQDRPLALLSKNVLLK-
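
The transcript and protein lengths listed in this record can be cross-checked against each other. The short sketch below assumes that the transcript consists of the coding sequence only (no UTRs) and that it includes the terminal stop codon, which matches the nucleotide sequence above starting with ATG and ending with TAA. The function name `expected_cds_length` is hypothetical and used only for illustration.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp implied by a protein length.

    Each residue is encoded by one codon (3 bp); the stop codon adds
    3 more bp when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the Transcript and Protein lines of this record.
transcript_bp = 1401
protein_aa = 466

# Assumption: the transcript is CDS only, stop codon included.
print(expected_cds_length(protein_aa) == transcript_bp)
```

Under these assumptions, 466 codons plus one stop codon give 467 x 3 = 1401 bp, which agrees with the listed transcript length.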