Monarch geneset OGS2.0

DPOGS213107
TranscriptDPOGS213107-TA1602 bp
ProteinDPOGS213107-PA533 aa
Genomic positionDPSCF300016 + 125357-128547
RNAseq coverage435x (Rank: top 28%)
Annotation
HeliconiusHMEL0065480.085.56% 
BombyxBGIBMGA007867-TA0.079.78% 
DrosophilaCG1832-PB4e-14352.30% 
EBI UniRef50UniRef50_G6CSE70.0100.00%Zinc finger protein n=3 Tax=Endopterygota RepID=G6CSE7_DANPL
NCBI RefSeqXP_001848620.11e-15555.22%zinc finger protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700417602e-15455.22%zinc finger protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700417604e-15955.93%zinc finger protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00036764.3e-14nucleic acid binding
GO:00082702.1e-05zinc ion binding
GO:00056222.1e-05intracellular
KEGG pathway 
InterPro domain[430-456] IPR0130874.3e-14Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL15743 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213107-TA
ATGTTCACCACAAATACCGTGCAAGGCGGCTCAGTGCCCACACTACAATACCAGGAACATCCACAGAAATCACAAGGGAAAAACATTGAAAATGTTCAACAAAAAACACAACAGACTCAGCAAACCCAACAGGAGTTTCCAACATTTTGTTATACAACCAATGTAAATATGATAGGTAAAATTGGATCTGGAACAACTGCTGGGACGGGGGGAGTAACAGGGGGTGTAAACATAGCCCAGCTCACAACCAGTGATGACAAAACATGTTACATAGCCCAGCCTTTCTCTTATAATTATGCTCTAGTAAATCAAATGCAAATTGCACCAAACGGATTACAGAATACAATATCAAATATAAGTTTCAAGTGTGATGTCTGCGGCCTTATGTTTGGCCATCTCACTTTACTTAATACTCATAAACGGATGCATACACAGGATTCAGATAGTAATGTGGTTGTGGTATCTACTGGTGTGAGTACGGCCAATGATGTCAGCATGCCGCCCCATATTCAAATACTATCCACGGATCCCAATGAAACCCAACAACACCATGTCCAGTTACAAGATACTAAACCGGTGATAATAGAAAAAGTCCAGAAATGTATAACATGTGGAGGTGCCATAACAAACAACCCGAAAAGGAAAGGTCCAAAGCTAATACGCTGTGAAAACTGTATAGCACAAGATTCAGTAGAGCAAAGAAATAATCAATTAAATACTACTCAAATATTTGTAGCAGCTGAAAATAATGTTAAGTTTGAAGTGGGAGGTGCTAGCGGGGTTCAAGCGCCTGGGGATCCATTATCGAATGCATCCAACAACCAACAACAACAGAAACCCTTGCCCGGTCATCATCCTGTTAAAAAAAGAAACCTAGCTTCGGTTACGAAATGTCAAAACTGCAACGGTTCTGGTATAGTATTTGTCGGGGGAAGAGACAAAAGCAAAAACAATCAAGCAATAGCAGACAAGCCGTTCCATTGTAACATCTGTGGTGGTTCGTTTTCAAGATATTCGTCGTTATGGTCTCATAAGAAGTTGCATTCTGGCGAAAAGAACTTTAAATGTGGAATATGTGGGATTGCATTTGCGAAAGCTGTTTACCTCAAAAATCATTCGAGAATACACACTGGCGAAAAGCCGTATAGATGTCAAACATGTGGAATGCAGTTCTCACAGTCACCACATTTGAAGAACCATGAAAGAACACATTCCGGAGAGAAACCGTATGTCTGTGAGGTTTGTGACAAAGGGTTCGCTAGGCACGCGACCTTATGGAATCATCGTCGTATACATACGGGAGAGAAACCATACAAATGCGAAACGTGTGGCTCGGCGTTTAGTCAAGCCGCGCATCTTAAAAATCACGCCAAAGTGCATTCTGGTGAGAAGCCATTCAAATGTGACATTTGTACCGCTGCCTTCGCTGACCGCTTCGCTTTGAAACGACACAGAGGCATACACGACAAATATGGTCAAACAGCTCCTCTGCCTTCAAGAATACAGCTCACCCAGCAGGAACAGTCACAACAACAGACAAACAACGAACAGCAAGCAACACCGACCGTGGAGGTTGAACATCGCACCGAACAAACTCTATGA

Protein sequence:

>DPOGS213107-PA
MFTTNTVQGGSVPTLQYQEHPQKSQGKNIENVQQKTQQTQQTQQEFPTFCYTTNVNMIGKIGSGTTAGTGGVTGGVNIAQLTTSDDKTCYIAQPFSYNYALVNQMQIAPNGLQNTISNISFKCDVCGLMFGHLTLLNTHKRMHTQDSDSNVVVVSTGVSTANDVSMPPHIQILSTDPNETQQHHVQLQDTKPVIIEKVQKCITCGGAITNNPKRKGPKLIRCENCIAQDSVEQRNNQLNTTQIFVAAENNVKFEVGGASGVQAPGDPLSNASNNQQQQKPLPGHHPVKKRNLASVTKCQNCNGSGIVFVGGRDKSKNNQAIADKPFHCNICGGSFSRYSSLWSHKKLHSGEKNFKCGICGIAFAKAVYLKNHSRIHTGEKPYRCQTCGMQFSQSPHLKNHERTHSGEKPYVCEVCDKGFARHATLWNHRRIHTGEKPYKCETCGSAFSQAAHLKNHAKVHSGEKPFKCDICTAAFADRFALKRHRGIHDKYGQTAPLPSRIQLTQQEQSQQQTNNEQQATPTVEVEHRTEQTL-