Monarch geneset OGS2.0

DPOGS210243
Transcript: DPOGS210243-TA, 1251 bp
Protein: DPOGS210243-PA, 416 aa
Genomic position: DPSCF300196 + 574353-577576
RNAseq coverage: 148x (Rank: top 54%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL017755        5e-64    40.85%
Bombyx          BGIBMGA002377-TA  2e-56    35.32%
Drosophila      CG34406-PA        4e-27    24.62%
EBI UniRef50    UniRef50_F4WB35   1e-55    36.71%    Zinc finger protein 84 n=7 Tax=Formicidae RepID=F4WB35_ACREC
NCBI RefSeq     XP_001121352.1    5e-57    36.36%    PREDICTED: similar to Zinc finger protein 84 (Zinc finger protein HPF2) [Apis mellifera]
NCBI nr blastp  gi|383852561      4e-56    36.07%    PREDICTED: zinc finger protein 436-like [Megachile rotundata]
NCBI nr blastx  gi|110758157      2e-58    36.14%    PREDICTED: zinc finger and SCAN domain-containing protein 2-like [Apis mellifera]
Group
Gene Ontology:
  GO:0005634  4e-15    nucleus
  GO:0008270  4e-15    zinc ion binding
  GO:0003676  3.1e-09  nucleic acid binding
KEGG pathway:
InterPro domain:
  [20-92]    IPR012934  4e-15    Zinc finger, AD-type
  [300-328]  IPR013087  3.1e-09  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL18233, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210243-TA
ATGCGAACGTATAAAAGAAAGCGACCAGAATACACATTTATCGAGGGTGTTCATGAGCTGTGTAGACTTTGCCTGGAAAAAGCAACCGAATCGGTTCCTATATTTAGTAATGAATCGAATATTTATGCATCTCTATCAGTGAGAATCATGATGTGTGTAGGATTTGAGGTGAGCAGAGAGGATTGTCTTCCTAACAGTATATGTTCCACATGTCACAAACAATTGAGTAGTTTTTATGAATTTAGAAAAAAATGTGCTGTCATGTATCAAAAATTAAAATGCCACGTACTAGCTGTGAAGCAGATGGAAAGTGAAAAGGCTCTAAAACAAATTCAGGAAAATAATATAACTAAGGCATTCGAGAGTGACACAGGCAGGCATGTTCAGAACCATGGAACTGTTGACGCAAGTAATGAAACACAAATTATAAGTGATTCTGTGAGTCTTATTAAGAATACTATTACAGAAAATATTGAGGAGGCACCGGATATATCAGATTTCCTCTCATCAATATTATTAGAAATTGGTATCTTGACAAAACAGGACGGACAAGTCGTGTCAAAAGAACATCTCGAGATGATAGACATAGACAGTGGAGCTGATCGAGTTACATTCCAGTTGTTAGAGGTGGACGAGGGACAAAAAACCAGTGATATTATAAAGGATAGTGCAAATAGTGAATTATTAAATGTTAGGAATGTTAATATATTAGAAAAGGAACTGTTGAGACCCATCAATATTAAAAGCAGTGTCCCGCGATGTACGGAGTGCGGTAAGAGCTACGCGACCCGCGGCGCGCTCCGGAGACACGCCCGCGTACACACGGGAGAGAGGCCGCACGTGTGTCACGTGTGCTCCAGGGCGTTCGGACAGAAGGAGGTGCTCAGGAGGCACGAGCTGGTACATACAGAGGATCGTCCATTCAAGTGTCAGAACTGTCCGAAGAGCTTCACTCAGCGCGGCGCGTTGTTGTCTCACGGCCGCGCGTGTCTCCCGCCACCGGCCGCGCGCCCGCTCGCCCTCCACAGGTGTACTGTTTGTCCTAAAGTGTTCCTACACGCCTCCGGTCTGTCTCGTCACGCGTTGGTGCACGCGGGTCGTGTGTTCTCGTGCTCGCCGTGCGGCCGTCGCTACAGCGACCGCAGCTCGCTGCTGCGTCATCTCCGCTCACACAAACACGCGCGCACGCATACAGACACGCGGCGCGACGCGCACACTGACGACGCTACGGCTTTGACTGTCACTGGATGA

Protein sequence:

>DPOGS210243-PA
MRTYKRKRPEYTFIEGVHELCRLCLEKATESVPIFSNESNIYASLSVRIMMCVGFEVSREDCLPNSICSTCHKQLSSFYEFRKKCAVMYQKLKCHVLAVKQMESEKALKQIQENNITKAFESDTGRHVQNHGTVDASNETQIISDSVSLIKNTITENIEEAPDISDFLSSILLEIGILTKQDGQVVSKEHLEMIDIDSGADRVTFQLLEVDEGQKTSDIIKDSANSELLNVRNVNILEKELLRPINIKSSVPRCTECGKSYATRGALRRHARVHTGERPHVCHVCSRAFGQKEVLRRHELVHTEDRPFKCQNCPKSFTQRGALLSHGRACLPPPAARPLALHRCTVCPKVFLHASGLSRHALVHAGRVFSCSPCGRRYSDRSSLLRHLRSHKHARTHTDTRRDAHTDDATALTVTG-
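
Consistency check (illustrative sketch, not part of the original record):

The transcript length (1251 bp) and protein length (416 aa) in this record agree if the coding sequence ends in a single stop codon, since (416 + 1) x 3 = 1251. The Python sketch below illustrates that arithmetic and translates the first and last 21 nucleotides of DPOGS210243-TA with the standard genetic code. The function names `translate` and `expected_cds_length` are hypothetical helpers introduced here. The sketch also assumes that the trailing "-" in the protein sequence marks the stop codon and that the transcript is a complete coding sequence with no untranslated regions.

```python
# Standard genetic code, with codons enumerated in T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0.

    Stop codons are written as `stop_symbol` ("-" is assumed here to match
    the convention used in the protein sequence of this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def expected_cds_length(protein_length):
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length + 1) * 3


# First and last seven codons of DPOGS210243-TA, copied from the record.
first = translate("ATGCGAACGTATAAAAGAAAG")
last = translate("GCTTTGACTGTCACTGGATGA")

print(first)                     # MRTYKRK
print(last)                      # ALTVTG-
print(expected_cds_length(416))  # 1251
```

Both translated fragments match the start (MRTYKRK) and end (ALTVTG-) of DPOGS210243-PA. To check the whole record, pass the full nucleotide sequence to `translate` and compare the result with the full protein sequence.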