Monarch geneset OGS2.0

DPOGS202752
TranscriptDPOGS202752-TA1113 bp
ProteinDPOGS202752-PA370 aa
Genomic positionDPSCF300335 - 23169-54785
RNAseq coverage48x (Rank: top 71%)
Annotation
HeliconiusHMEL0058581e-17497.39% 
BombyxBGIBMGA010428-TA2e-7890.79% 
Drosophilagl-PA1e-3240.34% 
EBI UniRef50UniRef50_Q17NF22e-13779.74%Zinc finger protein n=6 Tax=Neoptera RepID=Q17NF2_AEDAE
NCBI RefSeqXP_001812645.13e-15475.28%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastpgi|1892414515e-15375.28%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastxgi|1892414515e-15475.49%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
Group
Gene OntologyGO:00036766.8e-13nucleic acid binding
GO:00082705e-07zinc ion binding
GO:00056225e-07intracellular
KEGG pathway 
InterPro domain[271-297] IPR0130876.8e-13Zinc finger, C2H2-type/integrase, DNA-binding
[224-245] IPR0070875e-07Zinc finger, C2H2
Orthology groupMCL14730 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202752-TA
ATGGAGACGTTAGTTCTAATTCCTCAAGAACTTAGTCTCGCTCTACGGCCAGGGGGAGCTAGGGGGAGAGAAGCTTCTGTCAGCGTTTGGACATCAACTGAGATTCAGAGGGGTGCATTATTATACCCCTTTCAAGGGACTATTAGGATCGACAAACTTCATGTCTACTCGTATGTACCTGATCACGATGTACGTCATAGATTTGGGCTGTTTGATGAAGTGTGTACTTCAGCAGGAGTCCAAGTTCGCCACTGTAATTGGGTTCGATTTCTACGGGTGTCAGAAAACTATGGACCACAGGTGAACCTAGTCTGTACAAAAGTAAAGGGAGAACCAGTGTACGAAGCGGTTAAACCAGTATCCGCGCACACTGAGCTGGTAGTCTACTATCTTCCAGAAAGACCTGAAGAGCTATTCTTCGTGAGAATGAGGAATAATCTGTACAGACAGACAATGGATTCGATACTGGAAGATTCACCTTTAGATCTTTCCACATCATTACTCTCAAGAGTTCTTCTACCAATATCTCCTCCATCAGGAGCTGAAGATGAACACAAATCTGTATCTGGTGACTCCTTAACATCTTCGTCAGCTGGATCATCGACTGAACTTTTAGAAATGACACCAAAAATTCAAAGGAGGCCTGCTAAGTCTGAAAGGGCTTTGCTCCCTTGCGAAGTTTGTGGAAAGGCATTCGACAGGCCATCACTTTTAAAAAGGCATATGCGAACACATACCGGTGAAAAGCCTCATGTTTGCATGGTATGTAATAAAGGCTTTAGTACATCCAGCAGTCTTAATACACATAGAAGGATACACTCTGGTGAAAAGCCTCATCAGTGTCAGCATTGTGGGAAAAGATTTACGGCGAGTTCAAATCTCTATTACCATAGAATGACACATATAAAGGAAAAGCCTCACAAATGCAGTCTGTGTTCGAAGTCGTTCCCGACCCCTGGCGACCTGAAATCTCATATGTATGTCCACAGCGGTTCGTGGCCTTACAAGTGCCATATCTGCTCCAGAGGCTTTTCAAAACACACCAATCTGAAGAACCATCTCTTCCTTCACACTGCCAAGCATCAACGCAATAGCACCAGGGACACAACTTGA

Protein sequence:

>DPOGS202752-PA
METLVLIPQELSLALRPGGARGREASVSVWTSTEIQRGALLYPFQGTIRIDKLHVYSYVPDHDVRHRFGLFDEVCTSAGVQVRHCNWVRFLRVSENYGPQVNLVCTKVKGEPVYEAVKPVSAHTELVVYYLPERPEELFFVRMRNNLYRQTMDSILEDSPLDLSTSLLSRVLLPISPPSGAEDEHKSVSGDSLTSSSAGSSTELLEMTPKIQRRPAKSERALLPCEVCGKAFDRPSLLKRHMRTHTGEKPHVCMVCNKGFSTSSSLNTHRRIHSGEKPHQCQHCGKRFTASSNLYYHRMTHIKEKPHKCSLCSKSFPTPGDLKSHMYVHSGSWPYKCHICSRGFSKHTNLKNHLFLHTAKHQRNSTRDTT-