Monarch geneset OGS2.0

DPOGS206284
TranscriptDPOGS206284-TA1827 bp
ProteinDPOGS206284-PA608 aa
Genomic positionDPSCF300290 + 214138-220600
RNAseq coverage679x (Rank: top 19%)
Annotation
HeliconiusHMEL0169000.075.32% 
BombyxBGIBMGA010744-TA0.059.78% 
DrosophilaPep-PB4e-5734.58% 
EBI UniRef50UniRef50_P410736e-5534.58%Zinc finger protein on ecdysone puffs n=13 Tax=Schizophora RepID=PEP_DROME
NCBI RefSeqXP_001957697.12e-5833.27%GF23899 [Drosophila ananassae]
NCBI nr blastpgi|1947507595e-5733.27%GF23899 [Drosophila ananassae]
NCBI nr blastxgi|126192904e-7632.63%cathepsin B mRNA 3'-untranslated-region-binding protein CBBP [Sarcophaga peregrina]
Group
KEGG pathway 
Orthology groupMCL18012 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206284-TA
ATGGCGAATCGTAGACCCCAAGGCAGTGGCAGGCGTATGGATTTCGGTAGAAATGACCGCGGCAAGAATTTCCGTGGGGGTGTATCCCCATGGCAAGGGGGTGGTCCGGGTGGTGATCTGCCCAATCTTCTGCCATTGGGCGGAGGCCCGACTGAGGCGACGCTAGCTTTGGCCAGCAATCTCATAAATCTTCTCCAACCACGTCAAAACCCCGTCCCTTCGTTACTCGACATGCCAATACGACGTGACTTCGGCCCTAGCATGAACCGGTATGATCGTGGTTATGGACCAAACAGGATGGGCAATCAAGGTAATTTCCGGCGTTCGGGAAATTACAATCGCGCTGGTGAGCGTGTCCACAACAATCGTAAACCGTTCAGACCCAATGATGGGATCAGGCAACAAAACAAGAGCTCCCCGAAAAAGGATGCCGATTCAAAGGCTAAGCCTGTAAAGGAGAGTGAGGAAAAGGACCAGGAAGGTTCTGAAAGACCGGAGAAGAATGAGGAAGACAAGAAAGACGTTCCGAAGACTCGCTACGACGACATCAACCCTCAACTTCTGAAATGTCACATTTGTAACAAGTCTATGTGGGATGGGCGTTCGTTTGAAAACCATCTGAGCGGTCGCGCCCACGCTATTATGATGCAAAAAACTGCGGAGAGCTACGCGCTGACAGCTGATACGATGCGTCAAGAGTTTAAGATACGTGAAATGAAACGCACCCGTAAAACAGGTCAACAACCCCCCCGCGAGTTCTACTGCGCTATGTGTGATATATACGCTGGGGATGGTGCCATACATCGCACCACCGTGGGACACCGGAAATTGAAGAAATATCTACATCCAACATGTACTTCCTGTCACAAGGAAATGCCAACACGTATTGAACTGGACGAACATAGACTGACAGCTGAACACTTGAAGAACATGCAAGAGAAACAGGAAACTATATCCAAACCGAAGCCTGAAGTAATGGTGATATCAACACTTAGTATGGAACAGACTTACTTACGTGATGACCGTCAGCGATGGAGACGGGAGAAACACGATCGCAAGGATGAGAAACAGGATGATGCTGATAAAGAAGTCAAGAAGGAACAAGAAGGTGAAACAGAAGGAGAAGTGAAAGCAGAAGATAAAAAGGAATTAAGCATCGACAATGAAAACACTGTTCTAGATTACAAGGAAGGGGTTGAAATTACCAACGTTACCCCCGATATGTTACCCGCTTACAGTACCGACCGCGGCGTTGGCGCGTCCTTCTTGAGTGAATTCCGTTGCGTTCAATGCACTTTGTGTCACAAACTGTTGGATGGGGAGGAAACGGCACAAGTACATCTCAGGACTTGGCGACACCACCAGCTGTTTGTTAGACTCATCAATGAGAAAGCTGGGAATATACAGCCACAGTCAGAGGCGGTGAAGCGGTCAATCAATGATGATTCAGGAACATGGAAACGCAGGAAGACCTCCAGGGATGAAGAAAATGGACACGAAATTGTTAAAGATGATCATAATGAAGGTACAGAACAGAACCAAGAAAATCCTGAGAATATGAATGAGCTAGCAGCGGATGAGCTCGAGGATTGGGAACAGTCTGTGGATGATATACTCAATGATGAGAGCGAAATGATAGAGAAAGAGCTTGACACAGAAACACAAGAAGAAACACAGACAGAAGTAGAGGAAGAAAAGAAGAGTCCGCAAGAAACATTGACTTCAACAAGAGAAGAGAAAAATGGAAATGAATTAGAAGAAAAACCGGCGGAACTAAAGAAAACACCACAAAAAACAACGAGAGGTCGAGGTAGACGCCGCTTTTAG

Protein sequence:

>DPOGS206284-PA
MANRRPQGSGRRMDFGRNDRGKNFRGGVSPWQGGGPGGDLPNLLPLGGGPTEATLALASNLINLLQPRQNPVPSLLDMPIRRDFGPSMNRYDRGYGPNRMGNQGNFRRSGNYNRAGERVHNNRKPFRPNDGIRQQNKSSPKKDADSKAKPVKESEEKDQEGSERPEKNEEDKKDVPKTRYDDINPQLLKCHICNKSMWDGRSFENHLSGRAHAIMMQKTAESYALTADTMRQEFKIREMKRTRKTGQQPPREFYCAMCDIYAGDGAIHRTTVGHRKLKKYLHPTCTSCHKEMPTRIELDEHRLTAEHLKNMQEKQETISKPKPEVMVISTLSMEQTYLRDDRQRWRREKHDRKDEKQDDADKEVKKEQEGETEGEVKAEDKKELSIDNENTVLDYKEGVEITNVTPDMLPAYSTDRGVGASFLSEFRCVQCTLCHKLLDGEETAQVHLRTWRHHQLFVRLINEKAGNIQPQSEAVKRSINDDSGTWKRRKTSRDEENGHEIVKDDHNEGTEQNQENPENMNELAADELEDWEQSVDDILNDESEMIEKELDTETQEETQTEVEEEKKSPQETLTSTREEKNGNELEEKPAELKKTPQKTTRGRGRRRF-