Monarch geneset OGS2.0

DPOGS215704
Transcript        DPOGS215704-TA   1314 bp
Protein           DPOGS215704-PA   437 aa
Genomic position  DPSCF300041 - 119016-120442
RNAseq coverage   58x (Rank: top 69%)
Annotation
Heliconius        HMEL009636         3e-166   62.95%
Bombyx            BGIBMGA000834-TA   2e-112   59.53%
Drosophila        rgr-PA             4e-15    26.70%
EBI UniRef50      UniRef50_D7EI31    2e-19    25.16%   Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D7EI31_TRICA
NCBI RefSeq       XP_001850004.1     1e-18    23.29%   conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp    gi|270016681       7e-19    25.16%   hypothetical protein TcasGA2_TC006842 [Tribolium castaneum]
NCBI nr blastx    gi|270016681       2e-26    24.67%   hypothetical protein TcasGA2_TC006842 [Tribolium castaneum]
Group
Gene Ontology     GO:0005634   1.5e-09   nucleus
                  GO:0008270   1.5e-09   zinc ion binding
KEGG pathway
InterPro domain   [13-62] IPR012934   1.5e-09   Zinc finger, AD-type
Orthology group   MCL26826   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215704-TA
ATGGAAACTCCTCATGTATCAATATTAGAAGATCCGACAATACACCTGCACATCAAATCCTGCCTCCCAATAACTATAAATGCAGGTGATCACCTGCCAAAATCAATTTGTGAATCCTGTATATCGCAATTGAATGATTTTTACAATTTCCAATTAAACGCACGATGTTCGCAGGATTGGTTGGAAACATCGCTTCAAGAAAAGACAAGAAAGACAAACGAAATAAAAACTTATGTACAACCTCTTCCTGACTCAGAATATAATTCTGATTCCCTTCTGGAGTTCCTGAATAACACGGTAAACATTGATGAATATCTAAATAATCTAGGTAAAGAAGACATTCCTGGGATAGTGAATATGTTAGATAGAACTGTCGGCCCGGAGTACAAAATGAATAACAAATTAAACAAGGCACCTAGTCCAAAAAAGAAAGAATCACCAAATAAAACCAAAAAACCTAAAAAAATGGATATAGATATCCTAGACTCTGATATAGAGGTTGTGGAAGAGTTATTAAAGAAGGAAATTGAGCCAAAGAGGAGGAATATAGACAAACAATATTCCTGTTTTGCATGTAAGACCAAAGAGGATAACATAAAGAAACTCTCTCACCATTTAAGTATATGTGACAATGCATTACGTACTTGTGTACACTGTGGTGTTTTATTTGACTCCAAGCAAAAGATGCGCCAACACTTACTGACTCATAGTGTGCTGACACCATTAACATGCAACTGTGGACAACAATTTGATACTAAAGAGAGGCTTTTAGCTCACTGTAGGAAGTGTGAAATAGATCACATTTCCTCTATGGGATTTCTATATTCTTGTAAACAGTGTGGAGAGACATTCAGTGAGAGATTTCCACTTTATAAACATGCAAAAGATCACATACAGAAATCACAAGAGAGAGTTTGTGATGTGTGTGGACACACCTTCATTGGAAATGAGGCTTTGTTTAAGCACAGGAAGGAAGAACATGAGAAAGCCGAGAAGGTTTCCTACAGGTGCAAAGTATGTAGTTTCTCTACAGCGGACCGCAAATTAATTTACAGCCATGTACAGAAGCACACAGAAATAAAAGAGCCTAATCGTCATTTGTGCGAATTGTGCGGCCGGAGATTTGCCACTCACGCCACATTGCAAAGGCACTCGTCAAAACACGCTTCAAATATTTCCAAATGCCACATTTGCCACAAACAGTTTGCAAATTTAAAATCTAAGGAGGAACATTTATTGGAACATATAAAGATTGTGATGTGCGAGAAATGTGGTCAGACTGTTAATAGTCTGGATAAACATCAATGTATGTAA

Protein sequence:

>DPOGS215704-PA
METPHVSILEDPTIHLHIKSCLPITINAGDHLPKSICESCISQLNDFYNFQLNARCSQDWLETSLQEKTRKTNEIKTYVQPLPDSEYNSDSLLEFLNNTVNIDEYLNNLGKEDIPGIVNMLDRTVGPEYKMNNKLNKAPSPKKKESPNKTKKPKKMDIDILDSDIEVVEELLKKEIEPKRRNIDKQYSCFACKTKEDNIKKLSHHLSICDNALRTCVHCGVLFDSKQKMRQHLLTHSVLTPLTCNCGQQFDTKERLLAHCRKCEIDHISSMGFLYSCKQCGETFSERFPLYKHAKDHIQKSQERVCDVCGHTFIGNEALFKHRKEEHEKAEKVSYRCKVCSFSTADRKLIYSHVQKHTEIKEPNRHLCELCGRRFATHATLQRHSSKHASNISKCHICHKQFANLKSKEEHLLEHIKIVMCEKCGQTVNSLDKHQCM-
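
The transcript and protein entries above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that assumes the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the `stop_symbol` parameter are hypothetical names chosen for illustration. It translates the first and last codons of DPOGS215704-TA and checks that the reported lengths (1314 bp, 437 aa) are consistent.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stops are written as stop_symbol.

    The default "-" is an assumption that mirrors the record's protein entry.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 15 nucleotides of DPOGS215704-TA, copied from the record.
first_codons = "ATGGAAACTCCTCATGTATCAATATTAGAA"
last_codons = "CATCAATGTATGTAA"

print(translate(first_codons))  # METPHVSILE
print(translate(last_codons))   # HQCM-

# 437 residues plus one stop codon account for the full 1314 bp transcript.
print(3 * (437 + 1) == 1314)    # True
```

Both fragments match the start and end of DPOGS215704-PA, and passing the full nucleotide sequence to the same function would give the complete comparison.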