Monarch geneset OGS2.0

DPOGS208437
TranscriptDPOGS208437-TA1419 bp
ProteinDPOGS208437-PA472 aa
Genomic positionDPSCF300095 + 162561-166410
RNAseq coverage23x (Rank: top 78%)
Annotation
HeliconiusHMEL0072225e-13380.91% 
BombyxBGIBMGA008844-TA5e-7094.35% 
Drosophilasob-PA1e-8866.26% 
EBI UniRef50UniRef50_D6WW962e-10960.77%Sister of odd and bowl n=2 Tax=Tribolium castaneum RepID=D6WW96_TRICA
NCBI RefSeqXP_972035.16e-11160.10%PREDICTED: similar to sister of odd and bowl CG3242-PA [Tribolium castaneum]
NCBI nr blastpgi|910885231e-10960.10%PREDICTED: similar to sister of odd and bowl CG3242-PA [Tribolium castaneum]
NCBI nr blastxgi|910885233e-11960.35%PREDICTED: similar to sister of odd and bowl CG3242-PA [Tribolium castaneum]
Group
Gene OntologyGO:00036765.8e-12nucleic acid binding
GO:00082703.9e-05zinc ion binding
GO:00056223.9e-05intracellular
KEGG pathway 
InterPro domain[403-435] IPR0130875.8e-12Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL17832 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208437-TA
ATGCAAGAGCAAAGTACGTCGCGAAACAAGGCAACACTCAATTCGTTGGAAAGCGCTATAATAAAGTTGAAAAATAACCTTCAGATACAAGACACCAAAACAGATATGAGTGGTTTACTAAGCCTCGAAACGTCAAATTGCGAAGAAACTAAATGTAGTTCGAGCAAGGGAAGTACAAACTCTACAGCCAATATAAACATATCTATAGCTTCAGCGGCACTTCTACAGGAGAATATGCTGCAGCGATCTTTACAAGGTGTAGTAATTTCTTTACAAAATGCTATGATGAATTCATTACAGCAGGCTGCCCTTTTGCCATCTAATTCAGCAGCTGCGGCAGCCTTAAATTTACAAGCATTAGAATCTTATTTAACCTTACAACGCCTTACCTCCGTTTCATCTGTCAGTACATCTCCTAGTACAACAGAAAATAACACATCTGTGAAACAAGATATATTAAGGATAGCGACAGCAAGTGCAAAGATAACTGAGACTGATGTTTCAGAAAGTACATCACGAGTTATGTCCCCGAAAACACCCGAAGAATACCCGTTATCTGACGCAATTGCTGCTGAAGATGTTGATTTATCTGAAGAAGTTGAAGAAGATTTAGCCTTATTAGATAATGAAGACGAATTTACATTAGAGCAACATTTAGAATCTTCATCGAAAAGCTTCAGCTATAGTTCATCACAGAATTTAGACAACGGCTATCCGAGCTTACTAATGAATGCTATGTATCAACAACAAATCAATAGAAGTCCTTCAAATCCACCGCTCTCACCACAGCCTTCAAACTCCTCTAGCACAAAATCTACAAACAGTTCCACGTCATCAACAAGCAGCAGTATATCGAATAAGCAAAAAACTTCTAGTAGCGGCTCAAGAACGAAGAAACAGTTTATTTGTAAATTTTGCAATCGTCAATTTACAAAATCTTATAATCTTCTCATACATGAAAGGACTCATACCGATGAAAGACCATATTCCTGTGACATTTGCGGTAAAGCTTTCCGCAGACAAGACCATTTAAGAGACCACAGATATATACATTCAAAGGAGAAGCCGTTCAAGTGTCTCGAATGTGGAAAAGGTTTTTGCCAGTCGAGAACATTAGCTGTACATAAAATATTGCACATGGAAGAATCCCCACACAAATGTCCGGTTTGCAGCAGAAGTTTCAATCAAAGATCTAATCTGAAGACGCATCTATTAACGCACACCGATATAAAACCTTACAACTGTACGTCATGCGGGAAGGTTTTTAGACGCAATTGTGATTTGAGACGCCACAGCCTGACTCATAATCTTGTTGGAATGTCGTCATTAAACACTTCCCCAAGCGGAGCAGACGGGAAGAGCCCTTTACTTAATCCAAATTTTCCTATCGATAATTTAGATACTGAAGAGGAAAACTGA

Protein sequence:

>DPOGS208437-PA
MQEQSTSRNKATLNSLESAIIKLKNNLQIQDTKTDMSGLLSLETSNCEETKCSSSKGSTNSTANINISIASAALLQENMLQRSLQGVVISLQNAMMNSLQQAALLPSNSAAAAALNLQALESYLTLQRLTSVSSVSTSPSTTENNTSVKQDILRIATASAKITETDVSESTSRVMSPKTPEEYPLSDAIAAEDVDLSEEVEEDLALLDNEDEFTLEQHLESSSKSFSYSSSQNLDNGYPSLLMNAMYQQQINRSPSNPPLSPQPSNSSSTKSTNSSTSSTSSSISNKQKTSSSGSRTKKQFICKFCNRQFTKSYNLLIHERTHTDERPYSCDICGKAFRRQDHLRDHRYIHSKEKPFKCLECGKGFCQSRTLAVHKILHMEESPHKCPVCSRSFNQRSNLKTHLLTHTDIKPYNCTSCGKVFRRNCDLRRHSLTHNLVGMSSLNTSPSGADGKSPLLNPNFPIDNLDTEEEN-