Monarch geneset OGS2.0

DPOGS204086
Transcript: DPOGS204086-TA (1365 bp)
Protein: DPOGS204086-PA (454 aa)
Genomic position: DPSCF300350 - 79005-82335
RNAseq coverage: 416x (Rank: top 29%)
Annotation
Heliconius       HMEL009132        7e-169  67.67%
Bombyx           BGIBMGA011238-TA  0.0     64.97%
Drosophila       CG8924-PB         6e-38   56.90%
EBI UniRef50     UniRef50_D2A5R5   1e-48   73.91%  Putative uncharacterized protein GLEAN_15126 n=1 Tax=Tribolium castaneum RepID=D2A5R5_TRICA
NCBI RefSeq      XP_973299.1       3e-49   73.91%  PREDICTED: similar to BTB/POZ domain-containing protein [Tribolium castaneum]
NCBI nr blastp   gi|91084819       5e-48   73.91%  PREDICTED: similar to BTB/POZ domain-containing protein [Tribolium castaneum]
NCBI nr blastx   gi|91084819       3e-45   44.62%  PREDICTED: similar to BTB/POZ domain-containing protein [Tribolium castaneum]
Group
Gene Ontology    GO:0005515  5.2e-21  protein binding
KEGG pathway
InterPro domain  [5-115]    IPR011333  9e-27    BTB/POZ fold
                 [27-118]   IPR013069  5.2e-21  BTB/POZ
                 [33-128]   IPR000210  3.7e-15  BTB/POZ-like
                 [334-395]  IPR009057  1.3e-11  Homeodomain-like
Orthology group  MCL22260  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204086-TA
ATGGCACTCCCACCGCAGCAGTTCTGTGTGAGGTGGAACTCCTATCATACTAATTTACAGGCAGTATTCCCTCGCCTATTACTAACGGAGCAGTTTGCTGACGTCACATTAGCCTGCGAATCAAAACAGTTGAGATGTCACAAGTTGGTGCTGTCAGCTTGTTCAGCGTACCTGGAACGGCTCCTGCTGCAGAACCCCTGCAAGCATCCCATCGTGCTCATGAGGGACATGCGATTCAGTGAAATGCAAGCCCTAGTGGACTTCATGTACAAGGGGGAAGTGAACGTCACGCAGGAGGAGCTCCCGAGCCTGTTGAAATCCGCCGAAGCCCTACAGATAAGAGCAACACAGAAAACACCGAAGAAGAGCAAATCAGAAGAAAGGGAAAAGACGGACGTCAAAGAAGAGACGGTGGAGGAAGACGGCCTCTACTACGGAGAGCTGGAGAACGAGCACATGGACTATAACGAGGACGAGGAATCAGGCAAGCCGAGTAGTAGTTTAGGACAGACGTCAGGTCGGTATATACGAGTGAAGCCGGAGAGCGAGCTGTTCTTCCAGAATAAGCTGATGGAGAACGCGGCGTACGTTAAGAGACTGGCGAAGATGAATCCAGCCGACATGCAAAAGGAGTTCACCAAGTACGGCCTCACTAACATGGAGGATATGCAGAGAAATGAGAACGATTACTACAAACTGTGGCTGGAACAGAACGGGGAGCTGGAGGTGTCGCTGCTGAAGACCAACCAGGTCAAGAAGAAGAACGACGTCAAGAACCAGAAGTCGGAAGTCGAGCTGATACCAAGGAATAGAGAGAATTACCCCAAAATGGGGGAGAAAATGGCAAGAAAACGCGGGAGGCCGCCAATCTTTAAGGATAAGAACGTGGATATCAGCGAGACCGTGCTTAAGGCGGATCTGGATCAGTTCCTGGAGGGCAAGGAGGTGAGCTCGTCAGGGTTGTCGAGGAAGGATCTAGAGAGAGTCATGATGGGCAAGTTCAACCCTAACAGAAGATACTCCAACGAAGCCATGTGGGCGGCTCTGATGGACGTGAAGAAAGGCGGCAGCATATACAGGGCGGCCCAAACCCACAAGGTCCCCCGCAAGTCCCTGAGGAACTGGATGAAGAGGTGCCACATAAAATCCTCGTTTCCTATGCCGCAGCAGCTAAAGCAGTTCGTGGAGAACAGCAAGAAGCAGAGAGATTATCAGCACCAAGAGAACGATAAGAGCGATAACGTTGATGACAACGAGTTCGGGAAGCACTTCCTCTTCAATCAGGTGTCCAGCGAGGATGAAGTCAGGGACAAGACAAACGCCAGCGATGGCAACATCGCCCTAGACATGTCGCTGCACGAATGA

Protein sequence:

>DPOGS204086-PA
MALPPQQFCVRWNSYHTNLQAVFPRLLLTEQFADVTLACESKQLRCHKLVLSACSAYLERLLLQNPCKHPIVLMRDMRFSEMQALVDFMYKGEVNVTQEELPSLLKSAEALQIRATQKTPKKSKSEEREKTDVKEETVEEDGLYYGELENEHMDYNEDEESGKPSSSLGQTSGRYIRVKPESELFFQNKLMENAAYVKRLAKMNPADMQKEFTKYGLTNMEDMQRNENDYYKLWLEQNGELEVSLLKTNQVKKKNDVKNQKSEVELIPRNRENYPKMGEKMARKRGRPPIFKDKNVDISETVLKADLDQFLEGKEVSSSGLSRKDLERVMMGKFNPNRRYSNEAMWAALMDVKKGGSIYRAAQTHKVPRKSLRNWMKRCHIKSSFPMPQQLKQFVENSKKQRDYQHQENDKSDNVDDNEFGKHFLFNQVSSEDEVRDKTNASDGNIALDMSLHE-
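
The transcript and protein lengths above are consistent with each other: 1365 bp is 455 codons, that is 454 residues plus one stop codon, which the protein record appears to write as a trailing "-". A minimal Python sketch of that check follows. It assumes the standard genetic code (NCBI translation table 1) and assumes that "-" marks the stop; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the trailing character of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS204086-TA, copied from the record above.
print(translate("ATGGCACTCCCACCGCAGCAGTTCTGTGTG"))  # MALPPQQFCV
print(translate("CTGCACGAATGA"))                    # LHE-

# Length check: 1365 bp -> 455 codons -> 454 residues plus one stop.
print(1365 // 3 - 1)  # 454
```

Passing the full DPOGS204086-TA sequence to `translate` and comparing the result with the DPOGS204086-PA record would extend the same check to every residue.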