Monarch geneset OGS2.0

DPOGS212649
TranscriptDPOGS212649-TA816 bp
ProteinDPOGS212649-PA271 aa
Genomic positionDPSCF300319 - 7293-15272
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0136241e-5277.86% 
BombyxBGIBMGA013940-TA2e-4268.15% 
DrosophilaRx-PA4e-2587.30% 
EBI UniRef50UniRef50_D6WQD34e-3454.49%Retinal homeobox n=4 Tax=Neoptera RepID=D6WQD3_TRICA
NCBI RefSeqXP_973468.17e-3554.49%PREDICTED: similar to Retinal homeobox protein Rx (DRx1) (DRx) [Tribolium castaneum]
NCBI nr blastpgi|910869791e-3354.49%PREDICTED: similar to Retinal homeobox protein Rx (DRx1) (DRx) [Tribolium castaneum]
NCBI nr blastxgi|910869798e-3354.49%PREDICTED: similar to Retinal homeobox protein Rx (DRx1) (DRx) [Tribolium castaneum]
Group
Gene OntologyGO:00036779.4e-20DNA binding
GO:00063559.4e-20regulation of transcription, DNA-dependent
GO:00055154e-15protein binding
GO:00435654.7e-14sequence-specific DNA binding
GO:00037004.7e-14sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[160-211] IPR0122879.4e-20Homeodomain-related
[156-211] IPR0090574e-15Homeodomain-like
[166-211] IPR0013564.7e-14Homeobox
Orthology groupMCL26150 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212649-TA
ATGGACGTGTCCGAGGACAGACTGTACGAGGCTCAGACGCCGGACCGCTCCGACACCAACAGTCCGAGATCGCAGATGAGCAGCCCCAACTCCGCCTCCTCCATCAACGTGACCGACCAATCGATATCGCTGACGCAGCACCAACAGAACCTGGAGACTCTGAACCGCATGGGGCTGTTCTTCCACCAGCAAATGCACCTGAACCAAAGCTTCGACGCCGTGAAGAGCCGCCTCGGCCTGACCCAGGGCGGCGTGAGCCACGGCCCGCGGCACACCATAGACGCCATCCTGGGCTTGAGCGGCAGACAGAGGGTCGCGGACTACGAGCCGCGGAGAGACGACCCTTGTGATGCTGTGCCGGTGTCGCCCGGAGCCGTCGAGAGCGCCGGTGAAGGTTCCTGCAATAGTAACGATGGGTTCCAACCGCCGGCGGAGGACAAGTCGGCCAGCGACGACGAAGCGCCCCGGCCGGGGTCCGCGGACAAGAAGAAGCATCGCAGGAACAGAACCACCTTCACCACCTATCAGCTGCACGAACTAGAACGAGCCTTCGAGAAGAGTCACTACCCTGACGTATACTCCAGAGAGGAACTGGCCATGAAGGTGAACCTGCCAGAAGTCAGAGTGCAGAATCGACTCGAGGGTTCGATCCATAATGAACCTAAGGGGCTTATGTCACTCAATTCCCTATACGCAAGACGAACATATGTCAAACTGGTGCCAACGTTTCAAAAACTCTCAAAACCCCCCTTCGCCGTACAATCGAAAGGGAACAGAAACGATCGCTGGGCTCGTTCTAAAGTGGACGCTCTCTAA

Protein sequence:

>DPOGS212649-PA
MDVSEDRLYEAQTPDRSDTNSPRSQMSSPNSASSINVTDQSISLTQHQQNLETLNRMGLFFHQQMHLNQSFDAVKSRLGLTQGGVSHGPRHTIDAILGLSGRQRVADYEPRRDDPCDAVPVSPGAVESAGEGSCNSNDGFQPPAEDKSASDDEAPRPGSADKKKHRRNRTTFTTYQLHELERAFEKSHYPDVYSREELAMKVNLPEVRVQNRLEGSIHNEPKGLMSLNSLYARRTYVKLVPTFQKLSKPPFAVQSKGNRNDRWARSKVDAL-