Monarch geneset OGS2.0

DPOGS207179
Transcript: DPOGS207179-TA, 969 bp
Protein: DPOGS207179-PA, 322 aa
Genomic position: DPSCF300001 + 4981165-4982525
RNAseq coverage: 42x (Rank: top 72%)
Annotation
Heliconius      HMEL002096        3e-97   89.04%
Bombyx          BGIBMGA000639-TA  8e-111  92.57%
Drosophila      mirr-PB           2e-60   52.44%
EBI UniRef50    UniRef50_D6WIL9   5e-80   70.93%  Mirror n=3 Tax=Endopterygota RepID=D6WIL9_TRICA
NCBI RefSeq     XP_971676.1       2e-80   70.93%  PREDICTED: similar to iroquois-class homeodomain protein irx [Tribolium castaneum]
NCBI nr blastp  gi|270004889      2e-79   70.93%  mirror [Tribolium castaneum]
NCBI nr blastx  gi|270004889      1e-89   64.73%  mirror [Tribolium castaneum]
Group
Gene Ontology  GO:0003677  4.7e-22  DNA binding
               GO:0006355  4.7e-22  regulation of transcription, DNA-dependent
               GO:0005515  3.1e-17  protein binding
               GO:0005634  5.9e-14  nucleus
               GO:0043565  8.6e-12  sequence-specific DNA binding
               GO:0003700  8.6e-12  sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain  [60-126]  IPR012287  4.7e-22  Homeodomain-related
                 [41-117]  IPR009057  3.1e-17  Homeodomain-like
                 [75-114]  IPR008422  5.9e-14  Homeobox KN domain
                 [57-122]  IPR001356  8.6e-12  Homeobox
Orthology group: MCL16046, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207179-TA
ATGGAAACTGGATATCTATGCGTGGTCTGTGAACGAATGCGTACGGAGCTAAGAGATAAGCCGCAGCTTAAAGCAAAAGCAGTCGATGGCTTATCGCGATTTCAATTATCATGCTGGCGCTTCCATTGGCTCTTTGGAGGGAAGAGGGGAGGATACGGCATGGATTTGAACGGTGCTAGAAGAAAAAATGCCACTCGAGAAACTACAAGTACTCTTAAAGCATGGCTTAACGAACACAAGAAAAATCCATATCCAACAAAAGGAGAAAAGATAATGCTGGCGATAATTACAAAAATGACGCTAACACAAGTGTCCACGTGGTTTGCAAATGCTCGACGTCGCCTCAAGAAGGAGAACAAAATGACTTGGGAGCCGAGAAACAGAGTTGACGACGATGACAATAATAACGATGACGACGATCACAAAAGTAATGATGGAAAAGACGCTCTAGATGGAAAAGATTCTGGAACTGGTTCCAGTGAAGATGGAGACAGACCGCAACAAAGGTTAGACCTCCTCGGTCCGCGAACGGAATCTGAATGGTCAGAATCAAGAGCTGACAGTGGACCAGAATCACCTGAGCCTTACGAGAGGCCATTACACCCTGCATATCAGCATCTTCCATCACGTGCCCCACCTGGTAGTACACCTGCCTCTGCTAAGCCTAGAATATGGTCTCTAGCAGACATGGCAAGCAAAGACGGTGAGGCGCCGGCAGCACCAGCAGCTGCCGCTTCCGCTTTCTATCAGACAGCAGCAGCAGCTGCTGCTGCCCGTCTCGCCCATCCTTATGGCCGCCCAGACTTATACAGGGGACTATACCCACCTGCACATGCTGCTGATGTTGCTCTGCTAGAGTATTCACGTTCCTTGGCTCTCGCTGCTCCGGCTCCCGCGCCCCCAGCACCCTCCCCCTCTTCGTCCTCCACATCATCCCTTGCTGAACCACCGCCTTCCTCTCGCGCCTGA

Protein sequence:

>DPOGS207179-PA
METGYLCVVCERMRTELRDKPQLKAKAVDGLSRFQLSCWRFHWLFGGKRGGYGMDLNGARRKNATRETTSTLKAWLNEHKKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWEPRNRVDDDDNNNDDDDHKSNDGKDALDGKDSGTGSSEDGDRPQQRLDLLGPRTESEWSESRADSGPESPEPYERPLHPAYQHLPSRAPPGSTPASAKPRIWSLADMASKDGEAPAAPAAAASAFYQTAAAAAAARLAHPYGRPDLYRGLYPPAHAADVALLEYSRSLALAAPAPAPPAPSPSSSSTSSLAEPPPSSRA-
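
The transcript and protein records above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code and that the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database. Only the first ten codons and the last five codons of DPOGS207179-TA are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record uses the standard code (not stated on the page).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the trailing "-"
    in the protein record (an assumption about the record's convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 15 nt of DPOGS207179-TA, copied from the record.
start = translate("ATGGAAACTGGATATCTATGCGTGGTCTGT")
end = translate("TCCTCTCGCGCCTGA")
print(start)  # METGYLCVVC
print(end)    # SSRA-

# Length check: 969 nt is 323 codons, i.e. 322 residues plus one stop codon.
print(969 // 3 - 1)  # 322
```

Both fragments agree with the start and end of DPOGS207179-PA, and the lengths are consistent with the 969 bp and 322 aa figures given in the record.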