Monarch geneset OGS2.0

DPOGS211188
TranscriptDPOGS211188-TA831 bp
ProteinDPOGS211188-PA276 aa
Genomic positionDPSCF300007 + 649655-651042
RNAseq coverage154x (Rank: top 53%)
Annotation
HeliconiusHMEL0084714e-0844.64% 
BombyxBGIBMGA003176-TA5e-10267.50% 
DrosophilaCG11617-PA2e-4846.75% 
EBI UniRef50UniRef50_D6WJR03e-5749.44%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6WJR0_TRICA
NCBI RefSeqXP_001811513.16e-5849.44%PREDICTED: similar to CG11617 CG11617-PA [Tribolium castaneum]
NCBI nr blastpgi|1892379451e-5649.44%PREDICTED: similar to CG11617 CG11617-PA [Tribolium castaneum]
NCBI nr blastxgi|1892379456e-6150.39%PREDICTED: similar to CG11617 CG11617-PA [Tribolium castaneum]
Group
Gene OntologyGO:00036778.5e-17DNA binding
GO:00063558.5e-17regulation of transcription, DNA-dependent
GO:00055159.9e-16protein binding
GO:00056343.4e-13nucleus
GO:00435652.4e-12sequence-specific DNA binding
GO:00037002.4e-12sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[44-110] IPR0122878.5e-17Homeodomain-related
[28-104] IPR0090579.9e-16Homeodomain-like
[62-100] IPR0084223.4e-13Homeobox KN domain
[44-109] IPR0013562.4e-12Homeobox
Orthology groupMCL18456 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211188-TA
ATGACAGTTATTCAAAGGACTAGTGCGGCAAAAGTGGAAAAAATGGATGGTACTTCGAAAGAAGATAGTGCAAAATCAGCAAGACCAGTCAGGAATAGGAGATACACACGGAGATGCTTAGTGGCCGGCCAGCGTCCTCAGAAGAGGCTGTTCACTCCAGAAATAAAACGCTATCTCAAAGACTGGCTCGTCCGAAGAAGAGACAATCCTTATCCGAACCGCGAAGAAAAAAAACAATTATCGAGAGAGACGGGATTAACGTACATACAAATCTGCAACTGGTTTGCTAACTGGAGAAGAAAATTAAAGAACGTGAATGCTGATCGCAATCAACTCACGTGGGGTCATTTAATACGTACATACAACGACCGCGCCCAGGGCAACGTGGAGCAATTCAGCATCTGCTCAGACGACAGCATATGGAGCGAACCAGAACAATCCAGTCCGAACAACGAACAAGACTACGACGCGAGGTTCGAAAACAGCCCGGACTCGAATACTTCGTATAAACAAGACACAGAAGAAACATCACCACCCTGCGAGAAATACGAAAGCTTCAACAATAATTCCAACGAGATAGAGAGATGCGATGATAATAATTGCGATAAGGTCATCACTAGTCCAGTACTTCTAAGCAAATGGCTGGAAAGCGCAGCTCGGTTCCAACCAAGCGAAACTAATTATTCTTGGTGGGCGGATGGGAGAAGACGAAAACAAGAACAAAAGGTCCAAAGAATAGTTATAAACACAATAAAACATGATAGGGATGAGGTCGAAGCAGCGATGGCACTTACAACGTTGGCATCAGCTAATTGTCTTACCGCTCCGTAA

Protein sequence:

>DPOGS211188-PA
MTVIQRTSAAKVEKMDGTSKEDSAKSARPVRNRRYTRRCLVAGQRPQKRLFTPEIKRYLKDWLVRRRDNPYPNREEKKQLSRETGLTYIQICNWFANWRRKLKNVNADRNQLTWGHLIRTYNDRAQGNVEQFSICSDDSIWSEPEQSSPNNEQDYDARFENSPDSNTSYKQDTEETSPPCEKYESFNNNSNEIERCDDNNCDKVITSPVLLSKWLESAARFQPSETNYSWWADGRRRKQEQKVQRIVINTIKHDRDEVEAAMALTTLASANCLTAP-