Monarch geneset OGS2.0

DPOGS210997
TranscriptDPOGS210997-TA1110 bp
ProteinDPOGS210997-PA369 aa
Genomic positionDPSCF300004 + 351479-352692
RNAseq coverage140x (Rank: top 55%)
Annotation
HeliconiusHMEL0250483e-6443.55% 
BombyxBGIBMGA006483-TA2e-3075.64% 
Drosophilaind-PA6e-1861.29% 
EBI UniRef50UniRef50_C0IMT13e-2875.64%Special homeobox protein 8 n=3 Tax=Bombyx mori RepID=C0IMT1_BOMMO
NCBI RefSeqNP_001139537.12e-2844.39%special homeobox protein 1 [Bombyx mori]
NCBI nr blastpgi|1689881511e-2775.64%special homeobox protein 8 [Bombyx mori]
NCBI nr blastxgi|2257030901e-2933.43%special homeobox protein 1 [Bombyx mori]
Group
Gene OntologyGO:00063551.8e-24regulation of transcription, DNA-dependent
GO:00435651.8e-24sequence-specific DNA binding
GO:00037001.8e-24sequence-specific DNA binding transcription factor activity
GO:00036778.9e-23DNA binding
GO:00055151.7e-22protein binding
KEGG pathway 
InterPro domain[155-217] IPR0013561.8e-24Homeobox
[153-216] IPR0122878.9e-23Homeodomain-related
[142-212] IPR0090571.7e-22Homeodomain-like
[177-188] IPR0204796.5e-07Homeobox, eukaryotic
Orthology groupMCL17547 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210997-TA
ATGTCAGTCTCAACATCAATTTCGAATGCAGCTCAGCCAGTTTTCAGTCCAACTAATGCCAGTGATGACCAGTGGCTATCAACCAGCAATCTGCAACCCCGGAACGAGACGAATTTTATAACAGGAGAGCCACAGCAATATAAAGTAAATCAAATTAATTATAACAGCACTAGTGCTTTTTATAATGCACCTGATATTCCGAGTACTCATGTAAATTCTTGCAACGGCCCTATGACTTATAATCCAAATTATATAAACACTTGTCCACCAAATCCAGTTTATGGGCAAGAATTTAGATACGAACCATCACAAACTGCTCATCCTAATCCAGGAGTTATGTTTTTGGGTCCTCGTGGAACTATTGGGACTATGAATTCTTGGAGTAACTCAACCAACAGCAGCAATCGTTTAGTATCTAGGCCACTGAACGGGAATAAACCGTTAAGTGGTGGTGTAAAGAAACCGAAACGTATTCGCACAGCATTCACTAGTTCTCAAATGATGGAACTAGAAAACGAGTATACAAGGAACAGATATCTAGACCGCAGCCGTCGCATCGAACTGTCTGAGATATTGAATTTAAACGAACGCACTATTAAGATTTGGTTCCAGAATAGAAGGATGAAGGAAAAGAAGGATAGAGCTGAGAGTCTTGAGGACACTGAAGCCTCAAGCACTACAGAGCTTAACGATCACCAAGATTATCCTGGACAGATGATCATGTATGGCCAATATCCCCAAAACTTATACGGCAGAAGTAATATTTACATTGAACAGTATCCAGTAACATCCACTCCTCTGACGATGCCGACTAATGAAGTCCAATTAGTTAATAGTATTCCTGAATCGGTTCTTAATACATATCCCACGTATATGGTTGAGAATAATTCTGATATAGTCGAGAATTTCGATATTAAAGAACCAGAAATGAACGTGCAAATGCAAGGGTATAATAACAATAAAATTGAATTGATCGATTCCAAAGAAACGACCCAAGATTCGGTGCCTCAGTCAGAGAGTAGTACAAACGACGCTGGTAAAGATGGCTTTAATGGTCCCAATTGGGATTTATCTTGGATCCGCAGCATTCATATGGACGAAGAACTTTGA

Protein sequence:

>DPOGS210997-PA
MSVSTSISNAAQPVFSPTNASDDQWLSTSNLQPRNETNFITGEPQQYKVNQINYNSTSAFYNAPDIPSTHVNSCNGPMTYNPNYINTCPPNPVYGQEFRYEPSQTAHPNPGVMFLGPRGTIGTMNSWSNSTNSSNRLVSRPLNGNKPLSGGVKKPKRIRTAFTSSQMMELENEYTRNRYLDRSRRIELSEILNLNERTIKIWFQNRRMKEKKDRAESLEDTEASSTTELNDHQDYPGQMIMYGQYPQNLYGRSNIYIEQYPVTSTPLTMPTNEVQLVNSIPESVLNTYPTYMVENNSDIVENFDIKEPEMNVQMQGYNNNKIELIDSKETTQDSVPQSESSTNDAGKDGFNGPNWDLSWIRSIHMDEEL-