Monarch geneset OGS2.0

DPOGS210967
Transcript          DPOGS210967-TA    1647 bp
Protein             DPOGS210967-PA    548 aa
Genomic position    DPSCF300004 - 360495-362504
RNAseq coverage     37x (Rank: top 74%)
Annotation
Heliconius          HMEL025052          0.0       67.87%
Bombyx              BGIBMGA006395-TA    4e-141    57.41%
Drosophila          pb-PA               2e-23     58.75%
EBI UniRef50        UniRef50_D3KYT2     4e-150    54.48%    Zerknullt n=2 Tax=Obtectomera RepID=D3KYT2_BOMMO
NCBI RefSeq         NP_001166190.1      7e-151    54.48%    zerknullt [Bombyx mori]
NCBI nr blastp      gi|289629212        1e-149    54.48%    zerknullt [Bombyx mori]
NCBI nr blastx      gi|289629212        1e-163    55.20%    zerknullt [Bombyx mori]
Group
Gene Ontology       GO:0006355    2.2e-27    regulation of transcription, DNA-dependent
                    GO:0043565    2.2e-27    sequence-specific DNA binding
                    GO:0003700    2.2e-27    sequence-specific DNA binding transcription factor activity
                    GO:0003677    4.6e-25    DNA binding
                    GO:0005515    6.5e-24    protein binding
KEGG pathway
InterPro domain     [252-314]    IPR001356    2.2e-27    Homeobox
                    [244-312]    IPR012287    4.6e-25    Homeodomain-related
                    [239-309]    IPR009057    6.5e-24    Homeodomain-like
                    [274-285]    IPR020479    1.6e-07    Homeobox, eukaryotic
Orthology group     MCL25240    Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210967-TA
ATGTCCTCAAGCTCCCATTCTCCACCCCCAAGTGATCGGTCAGACGAAGATTCTAAGGACTCGATTTCATCTGAATACGCTGAAACAAGTCAAATACACAACACCAGCCATATAACATTGGCACAAAAAAGTTTTTACAGAGTTAACTCTTTAACGTCACACGGCGTTGAAGATATACTTTCTGAAGGCAATAGCTACAATACTCAAGAGAATACTTATCCGGATTGTAAAACCGAAGTGCTTTCGTCAATCTCAAATCCCTTCGTTCCAGATTATAATTCTAGAGATTCAACAGGATTTAGTATCCATGACATACTTGGCCTTCAACAAGCCTACAATGTGGCCAACGCTCAAGACGAATTGGAATCCCGATACGAGTATCAAATACCGAACTATGACAATATAAGTAATAGTTCTCAAAATAATTATGGTGAAGAGAGAATTTCTGACCATACGATACCAAAGAGCGCAGATATTTTCAACGTAACCGAATCAGAAATTCGAAATGAAGTTGTGTTTCAAAGGAATTACTCAAATAACGAAACTATTTCTTGTCACCAAAGAAGTGATTTGGACAATGATGTTGTAAACAATGCTGAAAGGGAAGAAAGCGATATTAACGAATCTAGTTTTCCCGGACAGAATTCTTCTTGGTGTGAAAAAAACTCGCTGATTAGTAGTCAAGTCATCAGTACAGCCTCTTCTATATCTAACGATATGTCTACCGATTCGTCATCATATCCAAAAGGTTTTACAAAACGAGCACGCACTGCTTACACAAGTTCCCAACTTGTAGAATTGGAAAATGAGTTTCATCAAAATCGATACTTGTGTCGTCCTAGGAGAATAGAATTGGCCAATTATTTGCAGCTTTCGGAACGCCAAATCAAAATATGGTTTCAAAATAGGAGGATGAAATACAAGAAAGATAATAAACACAATAAACCAAGCTCGTCCGTAGACGATAACAGTCCTACAACAAGTTCTAAGGAAATGTCTCCAACTCAGGATCATAAATTGAGCCACAGTCGTGGCTGCGGAGGTCATGATAGACATAGACGTTTACTTAACGAAAGCCATGCAACTCATCATAAAATGTATCTTCCAACCAACGAAACTATACCAAGACCTCCCGATTATTCTTCAATTAGTCCGATTAAATCAGTTGTTAAACCTGGTTCTCAGAGCACTATAGAATTGCCAGCATATACACCTAACTTATCTTACTCGTCCTACTACACAGGAGCAAGCCGGAGCGGTTACTCACCGATATCTGAGGTTTATCGATACAACAGCGATGAATCATTGCAGCAAACTTCTCACACATTGTCGTTATTACAATCGGATAGTTACGTACCTAATGGGATGAATCTAAAGCTCGCCGAAGACATGACTCGATGCCCAACTGGATCTCCATATTATAACACGCTTTCAAACGGAGTGGTTATGCATATTCCAACTACAGATGCATATGGTTACGCCAGCACTATTCCCGCTCTTTCAGCATCTGCTTTCGAAGATAATACCGTTCACACAAGATCAAGCATATCTCAAGATCCTTACTTTGCTTATTTATCATCAGCAGAGACGTCTAACCAACAAACCTCTTCGACAGCTAACAAGTTTTCTTCGTACATTTCACTCTAA

Protein sequence:

>DPOGS210967-PA
MSSSSHSPPPSDRSDEDSKDSISSEYAETSQIHNTSHITLAQKSFYRVNSLTSHGVEDILSEGNSYNTQENTYPDCKTEVLSSISNPFVPDYNSRDSTGFSIHDILGLQQAYNVANAQDELESRYEYQIPNYDNISNSSQNNYGEERISDHTIPKSADIFNVTESEIRNEVVFQRNYSNNETISCHQRSDLDNDVVNNAEREESDINESSFPGQNSSWCEKNSLISSQVISTASSISNDMSTDSSSYPKGFTKRARTAYTSSQLVELENEFHQNRYLCRPRRIELANYLQLSERQIKIWFQNRRMKYKKDNKHNKPSSSVDDNSPTTSSKEMSPTQDHKLSHSRGCGGHDRHRRLLNESHATHHKMYLPTNETIPRPPDYSSISPIKSVVKPGSQSTIELPAYTPNLSYSSYYTGASRSGYSPISEVYRYNSDESLQQTSHTLSLLQSDSYVPNGMNLKLAEDMTRCPTGSPYYNTLSNGVVMHIPTTDAYGYASTIPALSASAFEDNTVHTRSSISQDPYFAYLSSAETSNQQTSSTANKFSSYISL-
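
The transcript and protein lengths in this record can be cross-checked with simple codon arithmetic. The sketch below is only an illustration and is not part of the database record. It assumes the transcript is pure coding sequence with no UTRs (the sequence above starts with ATG and ends with TAA) and that the trailing "-" in the protein sequence marks the stop codon. The function name `cds_length_consistent` is hypothetical.

```python
def cds_length_consistent(transcript_bp: int, protein_aa: int,
                          has_stop: bool = True) -> bool:
    """Check that a coding sequence length matches a protein length.

    Assumes the transcript is entirely coding sequence (no UTRs).
    If has_stop is True, one extra codon is expected for the stop codon.
    """
    # A complete coding sequence must be a whole number of codons.
    if transcript_bp % 3 != 0:
        return False
    codons = transcript_bp // 3
    expected_codons = protein_aa + (1 if has_stop else 0)
    return codons == expected_codons


# Lengths as listed in this record: 1647 bp transcript, 548 aa protein.
# 1647 / 3 = 549 codons = 548 residues + 1 stop codon.
print(cds_length_consistent(1647, 548))  # True
```

The same check can be run against the actual sequence lengths after reading the two FASTA entries, which would also catch truncated or frame-shifted copies of the sequence.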