Monarch geneset OGS2.0

DPOGS214436
TranscriptDPOGS214436-TA546 bp
ProteinDPOGS214436-PA181 aa
Genomic positionDPSCF300069 + 666886-675721
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0106161e-2355.24% 
BombyxBGIBMGA008482-TA6e-2447.37% 
DrosophilaDbx-PC7e-5790.43% 
EBI UniRef50UniRef50_D2A5M56e-6071.60%Developing brain homeobox n=1 Tax=Tribolium castaneum RepID=D2A5M5_TRICA
NCBI RefSeqXP_002428141.12e-5679.41%Homeobox protein CHOX-E, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700092432e-5971.60%developing brain homeobox [Tribolium castaneum]
NCBI nr blastxgi|2700092431e-5971.60%developing brain homeobox [Tribolium castaneum]
Group
Gene OntologyGO:00036778.3e-27DNA binding
GO:00063558.3e-27regulation of transcription, DNA-dependent
GO:00435653e-22sequence-specific DNA binding
GO:00037003e-22sequence-specific DNA binding transcription factor activity
GO:00055151.5e-21protein binding
GO:00056342e-06nucleus
KEGG pathway 
InterPro domain[24-96] IPR0122878.3e-27Homeodomain-related
[37-99] IPR0013563e-22Homeobox
[28-106] IPR0090571.5e-21Homeodomain-like
[66-75] IPR0000472e-06Helix-turn-helix motif, lambda-like repressor
[59-70] IPR0204793.4e-06Homeobox, eukaryotic
Orthology groupMCL19667 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214436-TA
ATGTTCGTCTGTCCTCGTCCAGTCCGTCCCGCGCAGGCTCACACCCAGGTGTTCCCACTCCCCGGAGGCTTCCCATGGGCTCACAGCTCCAGAGGCAAACCCCGGCGCGGTATGATGCGCCGCGCGGTGTTCTCTGACCTCCAACGTAAAGGTCTGGAGAAGAGATTCCAGCTCCAGAAGTACATCAGCAAGCCGGACAGGAAGAAGCTAGCTGAGAAACTGGGCCTCAAGGATAGTCAGGTGAAGATCTGGTTCCAGAACCGTCGCATGAAGTGGAGGAACTCCAAGGAGCGCGAGCTGCTCGCGTCTGGCGGCTCCCGGGAGCAGACCCTCCCCAACAAGAACAACCCCCACCCGGACCTGTCCGACGCGGAGGTGGACCGCCACAAGCTGTCTCCACCGCCCGATGACGACAAGTACGGGCCAGCGCCCACACACGCGCCCGAGCCCCGGGGCTTCAGGGAGCACGCCTTCACCGCCGCGGACGAGGCCGACGAGTACTACAGCGATCCCAGCGCCTCCGACGAGGAGATCAACGTCACATGA

Protein sequence:

>DPOGS214436-PA
MFVCPRPVRPAQAHTQVFPLPGGFPWAHSSRGKPRRGMMRRAVFSDLQRKGLEKRFQLQKYISKPDRKKLAEKLGLKDSQVKIWFQNRRMKWRNSKERELLASGGSREQTLPNKNNPHPDLSDAEVDRHKLSPPPDDDKYGPAPTHAPEPRGFREHAFTAADEADEYYSDPSASDEEINVT-