Monarch geneset OGS2.0

DPOGS213091
TranscriptDPOGS213091-TA831 bp
ProteinDPOGS213091-PA276 aa
Genomic positionDPSCF300016 - 202146-216750
RNAseq coverage247x (Rank: top 42%)
Annotation
HeliconiusHMEL0063481e-6582.03% 
BombyxBGIBMGA007623-TA2e-1861.43% 
DrosophilaB-H1-PA5e-5062.57% 
EBI UniRef50UniRef50_B3MW332e-5251.47%B-H1 n=19 Tax=Metazoa RepID=B3MW33_DROAN
NCBI RefSeqXP_001965663.14e-5351.47%B-H1 [Drosophila ananassae]
NCBI nr blastpgi|1947671137e-5251.47%B-H1 [Drosophila ananassae]
NCBI nr blastxgi|1232392e-5755.00%Om(1D) [Drosophila ananassae]
Group
Gene OntologyGO:00063551.3e-27regulation of transcription, DNA-dependent
GO:00435651.3e-27sequence-specific DNA binding
GO:00037001.3e-27sequence-specific DNA binding transcription factor activity
GO:00036772.8e-27DNA binding
GO:00055154.5e-23protein binding
GO:00056341.1e-06nucleus
KEGG pathway 
InterPro domain[69-131] IPR0013561.3e-27Homeobox
[49-128] IPR0122872.8e-27Homeodomain-related
[60-138] IPR0090574.5e-23Homeodomain-like
[98-107] IPR0000471.1e-06Helix-turn-helix motif, lambda-like repressor
[91-102] IPR0204793.4e-06Homeobox, eukaryotic
Orthology groupMCL17330 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213091-TA
ATGACCGTCCAACGCGACCAGCGCGAGCGCGCGCCGCGGACCAGGTTCATGATCACGGACATCCTGGACGCGGCGCCCAGGGACCTCAGCGCGCACCGGGACTCGGACTCCGACAGGTCGGCCACGGACTCCCCAGGTGTCAAAGATGACTCCGACGACGTGTCCAGCAAATCCTGCGGTGACGCATCTGCATTGGCTAAGAAGCAGCGCAAGGCTAGAACAGCCTTCACGGATCATCAGCTTCAGACCTTGGAGAAGTCGTTCGAGAGACAAAAATACCTCAGCGTCCAGGATCGAATGGAGCTAGCTGCTAAACTAGGTCTTACAGATACCCAAGTGAAGACCTGGTATCAGAACAGAAGAACGAAATGGAAGCGTCAAACGGCCGTTGGACTCGAGTTACTAGCAGAGGCTGGCAACTACGCAGCCTTTCAACGTTTGTATGGAGGTTACTGGGCAGGAGTGCCCGCGTATCCAACACAGCCTGCCCCTTCTGCTGATTTATACTATCGTCAAGCTGCCGCAACTGCTGCTGCAGCAGCCTCGGCCTCTGCAAACACATTACAGAAACCATTACCATATCGATTATACCCTGGCGCTCCAATGGCGGGTGTTCCCCCGTTAGGTTTGGGTCTGCCGGGTCCGTCTGCTCACTTGGGATCACTGGGTGCTCCTGGTTTGGGAGCCCTCGGTTATTATGCACAAGCTAGACGCACACCCTCTCCAGACGTGGATCCTGGAAGCCCAGCACCTCCGCCGCGATCCCCGCGAGAGCAATCCGTAGAACGACACTCTGACGACGAAGACGACGATGAAACCATACACGTGTAA

Protein sequence:

>DPOGS213091-PA
MTVQRDQRERAPRTRFMITDILDAAPRDLSAHRDSDSDRSATDSPGVKDDSDDVSSKSCGDASALAKKQRKARTAFTDHQLQTLEKSFERQKYLSVQDRMELAAKLGLTDTQVKTWYQNRRTKWKRQTAVGLELLAEAGNYAAFQRLYGGYWAGVPAYPTQPAPSADLYYRQAAATAAAAASASANTLQKPLPYRLYPGAPMAGVPPLGLGLPGPSAHLGSLGAPGLGALGYYAQARRTPSPDVDPGSPAPPPRSPREQSVERHSDDEDDDETIHV-