Monarch geneset OGS2.0

DPOGS212527
TranscriptDPOGS212527-TA1095 bp
ProteinDPOGS212527-PA364 aa
Genomic positionDPSCF300222 + 605124-614543
RNAseq coverage98x (Rank: top 61%)
Annotation
HeliconiusHMEL0093423e-11163.16% 
BombyxBGIBMGA009797-TA2e-5654.51% 
Drosophilaen-PA7e-5171.97% 
EBI UniRef50UniRef50_P276092e-11265.05%Segmentation polarity homeobox protein engrailed n=4 Tax=Obtectomera RepID=HMEN_BOMMO
NCBI RefSeqNP_001037550.14e-11365.05%segmentation polarity homeobox protein engrailed [Bombyx mori]
NCBI nr blastpgi|1129827788e-11265.05%segmentation polarity homeobox protein engrailed [Bombyx mori]
NCBI nr blastxgi|1129827783e-11665.05%segmentation polarity homeobox protein engrailed [Bombyx mori]
Group
Gene OntologyGO:00063551.2e-23regulation of transcription, DNA-dependent
GO:00435651.2e-23sequence-specific DNA binding
GO:00037001.2e-23sequence-specific DNA binding transcription factor activity
GO:00036776e-23DNA binding
GO:00055158.1e-21protein binding
GO:00056345e-16nucleus
GO:00072755e-16multicellular organismal development
KEGG pathway 
InterPro domain[273-335] IPR0013561.2e-23Homeobox
[251-331] IPR0122876e-23Homeodomain-related
[272-346] IPR0090578.1e-21Homeodomain-like
[331-360] IPR0195491e-17Homeobox engrailed, C-terminal
[272-289] IPR0007475e-16Homeobox engrailed
[302-311] IPR0000471.6e-06Helix-turn-helix motif, lambda-like repressor
[295-306] IPR0204795.2e-06Homeobox, eukaryotic
Orthology groupMCL20513 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212527-TA
ATGCTGCGTCTCCCTCGCGCGGCGCGAGGACCGCGCGGGAGCGAGAGGGACGGCCGGCACCCGCCTTGGCCACTCCGTTGGGTGATGGAGGAACCGCCTCCAGTGGAAGGGGAGCTCGCTCTGGGCGTGTCAGCTGATCGAGATCAGTCGCGCTCCGGACGCCGCACCGGCAGCATCTCCAGCATGGCGTACGAGGACAGGTGCAGCGGCCACGCCGACATCACACAGGTCAACCAGACCCAGTACACCTGCACTATCAACCCTAGGAACATCAAAGTACAGCCCGCGTCGCCGCCGCCCAGCCCCGAGTACTACCGGCCGGAGACTCCGGACGTGAAGCCCGTCATCGAGGACGAGCGCCGGAACCCGATAGCTTTCTCCATCAGTAACATACTGCGTCCAGAGTTCGGTGTGACCGCCCTGAGGAACTCCAAGAAGATAGAGGGTCCTAAACCGCTCGGGCCCAACCACAGCATCCTCTACAAGCCGTACGAGATAACCAAGGAGTTGAGTCAATATGGTTACGAGTATGTGAAGACGAAAGAGGATTTCAACCTGCCGCCGCTGGGAGGGTTGAGGCAGACGGTGTCCAGCATCGGGGAGAAAGAGTCCCCGAAGGTCGTGGAACAGAAGAGACCGGACTCGGCCAGCTCGATAGTATCCTCCACCTCGAGCGGCGCCGTCTCCTGCGGCAGCACCGACAACAGCTCGCAGAGCTCCCAGCTGTGGCCGGCCTGGGTGTACTGCACCCGGTACAGCGACAGACCGAGCTCAGGTCCCAGGAGTAGACGGGTGAAGAAGAAGGCGAGCCCTGAGGAGAAGAGACCGAGGACTGCCTTCAGCGCCTCGCAGCTAACAAGATTAAAGCACGAGTTCGCGGAGAACCGCTACCTGACGGAGAGGAGGAGGCAGGCGCTGGCCGCGGAGCTGGGGCTGGCGGAGGCTCAGATCAAGATCTGGTTCCAGAACAAGAGGGCCAAGATCAAGAAGGCCTCGGGCCAGAGGAACCCGCTGGCGCTGCAGCTCATGGCGCAGGGGCTGTACAACCACGCCACAGTCACCGAGAGCGAGGACGAGGAGATCAGCGTCACGTAG

Protein sequence:

>DPOGS212527-PA
MLRLPRAARGPRGSERDGRHPPWPLRWVMEEPPPVEGELALGVSADRDQSRSGRRTGSISSMAYEDRCSGHADITQVNQTQYTCTINPRNIKVQPASPPPSPEYYRPETPDVKPVIEDERRNPIAFSISNILRPEFGVTALRNSKKIEGPKPLGPNHSILYKPYEITKELSQYGYEYVKTKEDFNLPPLGGLRQTVSSIGEKESPKVVEQKRPDSASSIVSSTSSGAVSCGSTDNSSQSSQLWPAWVYCTRYSDRPSSGPRSRRVKKKASPEEKRPRTAFSASQLTRLKHEFAENRYLTERRRQALAAELGLAEAQIKIWFQNKRAKIKKASGQRNPLALQLMAQGLYNHATVTESEDEEISVT-