Monarch geneset OGS2.0

DPOGS215636
TranscriptDPOGS215636-TA858 bp
ProteinDPOGS215636-PA285 aa
Genomic positionDPSCF300041 - 1812110-1822870
RNAseq coverage7x (Rank: top 87%)
Annotation
HeliconiusHMEL0045184e-36100.00% 
BombyxBGIBMGA007917-TA3e-1965.57% 
DrosophilaCG34367-PC3e-5052.02% 
EBI UniRef50UniRef50_E0VMH12e-5058.33%Short stature homeobox protein, putative n=2 Tax=Pediculus humanus corporis RepID=E0VMH1_PEDHC
NCBI RefSeqXP_967102.21e-5357.80%PREDICTED: similar to CG34367 CG34367-PC [Tribolium castaneum]
NCBI nr blastpgi|1892340302e-5257.80%PREDICTED: similar to CG34367 CG34367-PC [Tribolium castaneum]
NCBI nr blastxgi|1892340302e-5359.63%PREDICTED: similar to CG34367 CG34367-PC [Tribolium castaneum]
Group
Gene OntologyGO:00063551e-28regulation of transcription, DNA-dependent
GO:00435651e-28sequence-specific DNA binding
GO:00037001e-28sequence-specific DNA binding transcription factor activity
GO:00036774.8e-28DNA binding
GO:00055154.7e-25protein binding
GO:00056349.7e-06nucleus
GO:00072751.1e-05multicellular organismal development
KEGG pathway 
InterPro domain[91-153] IPR0013561e-28Homeobox
[67-151] IPR0122874.8e-28Homeodomain-related
[75-150] IPR0090574.7e-25Homeodomain-like
[120-129] IPR0000479.7e-06Helix-turn-helix motif, lambda-like repressor
Orthology groupMCL15350 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215636-TA
ATGGTTATGGAGCGGCTGGCGGACTTCGTCAGCAAGTCTCTGGAGGGAGGAGAGGCCCCGGCGCCACCTCCAGAGCCCGGGGACCACGAAGACATGGACGAGGAAGTGGATATCACAGCTCTTGAAGATAAGGCTCCTCCGGCCAAGCGACCGAGAAATTGGCTGGTGTCTCCTCTCCCAGTACAAAATTACAAAAGAGACGATGATGACGATGATCATAAACACGATATAAATGACAAGTCGAACGGCAGTATCAGCTCGGGTGGTAAGCAGCGTCGTTCACGGACCAACTTCACCCTGGAACAGCTGGGAGAGCTGGAGAGGCTGTTCGATGAAACGCATTACCCTGACGCCTTCATGAGGGAGGAGCTCAGCCAGAGACTAGGGCTCAGCGAGGCCAGGGTGCAGGTGTGGTTTCAAAATCGACGCGCCAAATGTAGGAAACACGAAAGTCAGATGCACAAAGGTCTGCTGGTAGGTGGTAGCGGAACTCCCTTGGAGCCATGTCGCGTGGCACCGTACGTGTCCGTGCCACGGCTGTCGAGCACCACTCAGCGAGCCCCACCACCGCTACCACTCACACCACACCCACCACCTACAGCTTTCGCACCATTCGACTCCGCTATGCTGTCTGCTGCCGCTCACCAGTATGCTAGTGCGGCTGCGGCGGCGGCGGCGGCAGCGTTGTGTCCTCCGTATGCTGGTTTAGCGGCTCTGGCGGCTCGGTGTCGGTCCTCATCTATAGCGGATCTGCGACTGAAGGCTCGCCGGCACGCAGCGGCGCTTGCAGCTGCGAGGGCTCCCTCGCCCTCGAACTCGCCCGCTCCCGCTCCCGCTTCTGCTCCCGCTGACACTTAG

Protein sequence:

>DPOGS215636-PA
MVMERLADFVSKSLEGGEAPAPPPEPGDHEDMDEEVDITALEDKAPPAKRPRNWLVSPLPVQNYKRDDDDDDHKHDINDKSNGSISSGGKQRRSRTNFTLEQLGELERLFDETHYPDAFMREELSQRLGLSEARVQVWFQNRRAKCRKHESQMHKGLLVGGSGTPLEPCRVAPYVSVPRLSSTTQRAPPPLPLTPHPPPTAFAPFDSAMLSAAAHQYASAAAAAAAAALCPPYAGLAALAARCRSSSIADLRLKARRHAAALAAARAPSPSNSPAPAPASAPADT-