Monarch geneset OGS2.0

DPOGS202839
Transcript        DPOGS202839-TA  1269 bp
Protein           DPOGS202839-PA  422 aa
Genomic position  DPSCF300018 + 937863-941085
RNAseq coverage   645x (Rank: top 20%)
Annotation
Heliconius        HMEL009295        0.0  99.53%
Bombyx            BGIBMGA010481-TA  0.0  98.58%
Drosophila        Mad-PA            0.0  82.86%
EBI UniRef50      UniRef50_Q179T8   0.0  82.59%  Mothers against dpp protein n=27 Tax=Bilateria RepID=Q179T8_AEDAE
NCBI RefSeq       XP_001601460.1    0.0  81.01%  PREDICTED: similar to mothers against dpp protein [Nasonia vitripennis]
NCBI nr blastp    gi|307193580      0.0  82.94%  Protein mothers against dpp [Harpegnathos saltator]
NCBI nr blastx    gi|157110270      0.0  82.59%  mothers against dpp protein [Aedes aegypti]
Group
Gene Ontology     GO:0005515  4.7e-94  protein binding
                  GO:0006355  2.2e-79  regulation of transcription, DNA-dependent
                  GO:0005622  2.2e-79  intracellular
                  GO:0007179  1.6e-69  transforming growth factor beta receptor signaling pathway
                  GO:0005667  1.6e-69  transcription factor complex
                  GO:0003700  1.6e-69  sequence-specific DNA binding transcription factor activity
KEGG pathway      dse:Dsec_GM18164  0.0
                  K04676 (SMAD1_5_8) maps-> TGF-beta signaling pathway
InterPro domain   [11-422]   IPR013790  0         Dwarfin
                  [220-422]  IPR017855  9.6e-112  SMAD domain-like
                  [226-398]  IPR001132  2.6e-106  SMAD domain, Dwarfin-type
                  [190-413]  IPR008984  4.7e-94   SMAD/FHA domain
                  [19-158]   IPR013019  1.6e-69   MAD homology, MH1
                  [39-148]   IPR003619  6.9e-68   MAD homology 1, Dwarfin-type
Orthology group   MCL10390  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202839-TA
ATGGACACAGATGATGGAGAATCCTCGAGCAGTGGGCCTATGTCCAGTTTAAATAGCCTGTTCTCGTTTACAAGTCCCGCTGTAAAGAAACTTCTTGGCTGGAAGCAGGGTGACGAGGAAGAGAAGTGGGCAGAAAAAGCGGTTGACAGTCTGGTCAAGAAGCTGAAGAAGAGGAAGGGGGCCATCGAGGAACTGGAACGAGCTCTGTCCTGTCCGGGGACACCCTCCAAGTGTGTGACCATACCACGATCATTAGACGGGAGGTTGCAGGTCTCCCATCGCAAAGGCTTACCACACGTTATATACTGCAGAGTGTGGAGGTGGCCAGATCTCCAGAGCCATCACGAGCTCAAGCCGCTCGAGATATGCCAGTATCCGTTCAGCGCGAAGCAGAAGGAGGTATGCATAAATCCTTACCACTACAAGCGTGTCGAGAGCCCGGTGCTGCCGCCGGTGCTGGTGCCGCGGCACTCGGAGTTCGCCCCCGGACATTCCCTGCTGCCGTTCCAGAGGACCTCCGAACCGGCCATGCCCCACAACGTTTCCTACTCGGGTTCGGGATTCCCGCCATCAGCGACGTCAGAATTACCTGACACTCCGCCCCCCGCGTACTCCCCTCCCTCCGACGATTCCGAACCTCCAGGCGAAGTCGCCCCCGTCTCCTATCAAGAACCTCTCTACTGGGCTTCCGTAGCCTACTACGAGCTGAACTGTCGAGTTGGCGAGGTGTTCCACTGCAACTCCCACTCGGTGGTAGTGGACGGCTTCACGGATCCGTCGAACAACAGTGACAGATTCTGTCTCGGCCAGCTCAGCAACGTGAACAGGAACTCCACCATCGAGAACACGAGGCGTCACATAGGGAAGGGGGTACACCTGTACTACGTGGGCGGCGAGGTGTACGCGGAGTGTCTGTCGGACGCCGCGATATTCGTTCAGAGCCGGAACTGCAACCACCACCATGGCTTCCACCCCTCCACCGTGTGTAAAATACCTCCGGGGTGTTCCTTGAAGATATTCAACAACCGGGAGTTCGCACAGCTGCTGTCACAGAGCGTCAATCACGGCTTCGAAGCCGTATACGAGCTGACCAAGATGTGCACTATAAGGATGTCGTTCGTGAAGGGCTGGGGGGCGGAGTACCACAGGCAGGACGTCACCTCCACGCCCTGCTGGATAGAGATCCATCTCCACGGCCCGCTGCAGTGGCTGGACAAGGTCCTCACTCAGATGGGCTCCCCGCACAACGCCATCTCCTCGGTGTCTTAG

Protein sequence:

>DPOGS202839-PA
MDTDDGESSSSGPMSSLNSLFSFTSPAVKKLLGWKQGDEEEKWAEKAVDSLVKKLKKRKGAIEELERALSCPGTPSKCVTIPRSLDGRLQVSHRKGLPHVIYCRVWRWPDLQSHHELKPLEICQYPFSAKQKEVCINPYHYKRVESPVLPPVLVPRHSEFAPGHSLLPFQRTSEPAMPHNVSYSGSGFPPSATSELPDTPPPAYSPPSDDSEPPGEVAPVSYQEPLYWASVAYYELNCRVGEVFHCNSHSVVVDGFTDPSNNSDRFCLGQLSNVNRNSTIENTRRHIGKGVHLYYVGGEVYAECLSDAAIFVQSRNCNHHHGFHPSTVCKIPPGCSLKIFNNREFAQLLSQSVNHGFEAVYELTKMCTIRMSFVKGWGAEYHRQDVTSTPCWIEIHLHGPLQWLDKVLTQMGSPHNAISSVS-
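
Consistency check (sketch):

The transcript and protein records above should agree: 1269 bp is 422 codons plus one stop codon, and the protein record writes the stop as "-". The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of the database; the example input is only the first 24 bases of DPOGS202839-TA.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: the record was translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" by default, matching the
    convention used in the protein record above).
    """
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 24 bases of DPOGS202839-TA, copied from the record above.
    prefix = "ATGGACACAGATGATGGAGAATCC"
    print(translate(prefix))
```

Passing the full DPOGS202839-TA sequence to `translate` and comparing the result with the DPOGS202839-PA string would extend the same check to the whole record.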