Monarch geneset OGS2.0

DPOGS212475
TranscriptDPOGS212475-TA1362 bp
ProteinDPOGS212475-PA453 aa
Genomic positionDPSCF300222 - 427283-436914
RNAseq coverage886x (Rank: top 14%)
Annotation
HeliconiusHMEL0093272e-14367.31% 
BombyxBGIBMGA009650-TA0.088.04% 
DrosophilaMad-PA2e-14258.30% 
EBI UniRef50UniRef50_P840226e-16564.73%Mothers against decapentaplegic homolog 3 n=158 Tax=Bilateria RepID=SMAD3_HUMAN
NCBI RefSeqXP_001977898.10.070.96%GG17985 [Drosophila erecta]
NCBI nr blastpgi|3361711200.071.77%putative Smad on X protein [Episyrphus balteatus]
NCBI nr blastxgi|1947636910.068.67%GF20979 [Drosophila ananassae]
Group
Gene OntologyGO:00063552.5e-108regulation of transcription, DNA-dependent
GO:00056222.5e-108intracellular
GO:00055151.2e-96protein binding
GO:00071792.1e-54transforming growth factor beta receptor signaling pathway
GO:00056672.1e-54transcription factor complex
GO:00037002.1e-54sequence-specific DNA binding transcription factor activity
KEGG pathwayder:Dere_GG179850.0 
 K04500 (SMAD2_3)maps-> Colorectal cancer
    Adherens junction
    Pancreatic cancer
    Pathways in cancer
    Endocytosis
    Wnt signaling pathway
    TGF-beta signaling pathway
    Chagas disease
    Cell cycle
InterPro domain[5-453] IPR0137900Dwarfin
[252-453] IPR0178554.2e-109SMAD domain-like
[258-429] IPR0011322.5e-108SMAD domain, Dwarfin-type
[211-444] IPR0089841.2e-96SMAD/FHA domain
[2-142] IPR0130192.1e-54MAD homology, MH1
[24-134] IPR0036192.8e-47MAD homology 1, Dwarfin-type
Orthology groupMCL10965 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212475-TA
ATGTTTCCGCTAACTCCACCTGTTGTGAAGCGGCTGTTGGGGTGGAAGAAGGGCCCCGAGGGTTCGACGGCTGAGGACAAATGGTCCGAAAAGGCCGTAAAAAGTTTAGTAAAAAAGTTAAAGAAAAGCGGTGCGATTGAAGAATTAGAGAAGGCTCTGTCCAACCAAAATAGCCATACAAAGTGTGTGACGATACCGAGGGTCAAACCGAATGATAATATAATAAATGGCCAGAACCGTAAGGGATTGCCGCATGTGGTGTACTGTCGGTTGTGGCGTTGGCCGCAGCTTCAGAGCCAGCATGAATTGAAGCCCGTGGATCACTGCGAGTACGCTTACCAGCTGAAGAAAGACGAAGTCTGCATCAATCCATATCATTACAATAAAATTGATTCTCCAGCGTTACCCCCAATCTTGGTGCCTCGTTGTCCCGAGGGGGAGATACGGGCCCCCCCGCCTTACGAATACCAGCATCACGATCACGACAGCGTGATGCAGAGCAGCGTGGGCGTGGTGGGCGTGGGTGTGGGCGCCGGCGTGGGCGTGGGCGTCGGTGGTCACAGCGCTCTGTACCTGGAGGCGACTCTCGCTCAGCAGGTCCCCGGGAACACCACCGTGCAGCTGAGTTCGTCGTCAGTGGAGACCCCTCCCCCCGGCTACATGAGTGAGGACGGTGACCCCATGGACCACAACGACAATATGAATCTGACCCGGCTGACGCCGTCCCCGTCCATGGCGACGGAGGCGGCCCCCGTGTTGTACCACGAGCCGGCCTTCTGGTGCAGCATCAGCTACTACGAGCTGAACACTCGGGTGGGGGAGACCTTCCACGCCAGCCAGCCCTCGATCACCGTGGACGGATTCACAGATCCCAGTAACAGTGAACGCTTCTGCCTCGGTCTGCTCTCCAACGTGAATAGAAACGAAGTCGTCGAACAAACAAGGAGGCACATCGGGAAAGGCGTCCGTCTGTATTATATCGGCGGCGAGGTTTTCGCTGAATGTCTCAGCGACTCGTCCATATTCGTACAGAGCCCGAACTGCAATCAGCGGTACGGCTGGCATCCGGCCACCGTCTGCAAGATACCACCAGGCTGTAACCTGAAGATCTTCAACAACCAGGAGTTTGCTGCTCTCCTATCTCAGTCCGTGTCCCAGGGGTTCGAGGCTGTGTTCCAACTAACCAGGATGTGCACCATCAGGATGAGCTTCGTCAAGGGATGGGGGGCGGAGTACCGTCGTCAAACGGTGACTTCAACTCCGTGTTGGATCGAGCTGCATCTGAACGGTCCCCTTCAATGGCTGGACCGGGTCCTCACACAGATGGGGTCGCCGCCGCTGCCCTGCTCTTCTATGTCATAG

Protein sequence:

>DPOGS212475-PA
MFPLTPPVVKRLLGWKKGPEGSTAEDKWSEKAVKSLVKKLKKSGAIEELEKALSNQNSHTKCVTIPRVKPNDNIINGQNRKGLPHVVYCRLWRWPQLQSQHELKPVDHCEYAYQLKKDEVCINPYHYNKIDSPALPPILVPRCPEGEIRAPPPYEYQHHDHDSVMQSSVGVVGVGVGAGVGVGVGGHSALYLEATLAQQVPGNTTVQLSSSSVETPPPGYMSEDGDPMDHNDNMNLTRLTPSPSMATEAAPVLYHEPAFWCSISYYELNTRVGETFHASQPSITVDGFTDPSNSERFCLGLLSNVNRNEVVEQTRRHIGKGVRLYYIGGEVFAECLSDSSIFVQSPNCNQRYGWHPATVCKIPPGCNLKIFNNQEFAALLSQSVSQGFEAVFQLTRMCTIRMSFVKGWGAEYRRQTVTSTPCWIELHLNGPLQWLDRVLTQMGSPPLPCSSMS-