Monarch geneset OGS2.0

DPOGS207335
Transcript         DPOGS207335-TA   2184 bp
Protein            DPOGS207335-PA   727 aa
Genomic position   DPSCF300188 - 119367-125680
RNAseq coverage    218x (Rank: top 45%)
Annotation
Heliconius       HMEL002203         0.0      83.82%
Bombyx           BGIBMGA010110-TA   0.0      75.84%
Drosophila       Med-PA             4e-93    89.33%
EBI UniRef50     UniRef50_D6W7L5    3e-123   82.81%   Medea n=8 Tax=Coelomata RepID=D6W7L5_TRICA
NCBI RefSeq      XP_971429.2        1e-124   82.81%   PREDICTED: similar to Xsmad4a [Tribolium castaneum]
NCBI nr blastp   gi|189233891       2e-123   82.81%   PREDICTED: similar to Xsmad4a [Tribolium castaneum]
NCBI nr blastx   gi|198451145       0.0      57.32%   GA14643 [Drosophila pseudoobscura pseudoobscura]
Group
Gene Ontology   GO:0005515   3.4e-92   protein binding
                GO:0006355   2.9e-73   regulation of transcription, DNA-dependent
                GO:0005622   2.9e-73   intracellular
                GO:0007179   4.6e-59   transforming growth factor beta receptor signaling pathway
                GO:0005667   4.6e-59   transcription factor complex
                GO:0003700   4.6e-59   sequence-specific DNA binding transcription factor activity
KEGG pathway   tca:660074   3e-124
 K04501 (SMAD4) maps-> Pancreatic cancer
    Colorectal cancer
    Pathways in cancer
    Wnt signaling pathway
    TGF-beta signaling pathway
    Adherens junction
    Cell cycle
    Chronic myeloid leukemia
InterPro domain   [1-723]     IPR013790   0          Dwarfin
                  [499-704]   IPR001132   6.7e-113   SMAD domain, Dwarfin-type
                  [494-720]   IPR017855   5.1e-98    SMAD domain-like
                  [463-719]   IPR008984   3.4e-92    SMAD/FHA domain
                  [27-136]    IPR003619   1.1e-61    MAD homology 1, Dwarfin-type
                  [10-138]    IPR013019   4.6e-59    MAD homology, MH1
Orthology group   MCL13926   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207335-TA
ATGAATACGACGGCGCCAACCTCGGCAGATGCATGCCTAAGCATTGTCCACTCACTGATGTGCCATAGGCAAGGTGGTGAGAGTGAAGGCTTTTCAAAGCGGGCTATTGAGTCACTAGTCAAGAAATTGAAAGAAAAGAGAGATGAACTTGATTCTTTGATCACAGCAATCACCACAAATGGTGCCCATCCCAGTAAATGTGTTACTATTCAGAGAACTCTAGACGGTCGATTACAGGTTGCGGGAAGGAAAGGATTTCCCCATGTGATATATGCTCGCATATGGCGTTGGCCCGATCTACACAAGAATGAATTGAAACATGTTAAATTCTGTCAGTTTGCTTTTGATCTGAAATGTGACTCGGTGTGTGTTAATCCATACCATTATGAAAGAGTTGTATCTCCAGGTATTGATCTTTCTGGGCTGACTCTTCAGTCGGGTCCTAGTAGGTTAGTAAAAGATGAGTATACAGCTGGTCTGAGCGGGAATGGCATGGACATGGATACTGGAGAACTCGTAACAATCCAGCACCATGCCACAAGCCCCAGACATCATCACTCCACCATTCCCCATCACCATCAACAGTTCCAGACCTCTAACATTATAATAAATCAAGGACAAACGCCAGATGGCGTTGCCAATATGTTTTCTGCTACTCATGGACCTCGACCGCAAATTCGAGCTGGAGCACCTATGGTTCCACAAATGGTACATTCGCCAGGGGCGCAAATGATGGCTAATCATCAGGGACAAATGGCGGGCGCACCTCAAATGGGTCCGGGAAACCCACAAATGGGTCCCGTTAACCCACAAATGGGGCCTGGAACGCCACAAATGGGTCCAGGCACTCCTCAAATGGGGCCGGGAACACCTCAAATGGGTACAAACGTCCCACAAATGGCTTCACCAAGAATGGCGTCAGCTCCCACCCAAATGTCCCCAGGAACCCCACAAATACCAAATATAAGTCAGGGAATGTCAATACCGAGCCCACAACAAATGGCAATGGCACAACAAAGAACTATAGCCCCAAAACTAGAACCGCCCGATGCTATGGATGCAAGAGCTATGTGGCTGCCAAAGAGAATGAATCATCCTTCAATGCCTGTCAGTATGTCTCCCGGTGGGACGACGCCGTTAATAGACGGCTCCAATAATGCATTCTTTACAAACGAGCAGACTTCTACGGATACTCAAATGACTCAGACCATGCCAGCTGGGAGTCAATCGGTGTCAGCTGTGGTGCCAGTGACGTCATCAGCTATGCCAAGTGAAGCCCAGAATGGTTTCGCCGCGACCAGCCCACCACCACAACCCAGTCCTATACCACATCGCACCCAACATCAACAGGGCACCTGGACCGGGAACAACACCTTGACTTATACACAGAGCCTGGCGCCGCCGCCCGCCGCTCCTATGCAGGATGTACCCACTCACCACCATCACTACTATAATGGCAACCCAGGTGGTTTATTGTCAAGCCAGCCAGCTCCGGAGTATTGGTGTTCGGTGGCTTACTTTGAGCTGGATACTCAAGTGGGGGAAACATTCAAAGTGCCATCCAGCAGACCAAACGTTACGGTCGATGGTTATGTGGATCCGTCGGGTGGCAACAGATTCTGTTTGGGTGCTCTCAGTAATGTACACAGAACTGAACAGAGTGAAAGGGCTCGACTCCACATCGGCAAGGGTGTACAGTTGGATCTCCGTGGTGAAGGAGACGTGTGGCTGAGATGTCTCTCAGATCACTCGGTGTTTGTGCAGTCCTACTACTTGGATAGAGAGGCAGGCCGGGCCCCGGGAGACGCTGTTCATAAGATATACCCATCAGCATGTATCAAGGTGTTCGATCTCCGTCAGTGTCACCGTCAGATGCAAACGCAGGCGGCTACAGCCCAGGCGGCGGCGGCAGCGCAGGCTGCAGCTGTCGCAGGACACATACAGCCAGCACATCCGGGAATGAACAAATGTTTGTCAGCGGCGGCTGGTATCGGCGTGGA
TGATCTTCGGAGGCTGTGTATAGTCCGTCTGTCGTTCGTGAAGGGCTGGGGGCCAGACTACCCTCGCACCTCCATCAAGGAGACGCCCTGCTGGGTTGAGGTCCATTTACATAGGGCTCTACAGTTACTGGACGAGGTGCTCCACACTATGCCCATAGATGGTCCTCGGACTAGCATCGAGTAG

Protein sequence:

>DPOGS207335-PA
MNTTAPTSADACLSIVHSLMCHRQGGESEGFSKRAIESLVKKLKEKRDELDSLITAITTNGAHPSKCVTIQRTLDGRLQVAGRKGFPHVIYARIWRWPDLHKNELKHVKFCQFAFDLKCDSVCVNPYHYERVVSPGIDLSGLTLQSGPSRLVKDEYTAGLSGNGMDMDTGELVTIQHHATSPRHHHSTIPHHHQQFQTSNIIINQGQTPDGVANMFSATHGPRPQIRAGAPMVPQMVHSPGAQMMANHQGQMAGAPQMGPGNPQMGPVNPQMGPGTPQMGPGTPQMGPGTPQMGTNVPQMASPRMASAPTQMSPGTPQIPNISQGMSIPSPQQMAMAQQRTIAPKLEPPDAMDARAMWLPKRMNHPSMPVSMSPGGTTPLIDGSNNAFFTNEQTSTDTQMTQTMPAGSQSVSAVVPVTSSAMPSEAQNGFAATSPPPQPSPIPHRTQHQQGTWTGNNTLTYTQSLAPPPAAPMQDVPTHHHHYYNGNPGGLLSSQPAPEYWCSVAYFELDTQVGETFKVPSSRPNVTVDGYVDPSGGNRFCLGALSNVHRTEQSERARLHIGKGVQLDLRGEGDVWLRCLSDHSVFVQSYYLDREAGRAPGDAVHKIYPSACIKVFDLRQCHRQMQTQAATAQAAAAAQAAAVAGHIQPAHPGMNKCLSAAAGIGVDDLRRLCIVRLSFVKGWGPDYPRTSIKETPCWVEVHLHRALQLLDEVLHTMPIDGPRTSIE-