Monarch geneset OGS2.0

DPOGS203241
TranscriptDPOGS203241-TA2265 bp
ProteinDPOGS203241-PA754 aa
Genomic positionDPSCF300210 - 26976-34590
RNAseq coverage1079x (Rank: top 12%)
Annotation
HeliconiusHMEL0087040.070.81% 
BombyxBGIBMGA007076-TA0.076.25% 
Drosophilalack-PA0.057.07% 
EBI UniRef50UniRef50_C1K0020.076.25%E3 ubiquitin-protein ligase SMURF2 n=1 Tax=Bombyx mori RepID=C1K002_BOMMO
NCBI RefSeqNP_001139724.10.076.25%E3 ubiquitin-protein ligase SMURF2 [Bombyx mori]
NCBI nr blastpgi|2263429240.076.25%E3 ubiquitin-protein ligase SMURF2 [Bombyx mori]
NCBI nr blastxgi|2263429240.077.09%E3 ubiquitin-protein ligase SMURF2 [Bombyx mori]
Group
Gene OntologyGO:00064641.1e-125protein modification process
GO:00168811.1e-125acid-amino acid ligase activity
GO:00056221.1e-125intracellular
GO:00055157.2e-17protein binding
KEGG pathwayphu:Phum_PHUM3774800.0 
 K04678 (SMURF)maps-> Ubiquitin mediated proteolysis
    Endocytosis
    TGF-beta signaling pathway
InterPro domain[412-754] IPR0005691.1e-125HECT
[298-331] IPR0012027.2e-17WW/Rsp5/WWP
[17-128] IPR0089734.1e-11C2 calcium/lipid-binding domain, CaLB
[18-84] IPR0000085.6e-07C2 calcium-dependent membrane targeting
Orthology groupMCL11443 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203241-TA
ATGTCTACTCCAGTGTCTAACCGTAAATATGGGGCACAAAAAATTAGACTTACAAGTTTACCGGATCCTTTTGCAAAGATATCAGTTGATGGCAGCGGTCAGGTCTATTCAACTAGTGCAGTCAAAGCAACATTGGAACCGAAATGGAATACTCATTACGATCTGTATCTCACCAAAGGTGAAGGAATAACAATCAGTTTGTGGAATCAACGTAAAATACATAAAAGACAAGGGTCTGGTTTCCTGGGTTGTGTAAGAATTCAGCCGTCCACAGTTCACAAGTTGAAAGACACGGGATATCAATGCCTGGAGCTCTGCGAGGACAGTACGGGTGAGGTGTGCGGTGTTCGCGGTCAAGTAATCGTGTCGCTTCTGTCCCGGGACGGAGCCCGCGGCGAGCAGGCCTCTACTGTGGGCGAAGGCTCCCCGCTGGCTGTAGTGGGCCCCGCGGGTGATGTTCGAGCGCCGAGAGATTCAGTCAGTACCGAGACACCGCTACCAGCTCATTGGAAACAGAGGTTTACACCCGCTGGAAGACCTTTCTACATAAACCATCAGCTCTGTCGATCGCAATGGGAGCGTCCGAGTACTTCCCCCCTGTCACCGTCGTCTCCTACGGCCCCCCCGACCGTGACCCAAAATCTCCCCGTATCCCAAAATTCCCCCGTGTCCCCTAATTCCCCCGAATCCCCTAATTCCCCTGCGTCCCCTAGTATTGACACACCGTCCTCAGTCAATGTGACGCCGATAGCGACGTCCGATCTACCGCCCGGATATGAGATGAGAATCACGGTTCAAGGTCAGGTGTACTTCTACAACGGCAGCACCCGCTCGTCCACGTGGCACGACCCACGTGTCCCACAACACCTGAGACACTGCGCCGCGGTCGCCGGGCCACTACCGCCCGGCTGGGAGATGAGGCACACACACTCCGGACGGCCGTACTTCGTTGATCACAACAACCGCACTACACAGTTCACTGACCCCAGACTAGCTTTGACCACCAGAATTGCGCCGCCGACTGAAGCAGTACAATCTACAAGCCCGCCGTCTACAAACCCGAGTACTAACTGGCCTGCAGACAACGACGTTCTCCCTAAGTACAAAAGAGACCTCGCGGCTAAGGCGAGGGTGTTGAGAGCTGAGCTACAGGCATTACAGCCACAAACAGGACACTGTAGGATTGAAGTATCACGTAACGAAGTACTAGAAGAATCGTACCGTCTCGTGATGAAGTTACGGGGTAAGGAGTTGAGGAAACGCCTGCTCGTCAAGTTCCGCGGTGAAGAGGGGCTCGACTACGGCGGCGTGGCCCGCGAGTGGCTCCATCTATTGGGGAGGGAACTGTTCAACCCGCACTATGGACTGTTTCAGTACGCTAATGCTGGCGATGACAGATATGCGTTACAGGTGAATGCCGACTCTGGGGTGAATCCGGAACATCTTTCATATTTCCACTTCGCTGGTCGCATCCTCGGAGTGGCTTTGTTCCACGGACACCAGCTGGACGCTGCCTTCACTGCACCCTTCTATAAGCAGCTACTAGGAAGACCCATCACCCTCAGGGATATACGAGATGTTGACCCAGAACTGCACAGATCATTGTCTTGGATGCTTGAAAATAGCATAGCCGGTGTCATAGACACGACATTCTCCGTCGAGAGTTCGTCGTTCGGGGCAGTGCGAAGCGTCGAATTGAGGCCGGGCGGGACCAACGAAGCAGTCACAGATTCAAACAAAAGAGATTACGTACGTCTGTATGTAGCGCATAGGTTTACTAGAGGAGCGGAAAGGCAATGGCTGGCATTACAAAGGGGTCTGGCTGACATTATTCCACCGCAACTCCTACAACCACTATCTCCCAGAGATCTTCAACCTTTACTGGCTGGAAGAGCTGATCTCGACCCTGTAGATTGGAAGCGGCACACTCGCCTGAAACACGTGAATCCAGACGCGCCAATTGTTGGTTGGTTCTGGGAGATAGTCGAGGAGTTCGACGCGGAAATGCGAGCTAGGTTATTGCAATTCGTTACCGGGTCGAGACGCGTTCCCCTCGCTGGGTTCAGGGCTTTACAGGGTTCTACGGGAGCGGCGGCTCCTAGACTATTCACTTTACATTTGGTCGCTGATGCGTCACCGGATTCCCTGCCCAAGGCGCACACGTGTTTCAATAGACTTGATCTGCCGCCGTATCCGACCAAGGAAAAATTACACGACAAACTGAAACAGGCCGTGTTGGAAACGGCCGGATTTGCCGTCGAATGA

Protein sequence:

>DPOGS203241-PA
MSTPVSNRKYGAQKIRLTSLPDPFAKISVDGSGQVYSTSAVKATLEPKWNTHYDLYLTKGEGITISLWNQRKIHKRQGSGFLGCVRIQPSTVHKLKDTGYQCLELCEDSTGEVCGVRGQVIVSLLSRDGARGEQASTVGEGSPLAVVGPAGDVRAPRDSVSTETPLPAHWKQRFTPAGRPFYINHQLCRSQWERPSTSPLSPSSPTAPPTVTQNLPVSQNSPVSPNSPESPNSPASPSIDTPSSVNVTPIATSDLPPGYEMRITVQGQVYFYNGSTRSSTWHDPRVPQHLRHCAAVAGPLPPGWEMRHTHSGRPYFVDHNNRTTQFTDPRLALTTRIAPPTEAVQSTSPPSTNPSTNWPADNDVLPKYKRDLAAKARVLRAELQALQPQTGHCRIEVSRNEVLEESYRLVMKLRGKELRKRLLVKFRGEEGLDYGGVAREWLHLLGRELFNPHYGLFQYANAGDDRYALQVNADSGVNPEHLSYFHFAGRILGVALFHGHQLDAAFTAPFYKQLLGRPITLRDIRDVDPELHRSLSWMLENSIAGVIDTTFSVESSSFGAVRSVELRPGGTNEAVTDSNKRDYVRLYVAHRFTRGAERQWLALQRGLADIIPPQLLQPLSPRDLQPLLAGRADLDPVDWKRHTRLKHVNPDAPIVGWFWEIVEEFDAEMRARLLQFVTGSRRVPLAGFRALQGSTGAAAPRLFTLHLVADASPDSLPKAHTCFNRLDLPPYPTKEKLHDKLKQAVLETAGFAVE-