Monarch geneset OGS2.0

DPOGS212176
Transcript: DPOGS212176-TA (1470 bp)
Protein: DPOGS212176-PA (489 aa)
Genomic position: DPSCF300038 + 1087560-1094619
RNAseq coverage: 286x (Rank: top 38%)
Annotation
Heliconius: HMEL012564, E-value 0.0, 86.46%
Bombyx: BGIBMGA006621-TA, E-value 0.0, 84.60%
Drosophila: CG5087-PA, E-value 0.0, 63.40%
EBI UniRef50: UniRef50_G6D767, E-value 0.0, 100.00%, Putative ubiquitin-protein ligase n=4 Tax=Coelomata RepID=G6D767_DANPL
NCBI RefSeq: XP_970749.1, E-value 0.0, 70.19%, PREDICTED: similar to ubiquitin-protein ligase [Tribolium castaneum]
NCBI nr blastp: gi|91090806, E-value 0.0, 70.19%, PREDICTED: similar to ubiquitin-protein ligase [Tribolium castaneum]
NCBI nr blastx: gi|91090806, E-value 0.0, 70.19%, PREDICTED: similar to ubiquitin-protein ligase [Tribolium castaneum]
Group
Gene Ontology:
  GO:0006464, E-value 5.1e-82, protein modification process
  GO:0016881, E-value 5.1e-82, acid-amino acid ligase activity
  GO:0005622, E-value 5.1e-82, intracellular
KEGG pathway: tca:659339, E-value 0.0
  K10588 (UBE3B) maps-> Ubiquitin mediated proteolysis
InterPro domain: [152-484] IPR000569, E-value 5.1e-82, HECT
Orthology group: MCL11657 Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212176-TA
ATGCTCGTCCTATTCGCTGATTGTATGACGCACTATGTGACAATTTTAGACGACCTGGAAATGTATGAGAAGCAAGATCCGTTTAAGCTGCAGGACTTCGTGAACATGTCGCAATTCCTGAACATGTTCATATACAAGTCGATCACGGGCCAGTTGTTTGATCTCAAAACTATCCAGAACAACGAGGTTTTTACGTCGCTCCATACATTATTACTGGCGCTGTATAGACGCGACTGTCGCCGACCGTACGCTCCGCCCCACCACTGGCTGGTCAAGGAGATTAGGGACACGCACTTCATGGCTGACTTGGAGAAGGGGAAGAAACCTCAACAGGTCCTGGTACAAAAAACGCCGCATATGATAGCTCACGGTGAACGAGTCCGGCTGTTTAGGAGAGCCGTTGCTGATGAGAAGGTGGTGCTAGGTCTGACGGAGCGTGCGTGCGGAGGTCGATCTACGCTGGTCACGGTCCGTCGGACGCGCCTCGTTGAGGACGGATATAGACAGCTGGCAGCTCTGCCCAGCCGAGCTCTGAGAGCTGTGGTCAGGGTGAGGTTCATCAACGAGCAGGGGCTGGACGAGGCCGGCATAGACCAAGATGGCGTCTTCAAAGGTATGCGACATCTAGGTATAGTGGTGGATGTGCCGTTCGCTTCGTTCTTCCTGAGCCAGGTTTTAGGTCAAACGACTCAAGCTCTGTACAGCTGGATCGATGAGCTGCCATCGCTTGACAAAGACCTGTACAGAAGCCTCACATATATAAAACATTTCCAGGGTGACATATCGTCGTTGGAACTAACGTTTTCTGTGGACGAAGAGAGGCTAGGGGAGATCGTGACTCACGAACTGGTGCCAGGAGGGAAGTGTATACCGGTTACTAATGAGAACAAAATAAACTACATACACCTGATGGCTCACTTCAGGATGCACACGCAGATAAAGGATCAGACCAACGCTTTCATTGTGGGCTTTAGGACGATCATTAACCCGGAGTGGCTCTCGCTGTTCTCTACCCCGGAGCTGCAGCGTCTCATCAGCGGCGACAACGTGCCCTTGGACCTGCGCGACCTGCGGCGGCACACGCAGTACTACGGCGGCTTCCACGACTCGCACCGCGTCGTGTGTTGGCTGTGGGACGTGCTGCAGAGAGACTTCACAGAGAACGAACGAGCGATGTTCCTTAAGTTCGTGACGTCATGCTCCAAGCCCCCAGTGTTGGGCTTCGCTCATCTGAAGCCGCCGTTTTCTATCCGCTGCGTCGAGGTCGGTGACGACGAGGACACGGGGGATACTATTGGGAGCGTTATACGAGGCTTCTTCACAATAAGGAAGAAGGACCCCTTGAACAGACTACCGACGTCGTCGACGTGTTTCAATCTACTTAAACTACCGAACTACCAAAAGAGAAGTACACTGCGAGATAAACTGCGGTACGCGGTTAATTGTAACACAGGTTTCGAACTGTCCTAA

Protein sequence:

>DPOGS212176-PA
MLVLFADCMTHYVTILDDLEMYEKQDPFKLQDFVNMSQFLNMFIYKSITGQLFDLKTIQNNEVFTSLHTLLLALYRRDCRRPYAPPHHWLVKEIRDTHFMADLEKGKKPQQVLVQKTPHMIAHGERVRLFRRAVADEKVVLGLTERACGGRSTLVTVRRTRLVEDGYRQLAALPSRALRAVVRVRFINEQGLDEAGIDQDGVFKGMRHLGIVVDVPFASFFLSQVLGQTTQALYSWIDELPSLDKDLYRSLTYIKHFQGDISSLELTFSVDEERLGEIVTHELVPGGKCIPVTNENKINYIHLMAHFRMHTQIKDQTNAFIVGFRTIINPEWLSLFSTPELQRLISGDNVPLDLRDLRRHTQYYGGFHDSHRVVCWLWDVLQRDFTENERAMFLKFVTSCSKPPVLGFAHLKPPFSIRCVEVGDDEDTGDTIGSVIRGFFTIRKKDPLNRLPTSSTCFNLLKLPNYQKRSTLRDKLRYAVNCNTGFELS-
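
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, that translates a coding sequence with the standard genetic code and writes the stop codon as "-", as the protein record does. The function names translate and expected_cds_length are hypothetical helpers introduced here for illustration; the sketch also assumes the transcript is a complete coding sequence in frame 1.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Stop codons are written as "-" to match the convention of the protein record.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
).replace("*", "-")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int, has_stop: bool = True) -> int:
    """Nucleotide length implied by a protein length (hypothetical helper)."""
    return 3 * (protein_length + (1 if has_stop else 0))


# First ten codons and last three codons of DPOGS212176-TA.
print(translate("ATGCTCGTCCTATTCGCTGATTGTATGACG"))
print(translate("CTGTCCTAA"))
# 489 residues plus one stop codon should account for the 1470 bp transcript.
print(expected_cds_length(489))
```

Applied to the full DPOGS212176-TA sequence, translate should reproduce the DPOGS212176-PA record if the two are consistent; only the first and last codons are spelled out here to keep the sketch short.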