Monarch geneset OGS2.0

DPOGS208857
Transcript: DPOGS208857-TA, 3120 bp
Protein: DPOGS208857-PA, 1039 aa
Genomic position: DPSCF300009 - 2001127-2011363
RNAseq coverage: 456x (Rank: top 27%)
Annotation
Heliconius       HMEL016786           0.0      63.24%
Bombyx           BGIBMGA012577-TA     0.0      82.10%
Drosophila       CG9153-PB            0.0      44.77%
EBI UniRef50     UniRef50_A7URT7      0.0      48.31%   AGAP006994-PB n=4 Tax=Anopheles RepID=A7URT7_ANOGA
NCBI RefSeq      XP_002425127.1       0.0      49.62%   hect E3 ubiquitin ligase, putative [Pediculus humanus corporis]
NCBI nr blastp   gi|270002050         0.0      50.28%   hypothetical protein TcasGA2_TC000997 [Tribolium castaneum]
NCBI nr blastx   gi|270002050         0.0      50.81%   hypothetical protein TcasGA2_TC000997 [Tribolium castaneum]
Group
Gene Ontology    GO:0006464           6e-127   protein modification process
                 GO:0016881           6e-127   acid-amino acid ligase activity
                 GO:0005622           6e-127   intracellular
KEGG pathway     phu:Phum_PHUM175050  0.0
                 K10615 (HERC4) maps-> Ubiquitin mediated proteolysis
InterPro domain  [711-1039]  IPR000569  6e-127    HECT
                 [1-377]     IPR009091  1.5e-100  Regulator of chromosome condensation/beta-lactamase-inhibitor protein II
                 [212-261]   IPR000408  2.3e-15   Regulator of chromosome condensation, RCC1
Orthology group  MCL10418             Multiple-copy universal gene
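
The InterPro rows above give residue ranges such as [711-1039] for the HECT domain. The following is a minimal sketch of how such a range could be turned into a domain length and a slice of the protein sequence. It assumes the coordinates are 1-based and inclusive, which this record does not state explicitly; the function names parse_range, domain_length, and slice_domain are hypothetical helpers, not part of any database API.

```python
import re


def parse_range(text):
    """Parse a bracketed range such as "[711-1039]" into (start, end) integers."""
    match = re.fullmatch(r"\[(\d+)-(\d+)\]", text.strip())
    if match is None:
        raise ValueError(f"not a bracketed range: {text!r}")
    return int(match.group(1)), int(match.group(2))


def domain_length(start, end):
    """Length in residues, assuming 1-based inclusive coordinates."""
    return end - start + 1


def slice_domain(protein, start, end):
    """Return the residues from start to end (assumed 1-based, inclusive)."""
    return protein[start - 1:end]


if __name__ == "__main__":
    # Ranges copied from the InterPro rows of this record.
    for label, rng in [("HECT", "[711-1039]"),
                       ("RCC1-like fold", "[1-377]"),
                       ("RCC1 repeat", "[212-261]")]:
        start, end = parse_range(rng)
        print(label, start, end, domain_length(start, end))
```

Under the stated assumption, the HECT range spans 329 residues and ends at residue 1039, the last residue of the 1039 aa protein.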
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208857-TA
ATGTTTTGCTGGGGCAATGCAACTCATCATGAACTCTGTATTGAATGTCCCGGCAACCTTGACCTGGTAATAAAGCCAACATTATCAAAATGGAAGGAGAGCAGCCATATTCAACAAATCTCAGCTGGAGAGTTCCATACATTGTATTTATCAAATAATGGTCATTTGTACACATGCGGCAGTAATGAAGTCGGCCAACTCGGCAGACAGACTAATTCAGCGGATGGAACATCTCCAGTTTTAGTAGACACATTTAAGGGTTGCACATTATCAACTATAGCTTGTGGCTTACAACATTCAATGGCACTTGATGAATGGGGGCAACCATTTAGTTGGGGTTCTGACAGTATGGGTCAATTGGGTAGCAACTTAGGTTCACACGCCCAAGATAAACCTAAAATTATAAAAACCCTGGCTACAAAAAATGTGATTCAAGTTGCTTGCGGATCCTACCACACAGTAGTCTTAACAAATAATGGTGATCTTTATGCATGGGGAGCTAACAGTTATGGTCAATGTGGTCTGGGAAATAGATCAAACAAGGAAACCACCCCACAGGCCTTATCATCTATACTCGGAATACCAATAGCTCTGATTGCTTGTGGCAGTAACCACACTTTTGCCTTATCACAGTCGGGTGCTGTTTTTGGATGGGGTAAAAATAGTCATGGACAACTGGGCTTGCAAGATAGAGAAAGTAGGTGCTATCCAACACATTTGAAGACATTAAGAAATGTTAAGGTTTGTCATATATCATGCGGCGAGGATTTTACAGCGTTCTTGACTCTAGACGGAGGTGTCTTCACGTGCGGTGCCGGCGAGTACGGTCAAACCGGACACGGTAATACTAAGGACGAACTGGTACCGAGGAAGGTGATGGAACTGATGGGTAGCACAGTCACTCAGGTCGCGTGCGGCCGTCGACACATGTTATGCCGCGTGGGGGAGCGTATCCTGGCGTGTGGGTACGGAGCCCGGGGTCAATTGGGATGTCCGCACATGGCCTTCGCCCTCGTCCCAACACCAGTGACCTTCACGCCAAATGACGAGTCTCCATTCTCACCAGAGATATTCAGTGGTCCGGTGAAGGTGTTTGCTGGTGGAGATCACAGCTTCCTTATTCTGAACAGTGAGAAATCTCCGGCAGACAGTCGAGTGGCCGATGCCAGTAAACAAATACTGTCACTTAGTAATGCCAAGCTCGTCCCATGTCTGGTGTTCAAGAATAACGATGTCGTTAGTCAGGATTTAATGGCCTATCTGGAGACGGTATTCGGCTCATTGGGTTGTATCAACAGTTCTTTTCTGTTGCCTCAAAATGGACACTTCGGTTGCAACACTAAGGTGCCAGGTGTTGATTTAAAGAAAGCTGAGGAGGCTTTTACATTGATTAGTAGAATTGAGAATTTAACTATACAGGAACTGATATTCAACAATTTAACGGAGAATATTATAAAGAAGGTGAAGGTGTCCCCACCAGATGCAGAGGCCTTGAGAGTTTTCTTATTACTGCCACTATACCATGAGATGAGGAACCCTCGCAGACATCCTGAGCTCCAGGGTCCATTCGCTGAGGCTTTCAACAATTTGTCGACTCACCCTCAGCGTATAGTGCAGTTGTGGTGGGAGGCTCAGTCTACTGATTACTTTGAGATGCTGGTTGACATCTTTAAGAGCGTCATCGTGTACGAGCTCATGCAGCCGGTGGTTAGAACCATTAAGAAAATCTATTTCACCAAGAGCATGGTCCAGATATTAAACACGTTGACGTCATTGAATAAAATAAATTTCACGAATCCAAAGAATCCCAAGATACCAGCTGAGTGTTTCTATATTGAGAACCTTTGTAACTACGTCGACATCGCAGCAGACTATATCAACTGGCTGTCGGATCAAGACTCGTCCCAGCCACATCTGTGTAACTACGCGTTCCTGTTTGATGTCCAATGCAAATCTCTACTATTGAAAATCGATCAGCAAGTGCAAATGCAAATCGCCGT
GAATCTGGTGACAACACAAATATTCTCACGTCTCTTTATGGATCCGACTTACGAATATCAAAGGGATCAATTTCTCATATTAACAGTGTCGAGAAATCATTTAGTAAGGGACACGATGCTGCAAATAAGCAATCATGATACATCCCAATTGAAGAAACCTCTAAGAGTAGAATTTGTAGGCGAAGAGGCCGAGGACGCTGGTGGCGTTAAAAAGGAATTCTTTATGTTATTATTAAAAGAAATATTCGATCCAGTGTACGGAATGTTCAAACAGTCGGAGGAGACGAATATGATTTGGTTCTCAAACAACCCCTTCGAGGACGAGGTCATGTACTACTTGATAGGAGCTATATACGGTCTAGCGATATACAATTCAATAATCATCTACGTACCATTCCCACTAGTGCTGTACAAGAAACTGTTAGGTGAATCTGTTATGTTAGACGACCTATCTGATCTGTATCCGACGCTGGCCAGCAATCTGAAACACTTATTGGATTATCCCGACGAGGATGTCGAAGAGGTGTTCAGTTTGTGTTTCGCTGTGAACACTGAGGTCTTCGATCAAATACAAGTTCATCCTCTGAAGAAGAATGGTGAGAACATTTCAGTGACACATGAGAATAAGAACGAGTATGTTGATCTGTACGTTGATTTTCTTCTGAACAAATCTGTGGAAAATCAGTTCAGAGCTTTCAATCAAGGATTTCAAAAGGTATGTGGCGGTAGAATAATAAAGCTCTTCAGATCCCACGAGTTACAGTCCGTGGTGATAGGGAACGAGGAATACGACTGGGAGATGTTTGAGAACAACTGCGAGTACAAGAACGGGTACACGGCTACTGACCCGCAGATAAGATGGTTCTGGGAGGTTTTCCACGAGTTGCCGCTAGAGGATAAAAAGAAATTCCTATTATTCCTTACTGGCAGTGACCGCGTTCCCATTCAAGGAATGAGAGACATAAAGATAAGGATCCAGCCAGTAGCTGACGACAGATTCTTCCCAGTGGCTCACACGTGCTTCAACCTTCTAGACCTCCCTCGATACCAGACCAAAGAAAGACTCAAATATCACTTACTGATGGCGATACAACAGACGCAGGGTTTCTCGCTAGTATAG

Protein sequence:

>DPOGS208857-PA
MFCWGNATHHELCIECPGNLDLVIKPTLSKWKESSHIQQISAGEFHTLYLSNNGHLYTCGSNEVGQLGRQTNSADGTSPVLVDTFKGCTLSTIACGLQHSMALDEWGQPFSWGSDSMGQLGSNLGSHAQDKPKIIKTLATKNVIQVACGSYHTVVLTNNGDLYAWGANSYGQCGLGNRSNKETTPQALSSILGIPIALIACGSNHTFALSQSGAVFGWGKNSHGQLGLQDRESRCYPTHLKTLRNVKVCHISCGEDFTAFLTLDGGVFTCGAGEYGQTGHGNTKDELVPRKVMELMGSTVTQVACGRRHMLCRVGERILACGYGARGQLGCPHMAFALVPTPVTFTPNDESPFSPEIFSGPVKVFAGGDHSFLILNSEKSPADSRVADASKQILSLSNAKLVPCLVFKNNDVVSQDLMAYLETVFGSLGCINSSFLLPQNGHFGCNTKVPGVDLKKAEEAFTLISRIENLTIQELIFNNLTENIIKKVKVSPPDAEALRVFLLLPLYHEMRNPRRHPELQGPFAEAFNNLSTHPQRIVQLWWEAQSTDYFEMLVDIFKSVIVYELMQPVVRTIKKIYFTKSMVQILNTLTSLNKINFTNPKNPKIPAECFYIENLCNYVDIAADYINWLSDQDSSQPHLCNYAFLFDVQCKSLLLKIDQQVQMQIAVNLVTTQIFSRLFMDPTYEYQRDQFLILTVSRNHLVRDTMLQISNHDTSQLKKPLRVEFVGEEAEDAGGVKKEFFMLLLKEIFDPVYGMFKQSEETNMIWFSNNPFEDEVMYYLIGAIYGLAIYNSIIIYVPFPLVLYKKLLGESVMLDDLSDLYPTLASNLKHLLDYPDEDVEEVFSLCFAVNTEVFDQIQVHPLKKNGENISVTHENKNEYVDLYVDFLLNKSVENQFRAFNQGFQKVCGGRIIKLFRSHELQSVVIGNEEYDWEMFENNCEYKNGYTATDPQIRWFWEVFHELPLEDKKKFLLFLTGSDRVPIQGMRDIKIRIQPVADDRFFPVAHTCFNLLDLPRYQTKERLKYHLLMAIQQTQGFSLV-