Monarch geneset OGS2.0

DPOGS209738
Transcript          DPOGS209738-TA   3678 bp
Protein             DPOGS209738-PA   1225 aa
Genomic position    DPSCF300105 + 353189-363909
RNAseq coverage     436x (Rank: top 28%)
Annotation
Heliconius          HMEL011348         0.0   83.59%
Bombyx              BGIBMGA008926-TA   0.0   79.09%
Drosophila          CG9934-PA          0.0   40.37%
EBI UniRef50        UniRef50_E2AX73    0.0   47.26%   Ubiquitin conjugation factor E4 B n=9 Tax=Neoptera RepID=E2AX73_CAMFO
NCBI RefSeq         XP_973165.1        0.0   48.36%   PREDICTED: similar to ubiquitination factor E4 [Tribolium castaneum]
NCBI nr blastp      gi|91088575        0.0   48.36%   PREDICTED: similar to ubiquitination factor E4 [Tribolium castaneum]
NCBI nr blastx      gi|91088575        0.0   48.26%   PREDICTED: similar to ubiquitination factor E4 [Tribolium castaneum]
Group
Gene Ontology       GO:0034450   2.9e-137   ubiquitin-ubiquitin ligase activity
                    GO:0006511   2.9e-137   ubiquitin-dependent protein catabolic process
                    GO:0000151   2.9e-137   ubiquitin ligase complex
                    GO:0016567   2.9e-137   protein ubiquitination
                    GO:0004842   7.1e-23    ubiquitin-protein ligase activity
KEGG pathway        tca:661942   0.0
                    K10597 (UBE4B, UFD2) maps to:
                        Ubiquitin mediated proteolysis
                        Protein processing in endoplasmic reticulum
InterPro domain     [525-1137]    IPR019474   2.9e-137   Ubiquitin conjugation factor E4, core
                    [1153-1208]   IPR003613   7.1e-23    U box domain
                    [1155-1208]   IPR013083   5.6e-18    Zinc finger, RING/FYVE/PHD-type
Orthology group     MCL13847   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209738-TA
ATGAGTGAACTTAGTCAAGAAGAGATTCGTAGACGGAGGTTGGCTCGACTCGCAGCCCTCGGAGGGACGGGCTCTTCAGTTCCAACTAGTCCTCCCATTACACCTGGTGCATCAGTTCCACCAGACGTTCTTAGTCCAGGTCCACCGTCACGTTTGTCTCCAGCGTCAGTTCCACGTAGCTCCACAGAAAATGCAGACAGTAAAAACAAAGAACAAAACTATGCAGAGAACAAAAGTGATGTGTCAGATCAAACAAAAATACCAGCTGATGGTCTAATTGTTACAGAAAAGCCAGATAATAATTTATTAGACGAAATGCCAGATACAAAAATGACAGACCTTTCTCAAACACGGCAGTACAATAGTTTTGACTCTATGGGAGAGGACACATTGTTAAGCTGTGGGTCAGTGAGAGTTCTGCCTCAGTCTACTGTCAGTGGGTCAGAAGGTGCTTCGACAAGAGAAGATAGCACATCACGTTCAATTTCTCCAGCACAGTTTGTAGAACCACCACCAGTTAGACCCAGAACACACAATGCTTCACCAAGTTGCTCAAGACGCTCTCTTTCAATGGAAGTTGATGATGTTTCAGAGAGAAATTCACAGTCTGAACAAACACAAGAACCAATGGAGGTTGAAGATGACTCAACACAATCTCCTGCTCGTAAAATTCAAAGATCTCGTACTGTAAGTTGCACAGAGTTAACAGAGGAACAGTTAAGGAATATTGTTGCCAAAATATTACAAGTTTCATGGTCCGATGACAGCGCTGGTGGCATTTTTGTGCCTTCAGTTGCGGCATCACTACTAGATAATCCCAAACTAGGTTTAGAGGAGATAGCATCAGAGGCACTGATGGACACACTTTGTCAGATTATAGACGGAAACGATCCTCTGAATCAGAAGTTGATTGCTATAACGGAAACAACAAAAAGGCTGAGTGAAGACTGTGGTGATGGAGATGATTTGTTGACGAGCGGACATCTCGGTTCTGAAGCGGATGCTAGCAAATCAGACTGCCCTACACCATCGTTACCCATTCAGAAACCAATGCCCTCCCAAGGTTTAGCAGTCAGCTACATACTGAAATTCTACAATAACATTAATCTGTATGAAAGGGACCATCCAAAGAAAAGCTCCGAACCGCCTCTGAGTGATCTGTTGCAAAGTTTCCGAACGCTGCTCGTAAACAATCTAGTATTAGTGCTGAGGGGGAAATATGATTTGGAGAAATGTCGGAAATCGCCTCTGTTACCTTACCTCTTGATTGGGAACACTCCGATCGGACTGATACCTGAAGTTCTCTTAGCGACATATCAGGATAAAGAAGCTTTCGAGGAGGTGTTCGTGCCGCTGCTTATGGGTGTTCGTGAAGAGATGCGTCGATGTGTGTCCCCGCTGGTAGGTCGCGGGCACGGCGCACCGCTACGTGCTCTACGTGCCCTGTGCGAGCTGCGCGCTCCCCCACGACACGCCGCACGGCCTGTGTGTGCGTTACTGGCACGCCTACCCTCGTTATGCCCACCATCCGTTACCACAGCACCGGGGCGAGAAATCGCCAGAGTCAGTTTCCTTGGACCCTTCTTTGCGATATCCCTATTCGCTGAGGAGAATCCCCGTTTCGCTGAGAGGATGTTCGCGGGCACTGACCAGAGCCTGTCTTTCGCTCTGCAGCGTGAGGTGGAAGCCAGTCGGAACACACTCCACAATATCTGTCACAACATACTGCTGTGTCCCGACGCAAGGGAACCATTCCTTAACTACTTCGCCACATTACTCCAGAGGAATGAGCGAAGAGCACAACTACAGACAGATGAACGTTCCCTGGCTGGCGACGGCTTCATGTTGAACGTTTGCTCGGTTCTCCAACTACTGTCGGTGCGCATCAAGATGGACCGCGTGTACGAGCTGTATACCTTCCAACCGGACACGTGGTACAACGTCAGGGATGAAACCCGCCTGTATTTCACAGCACAGGAGGCACAGGAATGGCAGGACGC
TCTCAACAACGATCCAAACCACGAATGGCCGGAAGCGAAGTTTCAGACTGTGTGTTGGTTCCTGACTCTCCACATGCATCACGTGGCGCTCATACCCGCGCTCCACACGCACCAGAGACGCATACGTGCCTTCCGAGATCTTCAGAAGGTGATTGAGGAGCTGATGGCTGCTGAGCCTCAGTGGAGGAATAGCTTCTCAGCGTTCAGGAACAGGGAACTACTGCGACGATGGAGGAGACAGATCAAGAGGCTACACCGTTCCAAGCAGTGTGCGGAGACGGCTCTCCTGGACCCCGAGCTGATGCGGCGTGGTGTTCAGTTCTACTCGTCCGTGTGTTCGCTGTTGGTCCGTCAGCTGCAGTCGGCGGCGTCTACCGGCCCTAGCACACCGATCGCCTCACCATCACACGCCTTCCGCGCCCTACCCGAGTGGTATGTGGAAGACATCGCGGAGTTCATGCTGTTCGCCGTCCAGTATGTGCCACAAACGGTGGCCTACCACATAGAAGACCCGATAGTAACCTGGCTTCTGAGCGCCATCTGTAACTCCCATCTCATAAAGAACCCGTACTTGGTTGCAAAAATCGTAGAGGTTTTATTCGTCATCAATCTATCCCTGCCGATGAAGCTGAAGAACGTCTACGAGAAGTTCATGGACCATCCCATGTCCCAGACGGCGCTGCCGAGTGCCCTGATGAAATTCTACACGGATATTGAAACAACGGGCCAGAGCACCGAGTTCTATGACAAGTTCACTATCAGATTCCACATCAGCATCATACTTAAGGGAATGTGGGACAGACCCATACACAAACAGGCCATCGTCAAAGAATCCCGTTCGGGTCGACAGTTTGTCAAGTTCATCAACATGCTCATGAACGACACCACCTTCCTCCTCGACGAATGTCTGACGTATCTCAAACGTATCCACGAAGCTCAAGAGGCTGAGGCCGAAGGCTCGAGCAGCAGCAGTAGCTCGAGTGAGGCTCGAACTCGAGCACTCGCGCAGGACGAGCGCCAGTGCCGCTCATACCTCACGTTGGCTCGAGAAACCGTGGACATGCTCGAGTACCTCACAGTCGACATCAAAGAGCCCTTCCTGCGAGCGGAACTCGTCGACAGACTGGCGTCCATGCTCAACTTTAATCTACAACAACTCTGCGGGCCCAAGTGCAATAATCTTAAGGTCCGCCAGCCGGAGAAGTACGGCTGGGAGCCGCCATATAAATATATCAATATTAACCGACATCAAAAAAGCCGACAAGCAAATTCAATCAATTGGTCGCTACATAGTCTTATTAAAAGGTCCTTCCGCAAGGAGTTGTTTGATGAAGCAGCGGTTCGCCTCGCCAAGTCGTATATTAAGACGCCCTCAGAGATAGAGCGGTTCAGGACTCTCGCTGACAACGCGTACCAGATCGCTGTATCAAATCAGCAAAGAAGTGACGAATTCGCGGACGCACCCGAAGAATTCAGAGATCCCCTGATGGATACCCTGATGACTGACCCGGTCACCTTACCTTCTGGAAAGGTAATGGATCGTTCGGTGATCCTCCGCCATTTGCTTAACAGCGCCACCGACCCTTTCAACAGACAACCTCTCACTGAGGACCAGTTGCGTCCAGGCAGAATGAATATAACCGTTTTAGGATCTCATAGAGAAAATAAAGTTGTATAA

Protein sequence:

>DPOGS209738-PA
MSELSQEEIRRRRLARLAALGGTGSSVPTSPPITPGASVPPDVLSPGPPSRLSPASVPRSSTENADSKNKEQNYAENKSDVSDQTKIPADGLIVTEKPDNNLLDEMPDTKMTDLSQTRQYNSFDSMGEDTLLSCGSVRVLPQSTVSGSEGASTREDSTSRSISPAQFVEPPPVRPRTHNASPSCSRRSLSMEVDDVSERNSQSEQTQEPMEVEDDSTQSPARKIQRSRTVSCTELTEEQLRNIVAKILQVSWSDDSAGGIFVPSVAASLLDNPKLGLEEIASEALMDTLCQIIDGNDPLNQKLIAITETTKRLSEDCGDGDDLLTSGHLGSEADASKSDCPTPSLPIQKPMPSQGLAVSYILKFYNNINLYERDHPKKSSEPPLSDLLQSFRTLLVNNLVLVLRGKYDLEKCRKSPLLPYLLIGNTPIGLIPEVLLATYQDKEAFEEVFVPLLMGVREEMRRCVSPLVGRGHGAPLRALRALCELRAPPRHAARPVCALLARLPSLCPPSVTTAPGREIARVSFLGPFFAISLFAEENPRFAERMFAGTDQSLSFALQREVEASRNTLHNICHNILLCPDAREPFLNYFATLLQRNERRAQLQTDERSLAGDGFMLNVCSVLQLLSVRIKMDRVYELYTFQPDTWYNVRDETRLYFTAQEAQEWQDALNNDPNHEWPEAKFQTVCWFLTLHMHHVALIPALHTHQRRIRAFRDLQKVIEELMAAEPQWRNSFSAFRNRELLRRWRRQIKRLHRSKQCAETALLDPELMRRGVQFYSSVCSLLVRQLQSAASTGPSTPIASPSHAFRALPEWYVEDIAEFMLFAVQYVPQTVAYHIEDPIVTWLLSAICNSHLIKNPYLVAKIVEVLFVINLSLPMKLKNVYEKFMDHPMSQTALPSALMKFYTDIETTGQSTEFYDKFTIRFHISIILKGMWDRPIHKQAIVKESRSGRQFVKFINMLMNDTTFLLDECLTYLKRIHEAQEAEAEGSSSSSSSSEARTRALAQDERQCRSYLTLARETVDMLEYLTVDIKEPFLRAELVDRLASMLNFNLQQLCGPKCNNLKVRQPEKYGWEPPYKYININRHQKSRQANSINWSLHSLIKRSFRKELFDEAAVRLAKSYIKTPSEIERFRTLADNAYQIAVSNQQRSDEFADAPEEFRDPLMDTLMTDPVTLPSGKVMDRSVILRHLLNSATDPFNRQPLTEDQLRPGRMNITVLGSHRENKVV-