Monarch geneset OGS2.0

DPOGS216104
TranscriptDPOGS216104-TA5376 bp
ProteinDPOGS216104-PA1791 aa
Genomic positionDPSCF300182 - 198089-217316
RNAseq coverage1260x (Rank: top 10%)
Annotation
HeliconiusHMEL0093110.077.06% 
BombyxBGIBMGA009223-TA0.067.20% 
DrosophilaCG9086-PA0.037.14% 
EBI UniRef50UniRef50_UPI00022467320.045.87%UPI0002246732 related cluster n=1 Tax=unknown RepID=UPI0002246732
NCBI RefSeqXP_001606200.10.045.92%PREDICTED: similar to ubiquitin ligase E3 alpha [Nasonia vitripennis]
NCBI nr blastpgi|3454808610.045.87%PREDICTED: E3 ubiquitin-protein ligase UBR2-like [Nasonia vitripennis]
NCBI nr blastxgi|3071876690.045.41%E3 ubiquitin-protein ligase UBR2 [Camponotus floridanus]
Group
Gene OntologyGO:00082705.5e-20zinc ion binding
GO:00048425.5e-20ubiquitin-protein ligase activity
GO:00301632.6e-15protein catabolic process
KEGG pathway 
InterPro domain[117-187] IPR0139932.6e-31Zinc finger, N-recognin, metazoa
[118-186] IPR0031265.5e-20Zinc finger, N-recognin
[245-331] IPR0147191.1e-15Ribosomal protein L7/L12, C-terminal/adaptor protein ClpS-like
[249-321] IPR0037692.6e-15Adaptor protein ClpS, core
Orthology groupMCL11311 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216104-TA
ATGAGCTCGCCACCGCCTATGCAGTTGGAGGTGGAAGTGGGAGACGAGTCCATGGAGGAGGATGGAGCAGAAATGATCACGCCGGACATGTGGTTTCCGGTTCCTGCTAAAGCTGATGCTCTAGTGAAGCTATGGCAGACCAAGATGGCGGAAGGGGTGCTCTCCCCGGCGCACTTTCAGGATCATTGGCGCGTTACGGTGCCCAGGATATATTCACCACAACCTAATAGGACTTGTCTAGACTGGTCGTTCGATGAGGAGATGGCGAGCAAGTTGCTCATTCAGCCTCTGGAGCAGTTCGTGTGGGGCTCCGCTGAGAGCTCGGAGCCTCCGCCGCCGAGACGGTCCACGCTCTGCGGGAGAGTGTTCAAGCAGGGGGAACCGGCTTATAGCTGTAGAGAATGTGGTATGGACAACACGTGCGTCCTCTGTGTGGAGTGCTTCAAGGTGTCCCCTCATCGCAATCACAGATATAAGATGGGTCAGTCCGGAGGCGGCGGATGCTGCGACTGTGGAGACACAGAGGCCTGGAAGAGAGATCCCTCCTGCGACTTACACTCGGCTAAGGATAACGAAGAACAGGCTCAAGCCAGTATCAGCCCCGAGGTATTGGAGCGTATGAAGATAGTGGCGTCTGTTTGCTTACCCTACTGCTTCCGCCTCCTTACATTCGACCACGCGCCTGGACTGCCCAACGATCTGATACTCAAGGATACCGAGAGGGACTTACTTCAGATCTTGGATCAGCCCGATTGTTACTGCACTGTGCTATACAACGACGAGACTCATACCTTTGAACAGGTGATAACGACTCTGATACGTGTGATGAAGTGCACTCACCGCGACTCGGTGGAGCTGGTCAGCCTCATAGACCGCGAGGGTCGCGCTCTGGTCAAGTGCAGCTCTTTCCAGATAGCCGACAAACTGAAGAATGACTTCGAGATCTTCACATCCCGTCATGGACCAGCTCTCAAAGTGCTGGTGATGCAGGCACACGTAATCGCCCATCAAACATTCGCTATGAAACTATTGAATTGGCTTCAGAATTTCGTAAGCCAAGAACAAAGTCTCCGTCTAGCTGTTTGTCAGGTGGCGCTGGGGGAAGAGACCGGTTTATCCGGTTGGGGTTTGGCCGGAACTTCAGGTGGGGTGGCCGTGGGCGTCATGCAGAATGACTGCAAGATGTGGAAAGCTGCAAGGACGGCTTGGCATAGACTTCTCATAGCAACCACATTAATGGACTACTCCACCAAACGAACCATGGCCATACTGTTTACTAAGAATTATGCCACTATAATGAAGGACTACATCCGTGACGACCACGACCACTCCTTCTCGATATCGTCTCTGTCCGTGCAGCTCTACACCACCCCGACCCTGGCTCATCACCTCATCGCCAAACACGACGCGCTGTTCGTAGTCATGAACACGTTCGTCAGTGAATGCACAAGGAAGTGTAACGCTGAGGGCCGTCTCGAGTTTGATCGTAACCACGTGCCGATGGGTTTCAAACGTGCCCAGTTTATTCTGTACGACGTGAAGTACCTCCTGGGGTCCATACCGACGAGTTTCGACGACGACCTCCGGAAGGGCTTCCTGCACGGACTGTCGCTCATGATGAACCTGCTGGTCATGATGCAGGGCATGGACTCCGTGGTGAGGCAGATTGGACAACACATGGAGTATGAACCAGAATGGGAATCGGCTTTCAACCTGCACGTGAAGCTCGCGAACAGTATAACGCTGGCGTTGGAATGGTGCAGCGTAGAACGGTCGCTGGCTGCATCGGCGTACCGGATGGCACTCAGAAGACATGCTGATAATTTTGCTGCTGGCGACAAGTATGAACTTAGAGAGTTAGGTAACCAAAGCGCCTCAGTGATCCCATACGACGTGTCCAAAGAGCCTGTGTCGGTTCATCGACCTCTATCCCGTTTTATAGCGGGACTTCATCTTCACCTCCACAGACATGGCTTGTCGTACCACAGCAAAGAGTTCGATAGACATGACAAACCCAAACCCAAGCCAGAGGAACTCATTGAGCCGGTGCTACGTACCATGGCCATGATAGCCCAGGTCCACGCCGGTATGTGGCGTCGCAATGGCTTCGCTCTCCTCAACCAGCTGTACTTCTACCACAACGTCAAGTGTCGCACGGAGATGTACGACAGAGACGTCATCATGCTTCAGATCGGTGCTTCGTTGATCGAGAGCAACGAGTTCATAATACATGTCCTGAACAAGTTTAATCTCCTGGACTGGGCGGCTAACGACTTTGAACAGCGTACCATAGAAGACGACACGCTCAGACACACTATAAGCATGGTCGAGGAATTCCTTGGATTACTTATTACCGTGGTGGGCTCCCGCTACGTGCCGGGAGTGGGCGAGGTGACGGCAGCACAGAGAACTAAGAAGGAGATCATACAGATGCTGTGCGTTAAACCCATGCCGCACTCCGAACTCAATAGATCTCTTCCCGAAAATCAGTTGCACGAGACGGGACTCGAAGCTGTGATCAACGAGGTCGCTGACTTCGTGAAGCCGAGCGGCACGCACAATCGCGGCCTGTACAAACTGAAGCCGCACCTGTACGATGAATACGACACCTTCTTCTATCACTACACCAGGGAGGAGCTGTCCCGGAGCGAGGAGGAGCAGAGGAACAGGAGGAAAGCTGCCGGTCTTCCAGAATGCTGTCCCCCTCCTCCTCTGCCGCTGCTGTCTCCGCCGTTCCGTCTGCTGGCGAACCTGTTGCAGAGTGACGCAGCACTGCACGTCCTGCGTGTAGTTTTGACCCGCGCCCTCGACCTTAGGGCCAGGTCCTTCTCGGAGCCGCAAGTCCACAAGGCTCTCCATCTAATAGGCTACGCTCTGCGCGATGAGGAGAGTTCTCACTACGAGTTCTTGGCGTTCGCTGAGAGTGCTGCTCGCAGTGGACTGCTGGCGTTGCTGCAACGACTGGCCGCCAGCCCTAGGGTCGACGCCCACCGACCACTCGCTAAGTGGCTCCTTAATAAAATGAAGTCTCTGTTGGGACAGACCGATGACAACATGGGTGGTGACGAATGTATGGAGACGGATCAGGAAGAGAAGCCTCGCTCTGATGAAACAGCCGACGCCGAGAAAGCTCGTCGCGCCAAACTGGCGGCGGAGCGAAGGGCTAAGGTTATGGCACAAATGAAGGCTCAAATGAACAACTTCATATCAAACAACGCCATGCTGTTCAAAGAGACGACCACTGAGGCGACAGAGGAGGAGGAGAAGCAAGATCTGCTGCCGTTATACCGTGGAGCGGCTCTAGGCGTGTGGGGAGGGGGAGTCGTCGAACCAGCCAGGGTCTGCATCATGTGCCAGGAACAGGCTCGTGTTGAAGCTAAATCCGAGCCGCTGGTACTCGTAGCCTTCGCCCAACACTCCAGCGTGTTGAATCGCCGCGGTGTTGGCGCTGGGTCGGGCGGTCCCGGCGGAGCGGGGGCTGCTGGTGACGCGTGGCGGAGCGCCTGGCCCCCAGCCGGACTGGGGGCGCAGCCCCACGTGTCATGCTGTGGACACGCCCTCCACGCTAAGTGCTGGAGGAAGTATGTCGATGGAGTGCTTGACAAGGAGAAGCTTAGACCTTACCGCATCCGTCAGCCGGCTGCTTTCGACGTTGAAAAGAAGGAGTACCTGTGTCCGCTGTGCGAGCGGCTTTGTAACACGGCCGTGCCGCTGCTGCCCTCCCCTCCCCTCCCGCCGCGGCCGCCGCCCCCCCTCGCGGAGGGCACCTTCACGGACGCCGCCGAGCTGATACTGAAGCTGAAACATCAGGTTTGCTCAGAATCAGTACGTCAGTGCACTGAATACACTCAGGAGGCGCCCTGCGCGGGGACGCCTCAGGCTGGGGGGGCGGAGTCCGAGGGAGAAGAGTCCCCTGACGAGGCGGAAGTGTACGTCTCCACACACGCCGAGACTCTACTGCCTAAAGAATTTTTAGAACACTTCCATGAGAAGCCGAGGAAATATAACGACACTACTGCCGTTCTTATTGCTGAGTTTGCAGAGATGTTGCCGGGGATCTGTGGATTATCAGAATCTGGAGGTATGGTCCGTGTGGCCGCTTTGTACAGAGCGACGTCCTACACCATCTTGAGCACCAACGTCGTTCTACAGGCTGAGAATAGACCGCTGCTAGGAGACCTGCCCTCGAGACACCGAGACGCCTTGCAGGCGCTGATCAGACTGGCGGCGGTTCTGCCTCCTATATGGCCGACGCCTAAACATATTTCACACCATGCTCTCAGCTGTCTAAACACTTTAGTGAAAACATCTCCGCTATCACACGACGTATTCGGAACATTGGTGGCGCTGGTGCTGTCGGCTCCGTGTTTGTTTTCCAAAAAGGCAGCCCCGGCCAGACCCACACATTTGGCGCGACAAATAACATTGGAGGCTTTCAGGGCCACCATAACTAGAGCCCTTATAGCTGTAGACGTTACGCATTGCAACAGCGAACCGATGGAGGACACCGAGCAGGTTAACAAACCGGACTTGGAAAATCTGCTGCCATTCATGAAAGAGCTGCGGCAGGGGAATTTAGACATAGAAAACTTGAGAGCCGGTGAGGTGTGGGAGTGTGTCAAGAAGCAGTGTCACGGCTTCCTTCGCTGCTGCTGTTTGTTCTACCACTTCCTAAGCGACATCCAACCACCCAACGAGTTGACCCTGGTGGGCGGCGACACCTGGGATATCATGTGCGGGTACCTGGACCTCCCCAACACGTTCAGGGATCTCATTGACAATCCGCTGGCGAGGAACAAAGCGTCGATGTGGTCACAACTATCCACGGAGTGGTTCAAAGGAGAGCTGTCCCCGCAAATGGTATTGGACCCCAGTGAGCCACCCAGTTTGATAACACTGCCGATGGATTTTTCGGAGTTGATGAATGTGGTGTCAGAATTCTCGTGCCCTAACTCTGAGCGCGAGGACACCAAAAACCCAACCATGTGCCTTGTGTGTGGACAGATATTATGTTCGCAGAGCTACTGCTGTCAGATTGAAATACGGAAGATCGGGCGAGGCGGCGGCTCGGAGCTGGCGGGGGCGGTGGTGGCGCACGCCCTGTGGTGCGGCGCGGGGGCCGGGGTCTTCCTCAGGGTCCGCGAGTGTGAACTCCTGTTGTTGGCGGCTCCCTCCCGCGGGGCGATGCTACCGGCGCCATACCTGGACACGTATGGAGAGACTGATCAGGGCCTCCGCCGCGGGAATCCCCTCCAGCTCTGCAACGAGCGTTACCAGGCCCTACGTATGGTGTGGCTGTCGCACGGCATACACGAGAGGATCGCCCGCGGCCTGGACTCCAATATGCTGGTCACCACCACCTGGCAGAACATGTGA

Protein sequence:

>DPOGS216104-PA
MSSPPPMQLEVEVGDESMEEDGAEMITPDMWFPVPAKADALVKLWQTKMAEGVLSPAHFQDHWRVTVPRIYSPQPNRTCLDWSFDEEMASKLLIQPLEQFVWGSAESSEPPPPRRSTLCGRVFKQGEPAYSCRECGMDNTCVLCVECFKVSPHRNHRYKMGQSGGGGCCDCGDTEAWKRDPSCDLHSAKDNEEQAQASISPEVLERMKIVASVCLPYCFRLLTFDHAPGLPNDLILKDTERDLLQILDQPDCYCTVLYNDETHTFEQVITTLIRVMKCTHRDSVELVSLIDREGRALVKCSSFQIADKLKNDFEIFTSRHGPALKVLVMQAHVIAHQTFAMKLLNWLQNFVSQEQSLRLAVCQVALGEETGLSGWGLAGTSGGVAVGVMQNDCKMWKAARTAWHRLLIATTLMDYSTKRTMAILFTKNYATIMKDYIRDDHDHSFSISSLSVQLYTTPTLAHHLIAKHDALFVVMNTFVSECTRKCNAEGRLEFDRNHVPMGFKRAQFILYDVKYLLGSIPTSFDDDLRKGFLHGLSLMMNLLVMMQGMDSVVRQIGQHMEYEPEWESAFNLHVKLANSITLALEWCSVERSLAASAYRMALRRHADNFAAGDKYELRELGNQSASVIPYDVSKEPVSVHRPLSRFIAGLHLHLHRHGLSYHSKEFDRHDKPKPKPEELIEPVLRTMAMIAQVHAGMWRRNGFALLNQLYFYHNVKCRTEMYDRDVIMLQIGASLIESNEFIIHVLNKFNLLDWAANDFEQRTIEDDTLRHTISMVEEFLGLLITVVGSRYVPGVGEVTAAQRTKKEIIQMLCVKPMPHSELNRSLPENQLHETGLEAVINEVADFVKPSGTHNRGLYKLKPHLYDEYDTFFYHYTREELSRSEEEQRNRRKAAGLPECCPPPPLPLLSPPFRLLANLLQSDAALHVLRVVLTRALDLRARSFSEPQVHKALHLIGYALRDEESSHYEFLAFAESAARSGLLALLQRLAASPRVDAHRPLAKWLLNKMKSLLGQTDDNMGGDECMETDQEEKPRSDETADAEKARRAKLAAERRAKVMAQMKAQMNNFISNNAMLFKETTTEATEEEEKQDLLPLYRGAALGVWGGGVVEPARVCIMCQEQARVEAKSEPLVLVAFAQHSSVLNRRGVGAGSGGPGGAGAAGDAWRSAWPPAGLGAQPHVSCCGHALHAKCWRKYVDGVLDKEKLRPYRIRQPAAFDVEKKEYLCPLCERLCNTAVPLLPSPPLPPRPPPPLAEGTFTDAAELILKLKHQVCSESVRQCTEYTQEAPCAGTPQAGGAESEGEESPDEAEVYVSTHAETLLPKEFLEHFHEKPRKYNDTTAVLIAEFAEMLPGICGLSESGGMVRVAALYRATSYTILSTNVVLQAENRPLLGDLPSRHRDALQALIRLAAVLPPIWPTPKHISHHALSCLNTLVKTSPLSHDVFGTLVALVLSAPCLFSKKAAPARPTHLARQITLEAFRATITRALIAVDVTHCNSEPMEDTEQVNKPDLENLLPFMKELRQGNLDIENLRAGEVWECVKKQCHGFLRCCCLFYHFLSDIQPPNELTLVGGDTWDIMCGYLDLPNTFRDLIDNPLARNKASMWSQLSTEWFKGELSPQMVLDPSEPPSLITLPMDFSELMNVVSEFSCPNSEREDTKNPTMCLVCGQILCSQSYCCQIEIRKIGRGGGSELAGAVVAHALWCGAGAGVFLRVRECELLLLAAPSRGAMLPAPYLDTYGETDQGLRRGNPLQLCNERYQALRMVWLSHGIHERIARGLDSNMLVTTTWQNM-