Monarch geneset OGS2.0

DPOGS205164
Transcript: DPOGS205164-TA (2733 bp)
Protein: DPOGS205164-PA (910 aa)
Genomic position: DPSCF300197 - 249967-257104
RNAseq coverage: 875x (Rank: top 15%)
Annotation
Heliconius: HMEL009982; 0.0; 88.31%
Bombyx: BGIBMGA001267-TA; 0.0; 67.35%
Drosophila: Ube3a-PA; 0.0; 52.31%
EBI UniRef50: UniRef50_E0W2B7; 0.0; 53.86%; Ubiquitin-protein ligase E3A, putative n=9 Tax=Neoptera RepID=E0W2B7_PEDHC
NCBI RefSeq: XP_969096.1; 0.0; 59.48%; PREDICTED: similar to AGAP012366-PA [Tribolium castaneum]
NCBI nr blastp: gi|91091506; 0.0; 59.48%; PREDICTED: similar to AGAP012366-PA [Tribolium castaneum]
NCBI nr blastx: gi|91091506; 0.0; 59.48%; PREDICTED: similar to AGAP012366-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0006464; 8.7e-139; protein modification process
Gene Ontology: GO:0016881; 8.7e-139; acid-amino acid ligase activity
Gene Ontology: GO:0005622; 8.7e-139; intracellular
KEGG pathway: tca:657548; 0.0
KEGG pathway: K10587 (UBE3A, E6AP); maps to Ubiquitin mediated proteolysis
InterPro domain: [579-910] IPR000569; 8.7e-139; HECT
Orthology group: MCL12735; Single-copy universal gene

Nucleotide sequence:

>DPOGS205164-TA
ATGAACTCAAAAAGCGAAACTGACGAAGCTTCAAGGGGGAGCGAAGCATCATCGTCAAATAATAATCCAGCCCCAAGCAACTCAAAGGAACAATCCGAGGCGGGTTTATGCGCCGATACAAGTTCAAACACGACCGACATCATGAAGAGAGCCGCCGCAAGACAACTGATTGAGAGATATTTCTACCAGCTTCTAGACGGCTGCGGTAACCCTAATTGTGACAACCAGTACTGTGCTTCAAGTGGAGAGGCAAGAAACTTGACCCCAAATGAGGCTGCAGCAGAAGCAATAAAACTATTTTACAAGGAGGCTCGCCTCTGTGACACTTTACCAAATAAAGTACCCAGAACTGAAGCCAGCACATGTGACACTAGTGCTAGCGCACCATGTTCAACGTCAAATACAAAGGACAAAAACAATAGGGATAAGAAAGGTGATCCATCATGTAGTTCTTCACCAAATGACAACAAGACGGACAAGCCTGAAACTAGTCATAATATATCTCGGGAGGAGTATAAAAATCTAAGTCATAATAACGGGGCTCCGTTAACCGAGGATCGTATATATGAACTGTGTGATGAGTGTCTCCAGACTAAAAAACCTCAAGCGCTCATAAGAGCCCTCGGCGAAGCCTTCACCCAGCCGGCAATTTTATCCAGATGTTTTCAGAAAAAGGTCTCCAAATGTGATGTTAGCGAAGGGAAGAAAAAGTTAGATAAGGAATTGAAGGCGATGTGTTCTTCGAGTGACCCCGACAAGGATGTGGACTGTGTCACCGGCCCGGGGCTGGATGTGGGTTCATGTCAGAGAGCCTTTCAGTATCTTGCCAAAGTGCCATCAGAATATTATAGTTCTGCGTTAGTGACAGCGTTGAAGACGTTGGCCGAGAACATGGAAATAGATTTACGAATAACAAAGAAAATGAGTTTGGACGACGTGGTCACCTGCTTCGTGATAGCCTTCGAGGTTCCGGATCTCAGATGTAGCGATTATTTGGAGATTGCCCTGCCGGCTCTGTGCCACGCGGCCGAACATTTGCCAGTTAAAGCTCAAGCGAAACTAGCTCGTACCTGGGCCCAGCACTGTAAGGACAGTTTACGTCACATCTTGGAAACCTTGCAGCAGCTGATCACATTGAGGGTGATCTGTACCAACTACTCGAGGAATTTCCAGGTGCAGGACGACGAAACGGTTACCATGGCAACTAAACTGATGAAGATAGTGTACTTCGCAAATATGTTAGCTGGAGTGATGGAAGCGAATACGTTAAGGGAGGAACCCATAGTGATCGCCAGTCAATTGGACCCGCTAGGAGACGCCTTAGATCATTTGTACCCCCTCTCATCGATCAAGAACTCCAAACAGGCGCAGCAAGATGACCCTCTAGCTATTGAATTAGATGTAAACGTACTAGACTCTAGGAAACCATATTTGCCATTCGAAGAGTTTTATAACGAACCCCTTAGTGATAACATAGAAATGGATATAGATCTAGCTAATTGTAAAACTGAAATAGGTAGAAAGTTCTCGTTCCTGAAGTACCCGTTCATACTGACGGCGGCCACCAAGTCGCTCGGCCTCTACTACGAGAACCGTATCCGCATGTACTCGGAGCGGCGCGTGTCTCTCCTCCACGCGGTAGTAGGGGCCGCACCTCCCATGCCCTTCTTGAGGCTGAAAGTACGACGCTCGCACATCATAGATGACGCGCTTGTTGAGTTAGAGATGATAGCAATGGAACGAGCGTTGGATCTCAAAAAGCAGCTGGTGGTGGAGTTCGAGGGCGAGCAGGGCGTGGACGAGGGCGGAGTCAGCAAGGAGTTCTTCCAACTGGTCGTGGAACAGATCTTCAACCCCGACTACGGCATGTTCACACACAGGCAGGACTCGCATACTGTCTGGTTCAACCCTACGTCGTTCGAGACGGAGGCCCAGTTCACGTTAATCGGTATCGTGCTGGGGCTAGCCATATATAATAATATAATATTAGCGGTCAACTT
CCCCATGGTGGTGTACAGGAAACTCATGGGAAAGAAAGGTTCCTTCGAAGATCTCGCTGATTGGAATTCTACTTTATACAACGGTTTAAAAGACATGCTGGACTACACCGGAAGTGATTTAGAGGAGGTGTACTACCAGACGTTCAGGATATGCTACACCGATGTGTTCGGCAACAATATATTCCATGATCTCAAAGAAAACGGCGACAATATTTTCGTCACGCAGGACAATAAACAGGAATTTGTGGATCTGTACTCGGATTTCCTCCTCAACAAGTCCGTGGAGAGCCAGTTCCGCGCGTTCCGTCGCGGGTTCGTGATGGTGACGGACGAGAGTCAGCTCGGAGCGCTCTTCAGGCCCGAGGAGGTGGAGACGCTCGTCTGCGGCAGTAAGAACTTCGATTTCAACGAGCTGGAGAAGTCTACGGAGTACGACGGCGGTTATACGGCTGAGTCACAGATCATCAAAGACTTCTGGAGCATCGTGCACGGTCTGACGCTCGAGGACAAGCGGAAGCTGCTGCAGTTCACGACCGGCTCGGACAGGGTGCCCGTGGGCGGGCTGAGCCACCTGAAACTGGTCATCGCCAGAAACGGCCCCGACTCGGACCGCCTCCCCACCGCGCACACCTGCTTCAACGTGCTCCTGCTGCCCGAGTACAGCACGCGGCACAAGCTGCAGGACAGGCTCATGAAGGCCATCAGCTACTCCAAGGGCTTCGGCATGCTGTAA

Protein sequence:

>DPOGS205164-PA
MNSKSETDEASRGSEASSSNNNPAPSNSKEQSEAGLCADTSSNTTDIMKRAAARQLIERYFYQLLDGCGNPNCDNQYCASSGEARNLTPNEAAAEAIKLFYKEARLCDTLPNKVPRTEASTCDTSASAPCSTSNTKDKNNRDKKGDPSCSSSPNDNKTDKPETSHNISREEYKNLSHNNGAPLTEDRIYELCDECLQTKKPQALIRALGEAFTQPAILSRCFQKKVSKCDVSEGKKKLDKELKAMCSSSDPDKDVDCVTGPGLDVGSCQRAFQYLAKVPSEYYSSALVTALKTLAENMEIDLRITKKMSLDDVVTCFVIAFEVPDLRCSDYLEIALPALCHAAEHLPVKAQAKLARTWAQHCKDSLRHILETLQQLITLRVICTNYSRNFQVQDDETVTMATKLMKIVYFANMLAGVMEANTLREEPIVIASQLDPLGDALDHLYPLSSIKNSKQAQQDDPLAIELDVNVLDSRKPYLPFEEFYNEPLSDNIEMDIDLANCKTEIGRKFSFLKYPFILTAATKSLGLYYENRIRMYSERRVSLLHAVVGAAPPMPFLRLKVRRSHIIDDALVELEMIAMERALDLKKQLVVEFEGEQGVDEGGVSKEFFQLVVEQIFNPDYGMFTHRQDSHTVWFNPTSFETEAQFTLIGIVLGLAIYNNIILAVNFPMVVYRKLMGKKGSFEDLADWNSTLYNGLKDMLDYTGSDLEEVYYQTFRICYTDVFGNNIFHDLKENGDNIFVTQDNKQEFVDLYSDFLLNKSVESQFRAFRRGFVMVTDESQLGALFRPEEVETLVCGSKNFDFNELEKSTEYDGGYTAESQIIKDFWSIVHGLTLEDKRKLLQFTTGSDRVPVGGLSHLKLVIARNGPDSDRLPTAHTCFNVLLLPEYSTRHKLQDRLMKAISYSKGFGML-
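
The transcript and protein lengths listed above are consistent with each other: 910 codons plus one stop codon give 3 x 911 = 2733 bp. The sketch below shows one way to check that relationship and to translate a coding sequence for comparison with the protein record. It assumes the standard genetic code (the record does not name a translation table), and the helper names `translate` and `lengths_consistent` are hypothetical, introduced only for illustration. Stop codons are written as `*` here, whereas the protein record above ends with `-`.

```python
# Standard genetic code (assumed; the record does not state a translation table).
# Amino acids are listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon.

    Unknown codons (for example those containing N) become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


def lengths_consistent(transcript_bp, protein_aa):
    """Return True if the transcript is exactly the CDS plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of DPOGS205164-TA, copied from the record.
    print(translate("ATGAACTCAAAAAGCGAAACTGACGAAGCT"))
    print(translate("GGCATGCTGTAA"))
    print(lengths_consistent(2733, 910))
```

Translating the first ten codons of DPOGS205164-TA this way gives MNSKSETDEA, which matches the start of DPOGS205164-PA. To check the full record, pass the complete nucleotide sequence with line breaks removed.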