Monarch geneset OGS2.0

DPOGS200310
Transcript: DPOGS200310-TA (3879 bp)
Protein: DPOGS200310-PA (1292 aa)
Genomic position: DPSCF300026, 71161-87241
RNAseq coverage: 849x (Rank: top 15%)
Annotation
Heliconius: HMEL006365 | 0.0 | 55.62%
Bombyx: BGIBMGA005587-TA | 0.0 | 66.13%
Drosophila: CG10254-PA | 5e-83 | 41.16%
EBI UniRef50: UniRef50_E0VN24 | 0.0 | 44.39% | Ubiquitin-conjugating enzyme E2-230k, putative n=1 Tax=Pediculus humanus corporis RepID=E0VN24_PEDHC
NCBI RefSeq: XP_001120701.1 | 0.0 | 48.58% | PREDICTED: similar to CG10254-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|350417866 | 0.0 | 45.36% | PREDICTED: hypothetical protein LOC100748178 [Bombus impatiens]
NCBI nr blastx: gi|307184457 | 0.0 | 42.24% | Ubiquitin-conjugating enzyme E2 O [Camponotus floridanus]
Group
Gene Ontology: GO:0016881 | 1.1e-22 | acid-amino acid ligase activity
KEGG pathway: ame:724799 | 0.0
    K10581 (UBE2O) maps-> Ubiquitin mediated proteolysis
InterPro domain: [1022-1283] IPR016135 | 6.6e-60 | Ubiquitin-conjugating enzyme/RWD-like
    [1053-1199] IPR000608 | 1.1e-22 | Ubiquitin-conjugating enzyme, E2
Orthology group: MCL12463 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200310-TA
ATGGCGGGTCAAAATCCAGGTTCGCCTACCGATTTTCAATATTTTTACGAGGATGAAGTGTACAAAATCAATAACCGTGGCCAAGTCGTCTTCGGTCTGGTGTTGGAGAATTATGAAGCAAACTCAAGCGACCAAGAGAGCGACATCGAAACACCCATACAAAAAGGCGAAATTCGTGTTGTCTGGCACCCTTCGGGTACCGAACGTGTTATTTCGGAAAAATCGGTTGGTTTAGCCGATCGCTCGTTGATGCCAGGAGATGTTGTCCGTCGCCTTATAGCCGGGCGGGACACCCAGCGGGGTTATTGCAGGGATATAGTGATGACAGCGGCGCTCCAGATAGTCGGCACCAAGCACGTCATACCCAACGTGGCCAGCGAGAGGCTCCAGCCTTTGGAGGAATTCACTCCAGACCTGGCTGTCTGTCTTGATTCCTGGGTCGGATCGTCAAAGTCGGTTCACAGTAAACTGCGTCTAGTGTCATCGGATGGGTCTCGGTTGGAATATCCAGATCTGGACACGTGTCCATTGGAGGACTACTCGATGAGACGTCGACGTGCAACGCCGTACTCCTCGTCGGAGTTTTACCCGGGGCAGGTGGTGTACGGGCCACTGGGAGCCCTCGACACCGCCAACTGGCTGAACATGACCAAGGAAATGAAGGCAGCCAGGAAACATAAGATGCATGACCATAAGATGTTGCTCTCCCATGGCATGCGACCTTTGGCCCTGGACGAGGGCGGGTGGCTGCGGCCCGTTATGTTCGACCCCAGTTCTCGCATCGCCAGGGACACAGCATGCCAGATGGTGAAGAGTTTATGCGATTCCTACGAAAGGACGAAGGCTGTCCTGATATTATTGACGAGCTTCCTTCCCGAGGTCGGAACTGCCGGAGAGGCGAGCGAACAGTTTCTGCAACTCTATCAGAGTCTGGCGTCCGAGGCTCCTTGGAAACAGTTCCTGGCTTTACGTGGAGTGCTGCAACAGATCGCCGACCTTATGACCAAAGAGATAGATCAACTGCATCGTCTCGAAGAAACCACGCTAACATCCGACCTCGCTCAAGACGGCGGTAATGAGAAATTGGAATCTCTTGATGAAGCTTTAAAAGCTGATTCAAACTGTCCCAGTATTAAAATAGATCCTAGTGGAGATGTCGCGTTACCAGTAGTTAGTAGCATAGATGATACTGTCACTGAAGAGAAAAACGATTCCGGAATAAGTCCAGAGCCTCAAGAGGGGCGAAGGCCTGCTGGTTTGAATGGAGCCCCGCCAGACCAGGACGATGATTCAGATAACTGGGAGAACACCAGCAGCGATGGCAGTGATACAGACAGCGGCGCCACGTGGTCGTCTCGCTGTTCGTCCGCGGCGTCCGGTTCGGTCCGCGGGGCGAGGTCCCCGCAGCTGGCTGTCCGCTTACTACGAGGCAAGAGACTGAAGCGTGCGGTCCGTAGGGCTCCGCCCGCGCCGCCCCCACGACGGACGGATAGGGTCGTCGTGGAGACGCTACACACCACCAGCAGGGCTAACGTCGTTTGGCAGGACGGTACAATTGAAATGGGTATACCCTCAACACAGTTGTATCCTATCCATCACCTGGATGGTCAGGAATGTTTCCCAGGAGACTTCGTCATCAACGGCGCCGCTAGTGTAGAGGAAACTCAACAGCTGAAGCATAGAGAATACGGGGTGGTGCAGCGAGTAGATCATCATGGGCGGACTGCCATTGTACACTGGTATAGGACATACACCAGCGTAGACGAACCAGTGCCACAGATGTTGTTTGAGAGTGAGATGAGTGTATACGATCTCAAGGATCACCCTGATTTCCAATATCGTCCGGGAACTGTGGTCATCAGAGTTGCCAACTTCACTGGCGAAGACGCCAACTGTACCGCGGGACAGGTTATAGACAATTTTCCAACTGGCCGTGTTAAAGTGTGGTGGGTTGATGGACACACCAGTATGTGCTGGCCACAGGATTTATACAAGGTCGG
TGAATATGACTCTGAAGACGGCGAGCTGTGGGGCTCCGAGGGCACTGTGTCCGAGGATTCGTGGGAAACACAGAGTTCTGCTCACGAGTTGGACCCACGTACACCTGACGCGCCTGTAGCCGCACCACCGGCCACTGCACCGACCTGGTCCGCACCTGCATCAGGTACAGCGGTGGCAACTCCTAGCGAGGAGGCTGGTCCTTTGCCGAAGCTGTTGGAGCCTCGCGTGGCGGCGCACATCGAGAGAGGACGAGTGGCTATGAGGAAACTAGAAGAGATCTTCGCTAAACAACCAACACTACAGAGCCAGGAGATTATGCGCAAGCTACTGAATCTCTATAAGGACTGCCGGTTCCTTGACAGATTGATGGGCACCACCTTCTTCCATGAAGATCACTTTTTGGGTCTATTGGAGCGGGTCCGCGAGCGTGGGGCGAGCACCCCTCGTGCTGGAGAGCGTCGTGTCCACGAACAGCTAGCTCGTCTGTTCAGCACCGAGCCCGCCGCTGACTTGGAACACGATGAGGATATGAAGATGGAAGACCGCACCCTCAGACCCATCGTAGCAGCTAATGTCACCGTGGAACCCATGGAGACAGAATTCTCATCACCAAAGAAGCCGCTGAATCTGAACATTGAACAGTCAGCGTCTTGTTCTGATACTAACGTCGCTGAAGGTCCCATTGAAAACTCTGCTGAAACAGAAGGCGAAGTTGTGTATAGCGGTTCCCGCAACGTTTGCTACAAGCTGTGCGCTCTCATACACCGACAGCTGGTCAAGGCGCACGCCGAAGTCAACCGGAGAAGGCCGCAAGAGTTGGCCGAATTCCTTAACAGCTTAATGAAAAAGCAAAAAGGGGGCGGGGAGAAGTTACCCTACCACTTTTCAGTTCAATACGAAGCCCTGGTGGTGTCGATGGTCGATCAGGCTGAGAAGGCGGAGGAGAAAGCATCCACTCATGAGGTCAAGGAACCCGTCGCCGACACGCCCAAGATGGAAACTGCTGGTCCGGAAGCGCCTCTAGCGTGTGCTGAAGGGGAGGGGGAGGGCGGTGTAGGGGGCGAGGGGTTCAGCGTGTTGGAGTCAGCACCCGCCGCACACAGATTCAGACTGTCCATGTTACAACCCTCGGAGCCGAGGAGCTTTTACTCCGCTGTGAAGAGAGAGATCAAATTGTTGAAGAGCGATCTACCACCGGGGGTGTGGGTTCGTGGCTACGAGGACCGTATAGACCTTATATCTGTGATGATAGCCGGCCCGGCCCGCACGCCCTACGAGGGCGGCCTGTTTGTGTTCGACGTTCAGCTCGGCGGGGAGTACCCTAGGGCTCCGCCCCTGTGTCACTACCACTCATACTGCTCTGACAGACTCAACCCTAATCTGTATGAGGATGGAAAGGTGTGTGTATCTCTCCTGGGGACCTGGTCGGGTCGCGGAGTCGAGGTCTGGGGGAAAGATAGTTCCTTGCTGCAGGTTATCGTGTCCCTGCAGGGGCTCATACTGAACGCAGAGCCCTACTTTAATGAAGCTGGTTACGAAAAACAGAAAGGCACCCAGCAGGGTGAGGAGAATTCGCGTATGTACAACGAGATGGTGCTGTTAAAGTTGGTGCAGTCTATGACTAAGATGGTTATGAACCCGCCGGAGCCGTTCCGCGAAGAGATCCTGCAGCATCTGCGTGCGTCCGCTGCAAGTCTCTGCCGTCGACTGGAGGGGCTGGTTTCTCTGTCCAACGGCCAGCCGTGCGACCTCCCTCCCCCCGACTACCCCCTCATCCCCGCCTCCCGCGGCTTCTGCCTCACCCTCCGCTCCTCGCTGGAGTCCTTCCGCAACGCGCTCCGTAAGAACGACATCACAGTCCCCCCCACCACGTTATAG

Protein sequence:

>DPOGS200310-PA
MAGQNPGSPTDFQYFYEDEVYKINNRGQVVFGLVLENYEANSSDQESDIETPIQKGEIRVVWHPSGTERVISEKSVGLADRSLMPGDVVRRLIAGRDTQRGYCRDIVMTAALQIVGTKHVIPNVASERLQPLEEFTPDLAVCLDSWVGSSKSVHSKLRLVSSDGSRLEYPDLDTCPLEDYSMRRRRATPYSSSEFYPGQVVYGPLGALDTANWLNMTKEMKAARKHKMHDHKMLLSHGMRPLALDEGGWLRPVMFDPSSRIARDTACQMVKSLCDSYERTKAVLILLTSFLPEVGTAGEASEQFLQLYQSLASEAPWKQFLALRGVLQQIADLMTKEIDQLHRLEETTLTSDLAQDGGNEKLESLDEALKADSNCPSIKIDPSGDVALPVVSSIDDTVTEEKNDSGISPEPQEGRRPAGLNGAPPDQDDDSDNWENTSSDGSDTDSGATWSSRCSSAASGSVRGARSPQLAVRLLRGKRLKRAVRRAPPAPPPRRTDRVVVETLHTTSRANVVWQDGTIEMGIPSTQLYPIHHLDGQECFPGDFVINGAASVEETQQLKHREYGVVQRVDHHGRTAIVHWYRTYTSVDEPVPQMLFESEMSVYDLKDHPDFQYRPGTVVIRVANFTGEDANCTAGQVIDNFPTGRVKVWWVDGHTSMCWPQDLYKVGEYDSEDGELWGSEGTVSEDSWETQSSAHELDPRTPDAPVAAPPATAPTWSAPASGTAVATPSEEAGPLPKLLEPRVAAHIERGRVAMRKLEEIFAKQPTLQSQEIMRKLLNLYKDCRFLDRLMGTTFFHEDHFLGLLERVRERGASTPRAGERRVHEQLARLFSTEPAADLEHDEDMKMEDRTLRPIVAANVTVEPMETEFSSPKKPLNLNIEQSASCSDTNVAEGPIENSAETEGEVVYSGSRNVCYKLCALIHRQLVKAHAEVNRRRPQELAEFLNSLMKKQKGGGEKLPYHFSVQYEALVVSMVDQAEKAEEKASTHEVKEPVADTPKMETAGPEAPLACAEGEGEGGVGGEGFSVLESAPAAHRFRLSMLQPSEPRSFYSAVKREIKLLKSDLPPGVWVRGYEDRIDLISVMIAGPARTPYEGGLFVFDVQLGGEYPRAPPLCHYHSYCSDRLNPNLYEDGKVCVSLLGTWSGRGVEVWGKDSSLLQVIVSLQGLILNAEPYFNEAGYEKQKGTQQGEENSRMYNEMVLLKLVQSMTKMVMNPPEPFREEILQHLRASAASLCRRLEGLVSLSNGQPCDLPPPDYPLIPASRGFCLTLRSSLESFRNALRKNDITVPPTTL-
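
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the transcript is a complete coding sequence read in frame from its first base, that the standard genetic code (NCBI translation table 1) applies, and that the trailing "-" in the protein record marks the stop codon. The names `translate` and `expected_protein_length` are hypothetical helpers introduced for illustration. The two short test strings are excerpts copied from the start and end of DPOGS200310-TA.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this table applies to the nuclear transcript shown above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Residue count for a complete CDS that ends in a single stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop, not a residue


if __name__ == "__main__":
    # First 18 bases of the transcript record.
    print(translate("ATGGCGGGTCAAAATCCA"))
    # Last 15 bases of the transcript record, including the TAG stop codon.
    print(translate("CCCACCACGTTATAG"))
    # Listed transcript length in bp.
    print(expected_protein_length(3879))
```

Under these assumptions, the first excerpt translates to MAGQNP and the last to PTTL-, matching the start and end of DPOGS200310-PA, and a 3879 bp transcript corresponds to 1293 codons, that is, 1292 residues plus the stop codon, which agrees with the listed protein length.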