Monarch geneset OGS2.0

DPOGS204808
TranscriptDPOGS204808-TA1059 bp
ProteinDPOGS204808-PA352 aa
Genomic positionDPSCF300221 - 340443-348457
RNAseq coverage2356x (Rank: top 5%)
Annotation
HeliconiusHMEL0143950.097.73% 
BombyxBGIBMGA001411-TA0.096.60% 
DrosophilaCG2924-PA3e-16173.78% 
EBI UniRef50UniRef50_Q29R094e-15973.78%CG2924, isoform A n=25 Tax=Eumetazoa RepID=Q29R09_DROME
NCBI RefSeqNP_001040223.10.096.32%ubiquitin conjugating enzyme E2 [Bombyx mori]
NCBI nr blastpgi|1140521400.096.32%ubiquitin conjugating enzyme E2 [Bombyx mori]
NCBI nr blastxgi|1140521400.096.32%ubiquitin conjugating enzyme E2 [Bombyx mori]
Group
Gene OntologyGO:00168811.2e-09acid-amino acid ligase activity
KEGG pathwaydpe:Dper_GL133852e-161 
 K10582 (UBE2Q)maps-> Ubiquitin mediated proteolysis
InterPro domain[174-339] IPR0161352.3e-37Ubiquitin-conjugating enzyme/RWD-like
[182-304] IPR0006081.2e-09Ubiquitin-conjugating enzyme, E2
Orthology groupMCL11726 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204808-TA
ATGTCAGCTAGTGTCGACGAACTGACCTGCAGATTCGTTGGCAAAAATGGAAAGAAATACGAAATACACGCAAATATTACGGAGACGTATCCTAACACGCCGCCGGTGTGGTTTGCTGACAGCGAGGATCCCATTGTCACTAATGCTGTCCAGATCCTCAGTAACACACAGGGGAGAGATAATCACGTTATAAATCAGGTGGGTATACTACTGAGAGAGCTCTGCAAACTACACGGTGTCCCGGAACCACCTGATTTGGACTCATTGTCACTGCCAGTACATCCGGCGCCTCATCAGAGAGTGCCAAGTGTGACATCAAACGGTGCTGAGTCCGGTACTGAGGAGGATGAGGAGATGGCTGCCGAGGAAGACGAGTCAGAGGGTGAGGATGACCTGCCGCTAGAAATGGTTGATGATGCTGGCAGGAGCAACAAGGACGACATGGAAACAGAGCACCTAGCGACACTAGAGCGGTTGAGACAGAACCAGCGACAGGATTACCTGTCGGGCAGCGTCTCGGGCAGTCTACAGGCCACGGACAGACTGATGAAGGAGCTCAGAGACATATACCGCTCACATTCCTTCAAGAACAACATGTACTCTATAGAATTAGTGAATGATTCGCTGTATGAATGGAACATAACGCTGCGTTCCGTGGACCCTGACAGTCCGCTGCACAATGACCTGTTACTCCTCAAGGAGAAGGAGGGAAAAGACTCTATACTCCTTAATATAATGTTCAAAGAAACCTACCCCTTCGAACCGCCATTCGTGAGGGTCGTATACCCTGTTATATCAGGTGGATATGTATTAGTGGGGGGTGCTATCTGTATGGAACTGCTGACAAAACAAGGCTGGTCGTCAGCGTACACAGTAGAAGCTGTTATCATGCAGATAGCCGCTACCCTGGTGAAGGGGAAGGCCCGTATACAGTTTGGGGCGTCGAAGGTCGTGTCGCAGACTCAGTATAGTCTAGCTCGAGCTCAACAGAGCTTCAAGTGCCTAGTGCAGATACATGAGAAAAATGGCTGGTTCACTCCGCCGAAGGAGGATGGCTAA

Protein sequence:

>DPOGS204808-PA
MSASVDELTCRFVGKNGKKYEIHANITETYPNTPPVWFADSEDPIVTNAVQILSNTQGRDNHVINQVGILLRELCKLHGVPEPPDLDSLSLPVHPAPHQRVPSVTSNGAESGTEEDEEMAAEEDESEGEDDLPLEMVDDAGRSNKDDMETEHLATLERLRQNQRQDYLSGSVSGSLQATDRLMKELRDIYRSHSFKNNMYSIELVNDSLYEWNITLRSVDPDSPLHNDLLLLKEKEGKDSILLNIMFKETYPFEPPFVRVVYPVISGGYVLVGGAICMELLTKQGWSSAYTVEAVIMQIAATLVKGKARIQFGASKVVSQTQYSLARAQQSFKCLVQIHEKNGWFTPPKEDG-