Monarch geneset OGS2.0

DPOGS214809
TranscriptDPOGS214809-TA1821 bp
ProteinDPOGS214809-PA606 aa
Genomic positionDPSCF300059 + 394437-398110
RNAseq coverage699x (Rank: top 18%)
Annotation
HeliconiusHMEL0172890.084.26% 
BombyxBGIBMGA012116-TA0.086.70% 
DrosophilaUba2-PA0.052.77% 
EBI UniRef50UniRef50_Q7Q8G90.058.70%AGAP008637-PA n=11 Tax=Endopterygota RepID=Q7Q8G9_ANOGA
NCBI RefSeqXP_001660971.10.062.52%ubiquitin-activating enzyme E1 [Aedes aegypti]
NCBI nr blastpgi|1571268390.062.52%ubiquitin-activating enzyme E1 [Aedes aegypti]
NCBI nr blastxgi|3454792490.061.69%PREDICTED: SUMO-activating enzyme subunit 2-like [Nasonia vitripennis]
Group
Gene OntologyGO:00054881.5e-73binding
GO:00038241.7e-39catalytic activity
GO:00086416.3e-25small protein activating enzyme activity
GO:00064646.3e-25protein modification process
GO:00055246.3e-25ATP binding
KEGG pathwayaag:AaeL_AAEL0106410.0 
 K10685 (UBLE1B, SAE2, UBA2)maps-> Ubiquitin mediated proteolysis
InterPro domain[4-523] IPR0090369.9e-100Molybdenum cofactor biosynthesis, MoeB
[377-439] IPR0160401.5e-73NAD(P)-binding domain
[170-375] IPR0232801.6e-64Ubiquitin-like 1 activating enzyme, catalytic cysteine domain
[19-150] IPR0005941.7e-39UBA/THIF-type NAD/FAD binding fold
[337-401] IPR0001276.3e-25Ubiquitin-activating enzyme repeat
[156-200] IPR0195725.3e-23Ubiquitin-activating enzyme
Orthology groupMCL14102 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214809-TA
ATGGTTGCGAGAGTAGCTGGTGTGTTTGACGAAAAGCTTACTGAAGCCATTGCAAATTCTAAAATCTTAGTAGTCGGTGCCGGCGGTATAGGTTGTGAAATATTAAAGAATCTCGTTTTGACAGGATTCCCTCAAATTGAAATCATCGACCTTGATACAATCGACGTAAGCAATCTAAATAGACAATTTTTGTTTCACAAAGAGCATGTGGGGAAATCAAAGGCACAGGTGGCCAAAGACAGTGCACTCAGTTTCAATCCCAACGTAAATATAGTTGCACATCATGACAGTGTTATTAGTAATGACTATGGGGTGAGTTATTTCAAGCAGTTCAATATTGTCCTGAATGCCTTGGATAACCGTGTTGCCAGAAATCATGTCAACAGAATGTGTCTTGCTGCAAACGTTCCTCTTATTGAAACGGGAACAGCTGGTTACGCTGGACAGGTGGAGCTTATAAAGAAGGGTGTGACACAGTGTTACGAATGCCAACCGAAGGCTCCACAAAAATCCTTCCCAGGTTGCACTATAAGGAACACCCCGTCTGAACCGATCCACTGCATTGTATGGGCCAAGCATCTTTTCAATCAACTGTTTGGTGAAGAGGACCCTGACCAGGATGTCAGTCCCGATACAGCTGACCCAGAAGCTGCGGGGGATGCAGGTTCAACTGCTCTAACATCAGAGAGCAGCTCAGGAAACGTTGAGAGGAAAAGTACAAGAACATGGGCCGCGGAAACCAATTATGATCCAGAAAAGTTATTTGCTAAGTTATTTGGTGATGATATCCGGTACCTGCTGTCAATGGAGAATCTGTGGAAGAAACGCAGGCCACCCACACCGTTATCCTGGGATAGCTTACCAGGGAAAGATAATATAGAAATACAACATTCAGGGTTGCCAGATCAAAGAGTGTGGTCTGTGTATGAATGTGCTCAGGTATTTGCTGCCAGTTGCAAAGCTCTTCAAACAGATCTTAAAAGTCGTCCTGAAGGTGATCATCTGGTTTGGGATAAAGATGAAAAGAGTGCTATGGACTTTGTCACTGCCTGTGCTAATATCAGATCACATATTTTCAATATTCCACTCAAATCACGATTTGAAATTAAATCTATGGCTGGTAATATAATACCAGCAATTGCCACAGCTAATGCAATCGTGGCGGGTTTGGCAGTATTACGCGCGCAGGCGTTACTAAAAGGAGAGCTTGAAACTTGTACTAGTGTTTATCTAAGACCTAAAGTCAACCACCGCGGACAACTATTTGTACCCGAAAAAACTTTAACACCACCAAATCCTAAATGTTATGTGTGTTCTCCGAAACCGGAAGTAGCATTAGCCTGTAACCTGAAACATCTTACACTTAAAGACCTCAATACGGCGTTCAAAGAAGGTCTTAACATGCAGGCTCCTGACGCTACAGTGGAAGGCAAAGGTCTTGTTGTACTCTCATCTGAGCCGGGCGAAACTGATCACAACAACGAAAAGACTTTAGAAGAAATCGGTCTAAACGACGGCTGTGCCTTACTGGTCGACGATTTCCTGCAAAACTACGAAGTACGAGTGCGCCTGCAGCAGGAGGACGAGGAAAAAACATGGCGCTTAGTTACAGACGCAGATTCGCCAATGCTCGGCCCGAAAGAGGAAAAGACCGCCAACGGTTCGAGCGGTTCCGAACCGAAACCCGGCCCGTCACGCTCCAAGGAAGACAGCGATAGTGACATGGAAATTATCGAGGAGGACGATGACGGTGAACCGAAACCGAAACCGCCAAAACGTAGGCGAACCGAAATGACCGATGAAGTAGTCGAACTCTGCTAG

Protein sequence:

>DPOGS214809-PA
MVARVAGVFDEKLTEAIANSKILVVGAGGIGCEILKNLVLTGFPQIEIIDLDTIDVSNLNRQFLFHKEHVGKSKAQVAKDSALSFNPNVNIVAHHDSVISNDYGVSYFKQFNIVLNALDNRVARNHVNRMCLAANVPLIETGTAGYAGQVELIKKGVTQCYECQPKAPQKSFPGCTIRNTPSEPIHCIVWAKHLFNQLFGEEDPDQDVSPDTADPEAAGDAGSTALTSESSSGNVERKSTRTWAAETNYDPEKLFAKLFGDDIRYLLSMENLWKKRRPPTPLSWDSLPGKDNIEIQHSGLPDQRVWSVYECAQVFAASCKALQTDLKSRPEGDHLVWDKDEKSAMDFVTACANIRSHIFNIPLKSRFEIKSMAGNIIPAIATANAIVAGLAVLRAQALLKGELETCTSVYLRPKVNHRGQLFVPEKTLTPPNPKCYVCSPKPEVALACNLKHLTLKDLNTAFKEGLNMQAPDATVEGKGLVVLSSEPGETDHNNEKTLEEIGLNDGCALLVDDFLQNYEVRVRLQQEDEEKTWRLVTDADSPMLGPKEEKTANGSSGSEPKPGPSRSKEDSDSDMEIIEEDDDGEPKPKPPKRRRTEMTDEVVELC-