Monarch geneset OGS2.0

DPOGS209010
Transcript: DPOGS209010-TA (1017 bp)
Protein: DPOGS209010-PA (338 aa)
Genomic position: DPSCF300209 - 69929-72423
RNAseq coverage: 333x (Rank: top 35%)
Annotation
Heliconius      HMEL002539        5e-176  89.14%
Bombyx          BGIBMGA012550-TA  9e-169  82.61%
Drosophila      CG13343-PA        5e-123  64.76%
EBI UniRef50    UniRef50_Q7ZVX6   1e-126  66.03%  NEDD8-activating enzyme E1 catalytic subunit n=96 Tax=Eukaryota RepID=UBA3_DANRE
NCBI RefSeq     XP_001662412.1    2e-138  70.79%  ubiquitin-activating enzyme E1c [Aedes aegypti]
NCBI nr blastp  gi|157132025      4e-137  70.79%  ubiquitin-activating enzyme E1c [Aedes aegypti]
NCBI nr blastx  gi|157132025      3e-133  70.79%  ubiquitin-activating enzyme E1c [Aedes aegypti]
Group
Gene Ontology:
  GO:0005488  3e-53    binding
  GO:0016881  4.3e-28  acid-amino acid ligase activity
  GO:0003824  1e-24    catalytic activity
  GO:0008641  3.6e-23  small protein activating enzyme activity
  GO:0006464  3.6e-23  protein modification process
  GO:0005524  3.6e-23  ATP binding
  GO:0045116  6.5e-09  protein neddylation
KEGG pathway:
  aag:AaeL_AAEL012306  6e-138
  K10686 (UBE1C, UBA3)  maps -> Ubiquitin mediated proteolysis
InterPro domain:
  [1-325]    IPR009036  5.4e-91  Molybdenum cofactor biosynthesis, MoeB
  [224-275]  IPR016040  3e-53    NAD(P)-binding domain
  [150-220]  IPR023318  4.3e-28  Ubiquitin activating enzyme, alpha domain
  [1-122]    IPR000594  1e-24    UBA/THIF-type NAD/FAD binding fold
  [181-245]  IPR000127  3.6e-23  Ubiquitin-activating enzyme repeat
  [128-171]  IPR019572  2.3e-12  Ubiquitin-activating enzyme
  [276-315]  IPR014929  6.5e-09  E2 binding
Orthology group: MCL14127 Single-copy universal gene

Nucleotide sequence:

>DPOGS209010-TA
ATGGGTTTTAAGAAAATACACATAATTGATATGGATACTATTGAACTTTCAAACCTCAACAGACAGTTCCTGTTCAGAAAGAATGATATTGGTTTATCAAAAGCTAAATGTGCAGTAGAATTTGTAAATAAAAGAGTGCCAGGTTGTGAAGCAGTGGCCCATCACTGTTCCATTCAAGATATGGATGAAGGATTTTATCGTCAATTCCATATTGTAGTGTGCGGTCTAGACTCCATTGTGGCAAGGCGCTGGTTAAACGGGATGCTTATGTCTCTACTTCAATATAATGATGACAGAACTTTAGATCAGAGCAGCGTTATCCCATTGGTGGATGGAGGCACTGAAGGGTTCAAGGGCAATGCAAGAGTCATATTACCAGGAATGAGCGCTTGTATTGAATGTACCCTAGATCTATATCCTCCTCAGAAAACATTTCCATTATGTACCATTGCTAACACACCAAGGTTACCAGAGCATTGTGTGGAATATGTCAAAGTACTTCAATGGGGTAAAGAAAATCCATGGGGTTCTTCAACTACCTTGGATGGTGATGATCCTCAACATGTAGCGTGGGTATATGAAAAGGCCCAAGAGAGGGCTATGAAGTATGGAATCACATCGGTAACATATAGATTAACACAGGGTGTATTGAAAAATATCATTCCTGCTGTCGCCAGCACTAATGCTGCTATAGCTGCTGCTTGTGCTACTGAGGTTTTTAAACTAGCGTCATCCTGTTGCATTAATATGAACAACTATATGGTGTTAAATATGTCGGATGGTTTGTACACATATACTTTTAATGCTGAAAGAAGACAAGATTGTGTTGCGTGTAGTAATTCTACAAGGACTATGGAAATTGACTGTAATGCCACTCTACAAGCTATTTATGATAAGTTATGTGAAGATAGAGGTTATTTAATGAAGAGTCCAGCTCTCATGAAAAACCAATTCGAATCGTGTGAAATAGATATCGAAGTTCGAACCCAATTAAAAGACACCTACGCTGAAAAATGA

Protein sequence:

>DPOGS209010-PA
MGFKKIHIIDMDTIELSNLNRQFLFRKNDIGLSKAKCAVEFVNKRVPGCEAVAHHCSIQDMDEGFYRQFHIVVCGLDSIVARRWLNGMLMSLLQYNDDRTLDQSSVIPLVDGGTEGFKGNARVILPGMSACIECTLDLYPPQKTFPLCTIANTPRLPEHCVEYVKVLQWGKENPWGSSTTLDGDDPQHVAWVYEKAQERAMKYGITSVTYRLTQGVLKNIIPAVASTNAAIAAACATEVFKLASSCCINMNNYMVLNMSDGLYTYTFNAERRQDCVACSNSTRTMEIDCNATLQAIYDKLCEDRGYLMKSPALMKNQFESCEIDIEVRTQLKDTYAEK-
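
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein above can be cross-checked by translating the coding sequence. The sketch below assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon. The names `translate`, `CODON_TABLE`, and `stop_symbol` are hypothetical helpers introduced here for illustration; only the first ten and last five codons of DPOGS209010-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 15 nt of DPOGS209010-TA, copied from the record above.
first_codons = "ATGGGTTTTAAGAAAATACACATAATTGAT"
last_codons = "TACGCTGAAAAATGA"

print(translate(first_codons))  # MGFKKIHIID
print(translate(last_codons))   # YAEK-

# Length check: 338 residues plus one stop codon account for 1017 nt.
print((338 + 1) * 3)  # 1017
```

Passing the full nucleotide sequence to `translate` should, under the same assumptions, reproduce the DPOGS209010-PA record including the trailing "-".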