Monarch geneset OGS2.0

DPOGS210473
TranscriptDPOGS210473-TA1347 bp
ProteinDPOGS210473-PA448 aa
Genomic positionDPSCF300062 + 423322-425409
RNAseq coverage231x (Rank: top 44%)
Annotation
HeliconiusHMEL0215713e-17089.91% 
BombyxBGIBMGA002755-TA3e-14779.50% 
DrosophilaAos1-PA3e-7047.37% 
EBI UniRef50UniRef50_Q1HPK76e-14579.50%SUMO-1 activating enzyme n=146 Tax=Obtectomera RepID=Q1HPK7_BOMMO
NCBI RefSeqNP_001040485.11e-14579.50%SUMO-1 activating enzyme [Bombyx mori]
NCBI nr blastpgi|2984022079e-14590.11%SUMO-1 activating enzyme [Heliconius melpomene melpomene]
NCBI nr blastxgi|1140526072e-14579.50%SUMO-1 activating enzyme [Bombyx mori]
Group
Gene OntologyGO:00054881.3e-57binding
GO:00038242.9e-18catalytic activity
GO:00086416.8e-11small protein activating enzyme activity
GO:00064646.8e-11protein modification process
KEGG pathwayphu:Phum_PHUM4280305e-85 
 K10684 (UBLE1A, SAE1)maps-> Ubiquitin mediated proteolysis
InterPro domain[10-311] IPR0090361.3e-62Molybdenum cofactor biosynthesis, MoeB
[5-311] IPR0160401.3e-57NAD(P)-binding domain
[33-162] IPR0005942.9e-18UBA/THIF-type NAD/FAD binding fold
[38-62] IPR0000116.8e-11Ubiquitin/SUMO-activating enzyme E1
Orthology groupMCL14341 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210473-TA
ATGGTTGAAAATAATGAAGTCGAACTCTCAGAGGCAGAGGCTGAACAATATGATAGACAAATCCGTCTATGGGGCTTGGAATCCCAAAAGAGGTTACGCGCTTCTAAAGTTTTGATCATCGGCATGTCTGGCCTAGGAGCCGAAATAGCCAAAAATATTATACTATCAGGTGTGAAGAGTGTTTGTTTATTGGACAGTGAGAAACTCAAAGAAACAGATCTTTACTCACAGTTTTTGGCTCCTCCGGACAAAATAGGTGAAAACAGAGCCGAGACATCTTTACAGCGTGCAAGGGCTTTGAACCCAATGGTTGACGTCACTGCAGAGACGAAGGCTGTGGATGATCTTCCGGACAGCTACTTTGCGACTTTCGATATAATCTGCGCTACCGGTTTGAAGCAAGAGCAACTGGAACGAGTTAATAACATATGTCGCGACAACAACAAGAAATTTCTGTGTGGCGACGTCTGGGGCACGTTTGGATACATGTTTGCTGATTTAATTGACCATGAATATTCCGAGGAAATAGTTCAACACAAAGCTGTTAAACGTGGACCCGATGATAATGAAGCGAATGCTAGAGAAACTGTTAGTATCACTGTAAAGCGAAGAGCTATTTACGTTCCCTTACAAAACGCCTTATCTGTTGACTGGACCAAACCTGAATTACGATCTAGATTACGTAGAGGGGACCCATCATACTTTGTCATGAAGATTCTTTCAAGATTTAGAGATGAATACAACAGAAACCCTGATCCAGCGCAACGAAAAACGGAGACTGAAATATTGCTGCGTATGAGAGATGAACTTGTCAAGGAGCTGTCTCTTCCTGCTGGATTTATAAAGGATGCCTTACTGACAGATGTGTTTGGAATAGTATCTGGTGCTGCAGCGGTTGTGGGCGGAGTTATTGCCCAGGAAGTTGTGAAGGCTTCTATAGCACGATTTCCAGAAGCTGAAGATAAGATAAGGGTTGAAATGAATGTGGTGTCTGAAGTTGGCGATACAAGATACAGACTAGCAGCGGAAACAGCTGAAAAAGCTCAATTACTCACAGCTTTATTACCCGCTGCGCAGGACGCTGCGTCATATGATTTGAAAGAAATGTTACAAAGGTACAAAGATGTCATCTTATTAAATGAAGAGTTACTTGCGGGGTGCCACGTACGTAGGGCGACGCAGGAACAAACTCTGACGTCACTAAAGAACTTGCACACTATACTGCAACAAGCAGCTAGGTTGCGAGTTGGAAAATACAGCAAAATGGTTGTGAACGCATGCAGAAAAGCCGTCAGCGACAACAACACTGAGGCTCTCGTTAAAATACTACAAGCTGGGGATACTTAA

Protein sequence:

>DPOGS210473-PA
MVENNEVELSEAEAEQYDRQIRLWGLESQKRLRASKVLIIGMSGLGAEIAKNIILSGVKSVCLLDSEKLKETDLYSQFLAPPDKIGENRAETSLQRARALNPMVDVTAETKAVDDLPDSYFATFDIICATGLKQEQLERVNNICRDNNKKFLCGDVWGTFGYMFADLIDHEYSEEIVQHKAVKRGPDDNEANARETVSITVKRRAIYVPLQNALSVDWTKPELRSRLRRGDPSYFVMKILSRFRDEYNRNPDPAQRKTETEILLRMRDELVKELSLPAGFIKDALLTDVFGIVSGAAAVVGGVIAQEVVKASIARFPEAEDKIRVEMNVVSEVGDTRYRLAAETAEKAQLLTALLPAAQDAASYDLKEMLQRYKDVILLNEELLAGCHVRRATQEQTLTSLKNLHTILQQAARLRVGKYSKMVVNACRKAVSDNNTEALVKILQAGDT-