Monarch geneset OGS2.0

DPOGS210588
TranscriptDPOGS210588-TA1170 bp
ProteinDPOGS210588-PA389 aa
Genomic positionDPSCF300168 - 447257-450087
RNAseq coverage352x (Rank: top 33%)
Annotation
HeliconiusHMEL0091680.082.91% 
BombyxBGIBMGA013537-TA9e-10588.13% 
DrosophilaCG1749-PA1e-13964.84% 
EBI UniRef50UniRef50_Q6IVA45e-13666.93%Ubiquitin-like modifier-activating enzyme 5 n=21 Tax=Eukaryota RepID=UBA5_CHICK
NCBI RefSeqNP_001138805.12e-17779.90%ubiquitin-like modifier-activating enzyme 5 [Bombyx mori]
NCBI nr blastpgi|2238901803e-17679.90%ubiquitin-like modifier-activating enzyme 5 [Bombyx mori]
NCBI nr blastxgi|2238901802e-17579.90%ubiquitin-like modifier-activating enzyme 5 [Bombyx mori]
Group
Gene OntologyGO:00054881.2e-58binding
GO:00038245.3e-36catalytic activity
KEGG pathway 
InterPro domain[26-300] IPR0090365.7e-61Molybdenum cofactor biosynthesis, MoeB
[37-294] IPR0160401.2e-58NAD(P)-binding domain
[65-206] IPR0005945.3e-36UBA/THIF-type NAD/FAD binding fold
Orthology groupMCL14198 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210588-TA
ATGTCTTCTTTAGAAGAATTGCAGAAAAAAGTGTCTGAATTGGAAGCTAAACTCGCAGCAGTGAAGGGAAATGTTGGAGCAGTCCGACAAAAAATCGAAGTTATGTCTTCTGAAGTAGTAGATTCCAATCCCTACAGCCGACTCATGGCTTTAAAGCGTATGGGTATAGTGAACAACTATGAGAAGATCCGTGAGATGTCAGTAGCTGTAGTCGGTGTGGGTGGAGTCGGCAGTGTCACTGCGGAGATGTTGACAAGATGTGGTATCGGAAAGTTAATCCTCTTCGATTATGACAAAGTGGAGTTGGCCAACATGAACCGTCTGTTCTTCCAACCACATCAGGCCGGCCTGAGTAAGGTAGATGCAGCGTCAGCCACATTAAGAGCCATCAACCCAGATGTCACCATTGATGCATACAACTACAATATCACTACGGTCGACAATTTCCAAAATTTCTGTGATACTATTAGAACAGGCAGCTTAACTGAAGGCCCCGTAGACCTGGTGCTGAGCTGTGTCGATAATTTTGAGGCTCGCATGGCCATCAACATGGCCTGCAATGAACTCAATCAGAAGTGGTTTGAGTCTGGTGTTAGTGAGAATGCTGTGTCTGGACATATACAGTTCATCATACCGGGGGAGACTGCTTGCTTTGCATGCGCGCCGCCGCTGGTGGTGGCGTCTAATATCGACGAGAAGACCTTGAAGCGTGATGGAGTGTGCGCCGCCTCGCTCCCCACCACTATGGGTATCGTGGCCGGATTCCTCGTTCAGAACACGCTGAAGTATCTGCTGAGTTTCGGTAACGTGTCCCACTATTTGGGGTACAGCGCTATGACTGACTACTTTCCTCGGATGAGTTTGAAGCCGAACCTTCAGTGTGACGACTCGTTCTGCCGCCAGCGTCAGTCGGAGTACTCGTCGAGACCTGCGGTGGAGCTCGCCACCGAGGTCACGGAAGACCCCGCGCCCTTACACGAAGACAACGAGTGGGGTATCAGTCTGGTGGATGAGAACTCTCCTGAAGAAGAGTCCGTGGGATTGAAGCTCGTGGAAGGGGTTCAGGTGGCGTACGCGATTGCGGGAGACAGTAGTACACCGGAGACCAGCTCGGCGGTGGCGGCCGAGGAAGACCTGGAGGAGCTGATGAGGAAGATGAAGAATATGTAG

Protein sequence:

>DPOGS210588-PA
MSSLEELQKKVSELEAKLAAVKGNVGAVRQKIEVMSSEVVDSNPYSRLMALKRMGIVNNYEKIREMSVAVVGVGGVGSVTAEMLTRCGIGKLILFDYDKVELANMNRLFFQPHQAGLSKVDAASATLRAINPDVTIDAYNYNITTVDNFQNFCDTIRTGSLTEGPVDLVLSCVDNFEARMAINMACNELNQKWFESGVSENAVSGHIQFIIPGETACFACAPPLVVASNIDEKTLKRDGVCAASLPTTMGIVAGFLVQNTLKYLLSFGNVSHYLGYSAMTDYFPRMSLKPNLQCDDSFCRQRQSEYSSRPAVELATEVTEDPAPLHEDNEWGISLVDENSPEEESVGLKLVEGVQVAYAIAGDSSTPETSSAVAAEEDLEELMRKMKNM-