Monarch geneset OGS2.0

DPOGS213077
TranscriptDPOGS213077-TA1242 bp
ProteinDPOGS213077-PA413 aa
Genomic positionDPSCF300016 - 371492-375549
RNAseq coverage1612x (Rank: top 8%)
Annotation
HeliconiusHMEL0097697e-9287.15% 
Bombyx% 
DrosophilaCG1315-PA5e-14759.04% 
EBI UniRef50UniRef50_B3M0W99e-14659.18%GF16645 n=4 Tax=Drosophila RepID=B3M0W9_DROAN
NCBI RefSeqXP_967315.13e-16164.23%PREDICTED: similar to argininosuccinate synthetase [Tribolium castaneum]
NCBI nr blastpgi|910917285e-16064.23%PREDICTED: similar to argininosuccinate synthetase [Tribolium castaneum]
NCBI nr blastxgi|910917285e-15464.62%PREDICTED: similar to argininosuccinate synthetase [Tribolium castaneum]
Group
Gene OntologyGO:00055242.6e-235ATP binding
GO:00040552.6e-235argininosuccinate synthase activity
GO:00065262.6e-235arginine biosynthetic process
KEGG pathwaytca:6556598e-161 
 K01940 (E6.3.4.5, argG)maps-> Arginine and proline metabolism
    Alanine, aspartate and glutamate metabolism
InterPro domain[1-406] IPR0015182.6e-235Argininosuccinate synthase
[164-367] IPR0240741.8e-78Argininosuccinate synthetase, catalytic/multimerisation domain body
[6-163] IPR0147291.8e-51Rossmann-like alpha/beta/alpha sandwich fold
Orthology groupMCL15605 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213077-TA
ATGAGTAAAGATCTCGTTATTTTGGCCTACTCCGGCGGTTTGGACACCAGCTGCATTTTAAAATGGTTAATTCAAAAAAATTACGATGTCGTCTGCTACATGGCCGATATTGGACAAGACGAAGATTTCGAAAAGGCACGCCAAAAAGCAAAACTCATAGGAGCAAAGGATGTCATAATTGAAGATTTGCGCAATGATTTCGTCTCCAACTATATGTTTCCCGCTATTCAGATGGGTCTGGTTTATGAGAGTCGATACTATTTAGGGACCTCCGTGGCACGGCCTTGTATCTCCGTTGGAATAGTAAATGCTGCAAAAAAATTAGGCGCCAAGTATATTTCTCATGGTGCTACAGGCAAGGGTAACGATCAAGTACGCTTTGAATTGAGCGTGTATTCGTTATGGCCGGAAGGGAAGGTGATCGCACCATGGCGTCAACCGGAATTCTTTAATCGATTTCAAGGAAGAAAAGATCTAATCGAATTTGCTAAACAAGAAAACATACCGGTTTCCGCTACACCAAAGGCCCCTTGGTCGACTGATGAAAACATTATGCATATCAGTTATGAATCGGGAGTCCTAGAAGACCCAACAGCGGAGCCTCCTAGTGGGATTTACAAAATGACTCGCGATCTGAACCAAGCTGAGGATTATCCGAGCACCATGGATATAACATTCGAAAAAGGTTTGCCAGTATCAGTCAAAATTCCCAGCGGTCAAGAGAAAGCTTTAATTATAAACGATCCGCTGAATTTGGTGAGGACGTTAAACAAACTCGGCGGTCAACATGGCGTCGGTCGCATTGATATAGTGGAAAACCGTTTCCTGGGATTAAAGTCTCGAGGACTCTACGAGACCCCGGCTGGGACTATACTCCACGTTGCACATCTGGACTTGGAGACTTATGCTCTTGATAAGGAAGTTTTAAGGTTAAAAAAATACCTTCAAGAAAAAATGTCAGACTTTGTTTATAATGGATTTTGGTTTGCCCCGGAAGCCAGATACACGAGGAAGTGTTTAGAATTGTCACAAGAATCAGTTTCCGGAACTGTTACAGTCCAAGTCTTCAAAGGGAACGTAATAGTCTTGGCAAGAAAAAGTGCCAAAAGTTTGTACAATCAGGAATTGGTTTCCATGGACATTGCCGGAGGATTTTCTCCTGAAGATAGCACTGGTTTTATTAACATCAATGCAGTGAGACTGAAGGAATATGCTCGTTTTGCTGGTGGCCAGGAATTTTAA

Protein sequence:

>DPOGS213077-PA
MSKDLVILAYSGGLDTSCILKWLIQKNYDVVCYMADIGQDEDFEKARQKAKLIGAKDVIIEDLRNDFVSNYMFPAIQMGLVYESRYYLGTSVARPCISVGIVNAAKKLGAKYISHGATGKGNDQVRFELSVYSLWPEGKVIAPWRQPEFFNRFQGRKDLIEFAKQENIPVSATPKAPWSTDENIMHISYESGVLEDPTAEPPSGIYKMTRDLNQAEDYPSTMDITFEKGLPVSVKIPSGQEKALIINDPLNLVRTLNKLGGQHGVGRIDIVENRFLGLKSRGLYETPAGTILHVAHLDLETYALDKEVLRLKKYLQEKMSDFVYNGFWFAPEARYTRKCLELSQESVSGTVTVQVFKGNVIVLARKSAKSLYNQELVSMDIAGGFSPEDSTGFININAVRLKEYARFAGGQEF-