Monarch geneset OGS2.0

DPOGS205080
Transcript: DPOGS205080-TA, 1425 bp
Protein: DPOGS205080-PA, 474 aa
Genomic position: DPSCF300074 + 135804-138103
RNAseq coverage: 92x (Rank: top 62%)
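
The lengths reported above can be cross-checked with simple arithmetic. The sketch below assumes that the 1425 bp transcript is coding sequence only (it begins with ATG and ends with a stop codon) and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated on this page, and the function names are illustrative.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS of cds_bp bases, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the terminal stop codon


def genomic_span(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval (assumed convention)."""
    return end - start + 1


print(expected_protein_length(1425))        # 474, matching the 474 aa protein
print(genomic_span(135804, 138103))         # 2300
print(genomic_span(135804, 138103) - 1425)  # 875 bases outside the transcript
```

Under these assumptions, the 875 bp difference between the genomic span and the transcript would correspond to intronic sequence.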
Annotation
Heliconius      HMEL004825        0.0     73.81%
Bombyx          BGIBMGA006876-TA  1e-170  68.92%
Drosophila      CG9510-PC         8e-111  43.56%
EBI UniRef50    UniRef50_D7EI90   3e-124  48.16%  Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D7EI90_TRICA
NCBI RefSeq     XP_001605269.1    3e-125  46.28%  PREDICTED: similar to argininosuccinate lyase [Nasonia vitripennis]
NCBI nr blastp  gi|156545285      5e-124  46.28%  PREDICTED: argininosuccinate lyase-like [Nasonia vitripennis]
NCBI nr blastx  gi|270015220      3e-120  48.16%  hypothetical protein TcasGA2_TC008532 [Tribolium castaneum]
Group
Gene Ontology
  GO:0004056  6.1e-204  argininosuccinate lyase activity
  GO:0042450  6.1e-204  arginine biosynthetic process via ornithine
  GO:0003824  7.8e-119  catalytic activity
KEGG pathway
  nvi:100121658  8e-125
  K01755 (E4.3.2.1, argH) maps to:
    Arginine and proline metabolism
    Alanine, aspartate and glutamate metabolism
InterPro domain
  [1-471]    IPR009049  6.1e-204  Argininosuccinate lyase
  [12-469]   IPR008948  7.8e-119  L-Aspartase-like
  [22-306]   IPR022761  2.5e-72   Lyase 1, N-terminal
  [110-132]  IPR003031  3.1e-53   Delta crystallin
  [111-129]  IPR000362  1.4e-24   Fumarate lyase
Orthology group: MCL15050 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205080-TA
ATGTCCTCTTTAGACAGGTATCAACTTTGGGGCGGTTGCTTTGGAGAAGAACCATCATCGGTACTAAGACGTTTAAATGACTCTTTAGGTATAGATGTCAGACTCTTCCATGAAGACATCCGAGGCAGTAAGGCTTGGGCAAACGAATTACACCGCAGTGGTCACCTTTCGGAAGATGACAACACAGCTATTCAGAGCGGATTAAAAAAAGTGGAAGATGATATAGAACAAGAACTTTGTATCAACGGCCGACTCAATGATCCCGAGGAAGATATCCATTCTGTTGTTGAAAGACGTCTTCAAAAGTACGCGGGTGATGCTGCTCTGAGACTACATACCGCTAGAAGTCGCAATGACCAGTCAGCAACAAACACTAAGCTGTGGATGCTAAAATCGTTACAACATATAAAGAAAGAAATAGGTCAACTGCTATCAATTCTCATTTCACGGGCAAAAAAAGAAATTCATATCATTGCACCCGGATACACACATTTGCAACGAGCCCAACCAATCCGCTGGAGTCATTTCTTGTTAAGTTACGCATGGATGTTTCGAGACGATATTATACGGTTACAAGAGATCGTTGAAAGGCTTTCTTGTAGTCCTTTAGGCAGTGGAGCTATTGCTGGTTGTGCCTTGCAAATAGACAGAAAGAGATTGGCCGAAAGTTTAGGTTTTAAACAATGTACGCCTAACTCCATGTATGCTGTAGGATCGAGGGATCACATCGTCGAATATCTCAACTGGGCTTCGCTGTGTGGAGTTCATTTGAGTAAATTAGCCGAAGATCTGATCATATATAGCACGCAAGAATTTGGTTTTATAAGACTTTCGGATCAGTTTTCAACCGGATCCAGTCTTATGCCCCAAAAAAGAAATCCAGATGGCTTAGAACTAGTGAGGGGGGCGGCCGGTCTTCTACTTGGTGATGCATTTTCCTTTAGCTGCATATTAAAAGGCTTGCCCAGCACATACAATAAAGACTTGCAATCGGATAAAGAAGTATTATTTAGATCTTATGATAGATTGCTCGATTGCATTAAAGTTACAGCGGGAACTGTGGAAACCATGCAGATCGATGAAGAAAGGTCAGTAGGTGTATTAGATGCTGGAATGCTCGCTACTGATCTGGCCCACGTCTTAGTTCGTGGGGGAGTGCCGTTTCGTCGAGCTCATCACACTGTGGGCGCGGTCTTGCGACGAGCCGCAGAACTAGGACATGATCTTCAAACACTACCTTATCAGGAATATATTACCATATGTCCTGAATTCGGAACAGAAAAAGAATTACGAAAAATCTTTTCTTGGGAGTCAAGTGTTGAACAGTACACGACCGAGGGAGGAACATCTAAATCAGCAGTGTCGAAGCAGATAGAGAGTTTGGAGCAGTGGATCAAAGATATCACAAATAAAATCTTGATCTAA

Protein sequence:

>DPOGS205080-PA
MSSLDRYQLWGGCFGEEPSSVLRRLNDSLGIDVRLFHEDIRGSKAWANELHRSGHLSEDDNTAIQSGLKKVEDDIEQELCINGRLNDPEEDIHSVVERRLQKYAGDAALRLHTARSRNDQSATNTKLWMLKSLQHIKKEIGQLLSILISRAKKEIHIIAPGYTHLQRAQPIRWSHFLLSYAWMFRDDIIRLQEIVERLSCSPLGSGAIAGCALQIDRKRLAESLGFKQCTPNSMYAVGSRDHIVEYLNWASLCGVHLSKLAEDLIIYSTQEFGFIRLSDQFSTGSSLMPQKRNPDGLELVRGAAGLLLGDAFSFSCILKGLPSTYNKDLQSDKEVLFRSYDRLLDCIKVTAGTVETMQIDEERSVGVLDAGMLATDLAHVLVRGGVPFRRAHHTVGAVLRRAAELGHDLQTLPYQEYITICPEFGTEKELRKIFSWESSVEQYTTEGGTSKSAVSKQIESLEQWIKDITNKILI-
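
The protein record appears to be the conceptual translation of the transcript, with the stop codon shown as "-". A minimal sketch for checking this follows, assuming the standard genetic code. The `translate` helper is illustrative and not part of any database tooling, and only the first ten and last seven codons of DPOGS205080-TA are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame DNA coding sequence, rendering stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


head = "ATGTCCTCTTTAGACAGGTATCAACTTTGG"  # first 10 codons of DPOGS205080-TA
tail = "ACAAATAAAATCTTGATCTAA"           # last 7 codons, including the stop

print(translate(head))  # MSSLDRYQLW
print(translate(tail))  # TNKILI-
```

Both fragments agree with the corresponding ends of DPOGS205080-PA. Running `translate` on the full nucleotide record would extend the same check to the whole protein.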