Monarch geneset OGS2.0

DPOGS210804
TranscriptDPOGS210804-TA1350 bp
ProteinDPOGS210804-PA449 aa
Genomic positionDPSCF300027 - 850294-852032
RNAseq coverage532x (Rank: top 24%)
Annotation
HeliconiusHMEL0050250.091.76% 
BombyxBGIBMGA007119-TA0.089.72% 
DrosophilaCG3590-PA0.074.48% 
EBI UniRef50UniRef50_P305663e-17668.97%Adenylosuccinate lyase n=260 Tax=cellular organisms RepID=PUR8_HUMAN
NCBI RefSeqXP_002048667.10.074.71%GJ14098 [Drosophila virilis]
NCBI nr blastpgi|1839792620.090.39%similar to CG3590-PA [Papilio xuthus]
NCBI nr blastxgi|1839792620.090.39%similar to CG3590-PA [Papilio xuthus]
Group
Gene OntologyGO:00040182.7e-107N6-(1,2-dicarboxyethyl)AMP AMP-lyase (fumarate-forming) activity
GO:00091522.7e-107purine ribonucleotide biosynthetic process
GO:00038241.2e-94catalytic activity
KEGG pathwaydvi:Dvir_GJ140980.0 
 K01756 (E4.3.2.2, purB)maps-> Alanine, aspartate and glutamate metabolism
    Purine metabolism
InterPro domain[18-422] IPR0047692.7e-107Adenylosuccinate lyase
[1-424] IPR0089481.2e-94L-Aspartase-like
[68-267] IPR0227616.4e-22Lyase 1, N-terminal
[12-71] IPR0240833.7e-14L-Aspartase-like, N-terminal
[340-423] IPR0194681.3e-13Adenylosuccinate lyase C-terminal metazoa/fungi
[113-131] IPR0003625.3e-12Fumarate lyase
Orthology groupMCL12229 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210804-TA
ATGTTGCATTACACGCGTATACGAGTATATGAGAACCAAGATCTCGGCTTAGACATCACCGATGAACAGATCAAGGAACTGGAGTCGGCAATCCACGACATAGATTTCCCATCAGCCGCAGAACATGAGAAGAAGGTCCGTCATGACGTCATGGCGCACGTCCATACCCTCGCTGAACGCTGTCCGTTAGCGGCTCCTATCATACACCTAGGAGCTACTTCATGCTACGTTGGCGATAATACCGATCTGATTGTATTGAAACACGGCTTGGACTTACTCCTGCCCCGGCTCGCTGCTGTTATAAGCCAACTGTCGAAATTCTCCGATGAATACAAATCGCTTCCGATTTTGGGATTCACCCATTTACAACCAGCTCAGTTAACAACAGTTGGAAAGAGAGCTTCGTTGTGGCTTAGCGACCTTCTTATGGACGAGCGTGCATTGTCCCGGGCAAGAGAAGATTTAAGGTTCAGGGGAGTCAAAGGCACTACAGGAACTCAAGCTTCGTTCTTACAACTGTTTAAAGGTGACACTAGTAAGGTTAGGGCTCTGGATAAGAGGGTCGCAGAGCTTGCTGGATTTGATAAACGTTATCTTGTCACCGGTCAGACGTACTCAAGGAAGGTAGATTTAGAAGTTATAGCGGCGTTATCTGGGTTAGGAGCTACTGTCCATAAAATGTGCTCTGACATCCGTATTCTCGCTTCTCGTAAAGAATTGGAGGAACCGTTTGAGACTTCTCAAATAGGATCCAGCGCGATGCCCTACAAAAGAAATCCTATGAGGTCTGAACGTTGCTGTGCCCTGGCTCGGCATTTGATAACGCTTCATGCGAATGCTGCCAACACCCACGCCGTCCAATGGATGGAACGTACTTTAGATGACTCTGCTAACCGACGCATCACTTTAGCTGAAGCATTTTTGACTGCCGACGCGACTTTGCTTACTCTCCTTAATATTTGTCAAGGGCTGGTGGTGTACCCAAAAGTAATTGCTCGTTATATTGCACAGGAGCTCCCGTTCATGGCAACGGAAAATATTATAATGGCAATGGTACAATCTGGTGGTGATCGGCAGGTTTGTCATGAAAAAATACGAGTTTTGTCCCACGAAGCCGGAGCGGTAGTCAAACAGGAAGGAAAAGATAATGATTTAATAGATCGCATCAAAAATGATAAATATTTTGCTCCTATCATACCACAGCTTGACAAAATATTAGACGCGTCTACTTTTATTGGTCGAGCTCCTGAGCAAGTCACGGAATTTTTGGAAGAAGAAGTATATCCAGTTCTTGCAAAGTACAAGAACTCATTACTTGAAGTCGAGAAACCAGTTACTCTAAATATATAA

Protein sequence:

>DPOGS210804-PA
MLHYTRIRVYENQDLGLDITDEQIKELESAIHDIDFPSAAEHEKKVRHDVMAHVHTLAERCPLAAPIIHLGATSCYVGDNTDLIVLKHGLDLLLPRLAAVISQLSKFSDEYKSLPILGFTHLQPAQLTTVGKRASLWLSDLLMDERALSRAREDLRFRGVKGTTGTQASFLQLFKGDTSKVRALDKRVAELAGFDKRYLVTGQTYSRKVDLEVIAALSGLGATVHKMCSDIRILASRKELEEPFETSQIGSSAMPYKRNPMRSERCCALARHLITLHANAANTHAVQWMERTLDDSANRRITLAEAFLTADATLLTLLNICQGLVVYPKVIARYIAQELPFMATENIIMAMVQSGGDRQVCHEKIRVLSHEAGAVVKQEGKDNDLIDRIKNDKYFAPIIPQLDKILDASTFIGRAPEQVTEFLEEEVYPVLAKYKNSLLEVEKPVTLNI-