Monarch geneset OGS2.0

DPOGS209487
TranscriptDPOGS209487-TA2115 bp
ProteinDPOGS209487-PA704 aa
Genomic positionDPSCF300127 - 429686-438162
RNAseq coverage1168x (Rank: top 11%)
Annotation
HeliconiusHMEL0162710.068.09% 
BombyxBGIBMGA007436-TA0.078.25% 
Drosophilabur-PB0.059.51% 
EBI UniRef50UniRef50_Q17JX50.060.39%Gmp synthase n=17 Tax=Bilateria RepID=Q17JX5_AEDAE
NCBI RefSeqXP_002427615.10.062.16%GMP synthase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420138530.062.16%GMP synthase, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420138530.062.16%GMP synthase, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00039224.4e-58GMP synthase (glutamine-hydrolyzing) activity
GO:00055244.4e-58ATP binding
GO:00061774.4e-58GMP biosynthetic process
GO:00061643.1e-11purine nucleotide biosynthetic process
GO:00065413.9e-08glutamine metabolic process
GO:00038243.9e-08catalytic activity
GO:00090581.2e-06biosynthetic process
KEGG pathwayphu:Phum_PHUM3331400.0 
 K01951 (E6.3.5.2, guaA)maps-> Purine metabolism
    Drug metabolism - other enzymes
InterPro domain[23-208] IPR0047394.4e-58GMP synthase, N-terminal
[25-205] IPR0179261.1e-36Glutamine amidotransferase type 1
[231-396] IPR0147298.5e-30Rossmann-like alpha/beta/alpha sandwich fold
[459-620] IPR0016743.1e-11GMP synthase, C-terminal
[67-76] IPR0117023.9e-08Glutamine amidotransferase superfamily
[67-76] IPR0062201.2e-06Anthranilate synthase component II/delta crystallin
Orthology groupMCL13375 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209487-TA
ATGCACACAGCCAGTACCGAGGGACAGTATCGTGGTGCCAACGGAGCAGCCAACGGCAGAGACAAAGTCGCTATATTAGATGCCGGCTCACAATATGGAAAGGTGATAGATAGAAGGATCCGCGAGCTGTGCGTGGAATCTGAGATCCTTCCGCTGGACACTCCCGCGTATCATCTCAAGGAGGCTGGATACCGAGCGATCGTCATCTCAGGTGGACCGAACTCCGTGTACGCTGAAGACGCCCCCAGATATGATGCTGATATATTTAAGATCGGTCTGCCGGTGTTAGGCATATGTTATGGTATGCAAATGCTGAACAAGGAGTTTGGTGGTTCCGTACTGAGGAAGGAGGCTCGCGAGGACGGCCAGTATGAGGTGGAAGTTGAAACCACGTGCCCGTTGTTCAACCGCCTGGAAAAGCTGCAGCCAGTACTGCTGACCCACGGGGATAGCGTGCAGAAGGTCGGCGAACGGTTCCGGGTCGGGGCGCAGTCGTCGAACCATCTCATAGCGGCCATTTACAATGAGCAGATGAGGTTGTATGGGGTACAATTCCATCCGGAGGTGGACCTAACGCCCAAAGGCAAGCAGATGTTGTCAAACTTCTTATTCGACATCGCGGGTCTGTCTCGGACCTTCACACTGCGCTCACGCCGCGAAGCCTGCGTACAGTACATACGAGAGACGGTCGGGGATAATAAAGTTCTGGTCCTTGTCAGCGGTGGAGTCGACTCCACAGTTTGCGCCGCCTTATTAAGAACAGCACTGCGTGAAGATCAAGTCATCGCTTTACACATCGACAATGGTATCGTATCCACGGTAGTCCGCGAGGCGGGAGGTCGCACCCGTCACACTCCGCTGCTGTGTCACGCCACGGCTCCCGAGGACAAGAGGCGCATCATAGGAGACGTGTTCGTCAGGGTCGCCGAGCACGCCGTCAGGGACCTGCTGCAGCTGCAAGAGGAGCAAGTGCTGCTGGGTCAGGGAACTCTCCGACCGGACCTCATAGAGTCGGCGTCGGCGCTGGCGTCCGGAGCCGCGGCCGCCATCAAGACGCACCACAACGACACCGAGATGGTGCGCGCGCTGCGGCAGAGGGGGAGAGTCGTGGAGCCGCTCAGGGACTTCCACAAGGACGAGGTCCGTCAGCTCGGTACGGAGCTGGGTCTGCCGGCGGTGCTGGTGGAGCGACACCCCTTCCCCGGCCCGGGGCTGGCTGTGAGAGTTCTGTGCCAGGACGAGCCCTACGCGGACAGAGACTTCGCCGAGACACAGGTGATAGTTAAGATAATGGTTGAATACGCGTCCATGTGTGTGAAGTCTCATGCGTTGCTGGGTAGAGTCTCCAACGCCACCACCCCGGCCGAGCAGAGCGAGCTGAGGCGCATCTCGTCCGCGGGAGCCCTCGCCGCTACGCTGCTGCCCCTGAGATCGGTCGGGGTCCAAGGGGATCACAGAACGTATAGCTATGCGGTGGCGCTGTCCACGGAACGCTATCCGCCCGACTGGAAGGATATGAACTACCTCGCCAAGATCATACCACGAGTGTGCCACAACGTGAACAGGGTTTGTTATGCTTTCGGCGGTCTGATCAAGGAGCAGGTGACGGACATCACGCCAACTTTTCTTTCTCAACAAGTCATTTCCACCATACGGCAGGCGGACGACCTCGCTACACAGGTAACATTGTACTTGGTCCACAACTTGTCCAGCGGTCTGGGTGGCCTAATCTCCCAGATGCCCGTGGTTCTCGTTCCGGTCCACTTCGATCGTGACGCAGGGCTTCGGGCGCCGTCCTGCCAGCGCTCGCTGGTGCTGCGCCCCTTCATCACCAACGACTTCATGACGGGAGTGCCGGCTTTACCTGGCGAGCCCGCTATGCCGCAGGATGAAGCAAGATTGCTCCCGTTGAGCCTCGACCACACTGATGAGATTATATCCGTAGGTGGTGGACAGAATGCGCAAGGAGCTAATGACAGTGCCGGGTATATCGCGTGTGTTGTACGACCTGACTGCCAAGCCGCCGGCCACCACCGAGTGGGAATGATCACCCATCATGTACGCGGCAGTGGTCGACTACACACCACTATGCACCATCCCAGGAACGCCTTCTAA

Protein sequence:

>DPOGS209487-PA
MHTASTEGQYRGANGAANGRDKVAILDAGSQYGKVIDRRIRELCVESEILPLDTPAYHLKEAGYRAIVISGGPNSVYAEDAPRYDADIFKIGLPVLGICYGMQMLNKEFGGSVLRKEAREDGQYEVEVETTCPLFNRLEKLQPVLLTHGDSVQKVGERFRVGAQSSNHLIAAIYNEQMRLYGVQFHPEVDLTPKGKQMLSNFLFDIAGLSRTFTLRSRREACVQYIRETVGDNKVLVLVSGGVDSTVCAALLRTALREDQVIALHIDNGIVSTVVREAGGRTRHTPLLCHATAPEDKRRIIGDVFVRVAEHAVRDLLQLQEEQVLLGQGTLRPDLIESASALASGAAAAIKTHHNDTEMVRALRQRGRVVEPLRDFHKDEVRQLGTELGLPAVLVERHPFPGPGLAVRVLCQDEPYADRDFAETQVIVKIMVEYASMCVKSHALLGRVSNATTPAEQSELRRISSAGALAATLLPLRSVGVQGDHRTYSYAVALSTERYPPDWKDMNYLAKIIPRVCHNVNRVCYAFGGLIKEQVTDITPTFLSQQVISTIRQADDLATQVTLYLVHNLSSGLGGLISQMPVVLVPVHFDRDAGLRAPSCQRSLVLRPFITNDFMTGVPALPGEPAMPQDEARLLPLSLDHTDEIISVGGGQNAQGANDSAGYIACVVRPDCQAAGHHRVGMITHHVRGSGRLHTTMHHPRNAF-