Monarch geneset OGS2.0

DPOGS210060
Transcript: DPOGS210060-TA  6177 bp
Protein: DPOGS210060-PA  2058 aa
Genomic position: DPSCF300017 - 820857-851884
RNAseq coverage: 1489x (Rank: top 9%)
Annotation
Heliconius: HMEL002983  0.0  76.61%
Bombyx: BGIBMGA012701-TA  0.0  85.55%
Drosophila: CG9674-PF  0.0  69.82%
EBI UniRef50: UniRef50_E5SC90  0.0  55.42%  Glutamate synthase n=2 Tax=cellular organisms RepID=E5SC90_TRISP
NCBI RefSeq: NP_001041678.1  0.0  84.89%  glutamate synthase [Bombyx mori]
NCBI nr blastp: gi|115292419  0.0  84.89%  glutamate synthase [Bombyx mori]
NCBI nr blastx: gi|115292419  0.0  84.89%  glutamate synthase [Bombyx mori]
Group
Gene Ontology:
GO:0045181  0  glutamate synthase activity, NADH or NADPH as acceptor
GO:0005506  0  iron ion binding
GO:0050660  0  flavin adenine dinucleotide binding
GO:0016040  0  glutamate synthase (NADH) activity
GO:0010181  0  FMN binding
GO:0006537  1.8e-187  glutamate biosynthetic process
GO:0055114  1.8e-187  oxidation-reduction process
GO:0016639  1.8e-187  oxidoreductase activity, acting on the CH-NH2 group of donors, NAD or NADP as acceptor
GO:0008152  4.2e-187  metabolic process
GO:0003824  4.2e-187  catalytic activity
GO:0015930  2e-153  glutamate synthase activity
GO:0016638  2e-153  oxidoreductase activity, acting on the CH-NH2 group of donors
GO:0006807  4.5e-112  nitrogen compound metabolic process
GO:0016491  2.3e-110  oxidoreductase activity
GO:0051536  7.9e-39  iron-sulfur cluster binding
KEGG pathway: tca:658584  0.0
 K00264 (GLT1) maps-> Nitrogen metabolism
                      Alanine, aspartate and glutamate metabolism
InterPro domain:
[1-2055]  IPR012220  0  Glutamate synthase, eukaryotic
[1575-2000]  IPR006005  1.8e-187  Glutamate synthase, NADH/NADPH, small subunit 1
[831-1256]  IPR013785  4.2e-187  Aldolase-type TIM barrel
[845-1212]  IPR002932  2e-153  Glutamate synthase, central-C
[21-415]  IPR000583  5.6e-127  Glutamine amidotransferase, class-II
[500-785]  IPR006982  4.5e-112  Glutamate synthase, central-N
[1269-1537]  IPR002489  2.3e-110  Glutamate synthase, alpha subunit, C-terminal
[1575-1720]  IPR012285  7.9e-39  Fumarate reductase, C-terminal
[1575-1729]  IPR009051  2.3e-38  Alpha-helical ferredoxin
[1721-1917]  IPR023753  4.6e-13  Pyridine nucleotide-disulphide oxidoreductase, FAD/NAD(P)-binding domain
Orthology group: MCL15997  Insect specific

Nucleotide sequence:

>DPOGS210060-TA
ATGGAGTGGGAAGCACCACCCAAGCAAGGCTTGTACGACCCCCAAAACGAGCACGAGGCTTGCGGTGTTGGATTTGTTGTTGCTATCGACGGAAAACGTTCTCATAAGATCGTTCGCGATGCCGAAGTCCTCGCAAAGCGCATGGAACACCGTGGAGCGTGCGCCTGTGACAATGACACCGGCGATGGCGCTGGTGTGTTGACCGCCATCCCTCACCAGTTCTACTGCGCTCAGCTAAGAGACAGCCACCAAATCGATTTACCCCCGTTCGGGAAGTACGCCACCGGCATCTTCTTCCTGGACAAGCTTCACCATCAGGACATCGAGAAGAAGTTCCAGGAGTTAGCTGAGAGCCTCCACCTCCGCGTGATCTGCTGGAGGACTGTACCAACCAATAACGCCACTATCGGTCAAGTGGCTCGTAACTCGGAGCCGTATATGCGCCAAGTGTTTGTTACCGGAGATATCGGGGATGAACCTCAGCTAGCTCGTCAGCGACCCAGGATAGAAGGCACTCTTGAAGCATTTTTTTCAGATCGCCGTCGCTCGCCGGCCGCCGCTCCGCCGCCACTCGTCCACATCTTCGTGTTACGCAAGCGCGCCTCCCACGAGCTGGTGGTGCCGGGAGCCCGCTTCTATATTTGCAGCTTATCTCTGAGGACCGTCGTTTACAAGGGACTCCTCACATCCAATCAGCTATGGGAATACTTCAAGGATCTGAGCAACCCGGCGTTCACTACATACCTGGCGCTGGTCCACACTCGCTTCTCAACCAACACCTTCCCCAGCTGGGAGAGAGCCCATCCTTTGAGAGTGCTTGCCCACAATGGAGAAATAAATACATTACGAGGTAACGTCAACCTGATGAAGGCCCGGGAAGGTGTCATGAAGAGTGATATATTTGGAGATGAACTAAAGAAGCTGTACCCGGTGGTGGAACCCAACTTATCTGACTCCGGCTCAGCCGATTGTGTGCTGGAGTTCCTCCTGCACTGCGGTCATAGATCTTTGCCGGAAGCTGTCATGACTATGGTACCGGAAGCATGGCATAATGACGTCACCATGCCCGCAGAAAAGAGGGATTACTACCAATGGGCAGCTTGTGCCATGGAGCCATGGGATGGGCCAGCTCTAGTGTCCTTTACTGATGGAAGATATATTGGCGCCATTCTGGACAGAAATGGTCTGAGACCATCCAGGTTCTATGTCACGAGTGAGAACATTCTCGTGATGGCCTCGGAGGTGGGAGTCTATGATGTGGATCCGGAGAAGGTTATCCTCAAGAGTCGTCTGAAGCCAGGCCGCATGTTGTTAGTGGACACCGAGGAGAAGAGAATCATACAAGACGTGGAGCTGAAGATGGACATCGCCAGGAGCAGACCGCACTCACAGTGGCTCAAGGAGCAGATAACAATGGAGGATATTTACAAATCGGTGTCCCAGAGCGATTTGTCCAGTAATGGGTGTGTGAATGGAGCTATATCTGGTCTGGGGGACAAGAGGCTCGGTCTGTTTGGATACACCATAGAGTCCATCAACATGCTGCTACTGCCCATGATACAGAACAAGAAAGAGGCTCTCGGCAGCATGGGGAACGACGCGCCCCTGGCTTGTCTGTCTCGTTTCGAACCCCTGCCCTACGACTACTTCAAGCAGCTGTTCGCACAGGTCACCAACCCACCCATCGACCCCTTCCGTGAGAAGATCGTGATGTCCCTGATGTGTCCGATCGGCCCGGTGGCCAACATCCTGCGGCCCGGCGCGGAGTTCGTGCACCGCCTGTTCCTGCCGCAGCCCGTGCTGTCCATCCCGGACCTCAAGGCGCTGATCGCCACCACACATCGCGGCTGGAGGACGAAGGTTATCGATTGTACGTTCGATATATCCGACGGTCCCATGGGCCTGGAGCCGGCCCTGACTCGCCTGGCGGGGGAGGCGCATGACGCTGCTGAGGACGGCTACCAGCTGCTGGTGCTCTCCGACCGCCAGGCCGGGCCAAC
TAGGGTCCCGATCAGCTCGTTACTCGCCCTGGGTGCGGTCCATCACCACCTTATCGAGACCCGTCAGCGCATGAAGGTCGGGGTCCTCGTGGAGACGGCCGAGGCCAGGGAAGTCCATCACATGTGTGTCTTACTCGGTTATGGAGCCGACGCTATATGTCCCTACCTGGCCTTCGAACTAGCGTTCTCTCTTCGAAACGACAACCTCATAGATCCAAACTTAACGGACAGCGACATATACCTGGCGTACCAGAAGGCCATAGAGACGGGTCTGGCTAAAGTCATGGCCAAAATGGGCATCAGCATGCTGCAGTCGTACAAAAGCGCCCAGATATTCGAGGCGGTCGGCCTCAGCGAGGAAGTTATCGACAAATGCTTCAGAGGCACTCAGTCTAGAATCGGTGGGATTACCTTTGAGATTCTCTCTCAGGAGACTTTCGACCGACACGCCCTCACGTACGGCAACTGTAACGACATGCTGGTGCTACGTAACCCCGGCAACTACCACTGGAGGGCTGGCGGGGAGAAGCATATCAACGACCCGCTGTCCATCGCCAACCTTCAGGAGGCCGCCGTCAACAACACGGCTTCCGCCTACGACAGATTCAGAGAGAGCGCTCTGGAATCTATCCGCGCCTGTACACTCCGAGGTCAACTCGAGCTGGTCACCCTGGACGAACCCCTGCCGCTATCAGAGATAGAACCGGCTTCGGAGATAGTCAAGAGATTCGCTACTGGCGCGATGTCCTTCGGTTCAATATCTATGGAGGCTCACTCCACACTGGCCATCGCCATGAACAAGATCGGAGGAAAGTCCAACACCGGGGAGGGGGGAGAAACCGCTGAGAGATATCTCAACCAGGATCCAGACCACAACATGCGCTCGGCGATCAAGCAGGTGGCGTCCGGGAGGTTCGGAGTGACGGCGTCATACCTCGCACATGCTGACGACCTGCAGATTAAGATGGCGCAGGGAGCCAAGCCGGGGGAGGGCGGGGAACTACCCGGGTACAAAGTAACAGAGGAGATAGCTCGTACCCGTTGTTCAGTTCCGGGGGTAGGGCTGATCTCCCCGCCTCCTCACCATGATATATACTCCATCGAGGACCTCGCCGAGCTGATCTACGATCTCAAATGTGCCAACCCTAAAGCACGCATCAGTGTGAAACTGGTCTCGGAGGTTGGAGTGGGAGTAGTGGCTTCCGGTGTCGCTAAGGGTAAAGCGGAACACATCGTCATATCCGGCCACGACGGAGGCACTGGGGCCAGTTCCTGGACGGGCATCAAGAGCGCGGGGCTGCCCTGGGAGTTGGGCGTGGCGGAAACACACCAGGTGCTCGTATTGAACGACCTGAGGTCGCGTGTGGTGGTCCAAGCTGACGGTCAGATCCGCACCGGCTTCGACGTGATAGTGGCGGCGCTGCTGGGAGCTGACGAGGTCGGCTTCAGCACCGCGCCATTGATAGCGTTAGGCTGCACTATGATGAGGAAGTGTCACCTGAACACCTGTCCGGTTGGCATCGCGACTCAGGACCCCGTGTTGAGGAAGAAGTTCGCGGGGAAGCCGGAACACGTCATTAACTACCTGTTCATGTTGGCTGAGGAGGTCCGTACGCACATGTCACGCGTGGGTGTCCGGAGCTTCCAAGAGCTGGTCGGTCGGACAGACCTCCTGAAGGTGAGGGAGAAGAACGACAACTACAAAGCGCGGCTCCTCAACCTCGCTCCCATACTGAAGAACGCGTTACACATGAGGCCGGGCGTCGACATACGAGGCGGCTCTAAACCACAGGACTTCCAGCTGGAGAAACGCCTGGACAACCAGCTGATCCAGCAGTGCTCTGGAATACTGGACGGAACCCAACAACACGTACACATCGACATGAAGATCACTAACGAAGACCGAGCTTTCACTTCGACACTCTCCTATCATATTGCCATGCAGTATGGAGATTCCGGTCTCCCTGATGGCACCACGGTGGACATCAGCCTCACCGGCT
CAGCCGGACAAAGCTTCTGTGCCTTCCTCAGCAAAGGAATCACCGTCACCTTGGAAGGAGACGCCAATGATTACGTCGGCAAGGGCCTCTCCGGCGGCACGGTCATCATCTACCCTCCAAAGAACTCTCCATTCCAATCTCACTTGAATGTGATCGTTGGGAACGTTTGTTTGTATGGAGCTACGAGTGGAAGGGCTTATTTCAGGGGTATAGCGTCTGAGCGTTTCTGTGTCCGTAACTCTGGTTGTGTGGCGGTGTCGGAGGGCGCGGGCGACCACGGCTGCGAGTACATGACGGCTGGCAGGGTGCTCATACTAGGACTGGTGGGGAGGAACTTCGCCGCGGGGATGAGCGGTGGTATAGCGTACGTGTACGACATCGATGGGTCATTCAAGAGCAAATGTAATCCGGAGATGGTGGAACTGCTGCCACTGGAAATACAAGAAGACTTGGACGAGGTGCAGAAACTTCTAGAAGAATTCGTGGAGTATACCGGATCATTAATCGCTAAAGAACTCCTAGAGACCTGGCCGGAACCAGCTAAGAAGTTCACGAAGGTGTTCCCTTACGAATACCAGCGCGCCTTGAAACAGATCGCTCTCAAGCAGACGGCGCCCAAGGTGGAAACTAACGGAAAGCTTGAAGAAAACGGAGTCGTCGATATAGAAGAAGCTGTCAGAAACGTGGAACAGGACAAGAAGAACCTGGAGAAGGTTCTAGATAAGACCAGAGGATTTATAAAGTATCCCCGCGAGACGTCAGTGTACCGGCCGGCTGAGAAGCGTCTCCGTGACTGGGAGGAGATCTACGACCAGTCGTCCGTGAGGCGCGGCCTGAGAGTGCAGGCTGCTCGCTGCATGGAGTGCGGGGTGCCGTTCTGTCAGAGCGGCCACGGCTGCCCTCTAGGGAACATCATACCCAAGTGGAACGATCTCGTGTACAGGGCCGACTGGAAACAAGCTCTGGCACAACTCTTGCAGACTAATAATTTCCCAGAGTTCACGGGTCGCGTGTGTCCCGCGCCGTGCGAGGGCGCCTGTGTGCTCGGCATCTCCGAACCTCCCGTCACCATCAAGAACATCGAGTGCGCCATCATTGACCACGCCTTCAGCAGCGGATGGGTTCAACCGGAGATCCCGGAGTATCGTAACGGTAAGACAGTCGCCATCGTGGGGTCGGGGCCGGCCGGCCTGGCCTGCGCTCATCAGCTAAACAAGGCCGGTTACTCTGTGACGGTGTTCGAGCGCAACGACCGTCCGGGCGGCCTGCTCCAGTACGGCATCCCCAGCATGAAGCTCAGCAAGCACGTGGTGCAGCGGAGGATCAAGCTCATGATGGACGAGGGGGTCGTGTTCAAGTGCAACGTGGACGTCGGCAAGGACATCTCCGCCGCGGACCTCGCCAACGAGTACGACGCGCTAGTCCTGTGTATGGGCGCGACGTGGCCGCGGGATCTCCCTCTGAGCGGCCGGCAGCTGGGCGGCATACACTTCGCCATGGAGTTCCTCGAGGGCTGGCAGAAGAAACAGGCGGGCGGCGGCACCGGCAAACTACCCGCGCTCAGCGCCAAGGACAAGAACGTACTCGTTATAGGGGGAGGGGACACCGGATGCGACTGTATAGCGACGTCTCTACGCCAGGGCGCCAAATCTATAACGACCTTCGAGATACTTCCCGAGCCCAAACCCACGCGGACCAAGGAGAACCCGTGGCCGCAGTGGCCGAGGGTCTTCCGAGTGGACTACGGCCATGAGGAGGTGAAAGTGAAATTCGGTCACGACCCTAGAAAATTCTCGACTCTCACCAAGGAGTTCCTCGACGATGGCGAGGGCAACGTGTCGGGGGTGAGTGCGGTGGAAGTGGAGTGGACGCGCGGTCCGGGCGGGAGGTGGGAGATGGCCGAGAAGGACGGCTCCAAGCGAGTCGTTCCGTGCGACCTGGTCCTCCTCGCCATGGGCTTCCTGGGACCTGAGAGATACGTCGCCTCGCAACTCGGGTGT
ATTGCAAAAATAAACAAAAAGCGCTTGCTTGGGAAATTTAATGAAAGAGCTATAATAAGATCAAGCAATACGAGAGCCGATAGTGCAGATCTACGTAAAGCAAATCCATACGTGGTAGCAACTTTCTTTGCGTGTGACCCTCACTCGCTCTTCGCAGTTTCGGAATGCAAAAACTAA

Protein sequence:

>DPOGS210060-PA
MEWEAPPKQGLYDPQNEHEACGVGFVVAIDGKRSHKIVRDAEVLAKRMEHRGACACDNDTGDGAGVLTAIPHQFYCAQLRDSHQIDLPPFGKYATGIFFLDKLHHQDIEKKFQELAESLHLRVICWRTVPTNNATIGQVARNSEPYMRQVFVTGDIGDEPQLARQRPRIEGTLEAFFSDRRRSPAAAPPPLVHIFVLRKRASHELVVPGARFYICSLSLRTVVYKGLLTSNQLWEYFKDLSNPAFTTYLALVHTRFSTNTFPSWERAHPLRVLAHNGEINTLRGNVNLMKAREGVMKSDIFGDELKKLYPVVEPNLSDSGSADCVLEFLLHCGHRSLPEAVMTMVPEAWHNDVTMPAEKRDYYQWAACAMEPWDGPALVSFTDGRYIGAILDRNGLRPSRFYVTSENILVMASEVGVYDVDPEKVILKSRLKPGRMLLVDTEEKRIIQDVELKMDIARSRPHSQWLKEQITMEDIYKSVSQSDLSSNGCVNGAISGLGDKRLGLFGYTIESINMLLLPMIQNKKEALGSMGNDAPLACLSRFEPLPYDYFKQLFAQVTNPPIDPFREKIVMSLMCPIGPVANILRPGAEFVHRLFLPQPVLSIPDLKALIATTHRGWRTKVIDCTFDISDGPMGLEPALTRLAGEAHDAAEDGYQLLVLSDRQAGPTRVPISSLLALGAVHHHLIETRQRMKVGVLVETAEAREVHHMCVLLGYGADAICPYLAFELAFSLRNDNLIDPNLTDSDIYLAYQKAIETGLAKVMAKMGISMLQSYKSAQIFEAVGLSEEVIDKCFRGTQSRIGGITFEILSQETFDRHALTYGNCNDMLVLRNPGNYHWRAGGEKHINDPLSIANLQEAAVNNTASAYDRFRESALESIRACTLRGQLELVTLDEPLPLSEIEPASEIVKRFATGAMSFGSISMEAHSTLAIAMNKIGGKSNTGEGGETAERYLNQDPDHNMRSAIKQVASGRFGVTASYLAHADDLQIKMAQGAKPGEGGELPGYKVTEEIARTRCSVPGVGLISPPPHHDIYSIEDLAELIYDLKCANPKARISVKLVSEVGVGVVASGVAKGKAEHIVISGHDGGTGASSWTGIKSAGLPWELGVAETHQVLVLNDLRSRVVVQADGQIRTGFDVIVAALLGADEVGFSTAPLIALGCTMMRKCHLNTCPVGIATQDPVLRKKFAGKPEHVINYLFMLAEEVRTHMSRVGVRSFQELVGRTDLLKVREKNDNYKARLLNLAPILKNALHMRPGVDIRGGSKPQDFQLEKRLDNQLIQQCSGILDGTQQHVHIDMKITNEDRAFTSTLSYHIAMQYGDSGLPDGTTVDISLTGSAGQSFCAFLSKGITVTLEGDANDYVGKGLSGGTVIIYPPKNSPFQSHLNVIVGNVCLYGATSGRAYFRGIASERFCVRNSGCVAVSEGAGDHGCEYMTAGRVLILGLVGRNFAAGMSGGIAYVYDIDGSFKSKCNPEMVELLPLEIQEDLDEVQKLLEEFVEYTGSLIAKELLETWPEPAKKFTKVFPYEYQRALKQIALKQTAPKVETNGKLEENGVVDIEEAVRNVEQDKKNLEKVLDKTRGFIKYPRETSVYRPAEKRLRDWEEIYDQSSVRRGLRVQAARCMECGVPFCQSGHGCPLGNIIPKWNDLVYRADWKQALAQLLQTNNFPEFTGRVCPAPCEGACVLGISEPPVTIKNIECAIIDHAFSSGWVQPEIPEYRNGKTVAIVGSGPAGLACAHQLNKAGYSVTVFERNDRPGGLLQYGIPSMKLSKHVVQRRIKLMMDEGVVFKCNVDVGKDISAADLANEYDALVLCMGATWPRDLPLSGRQLGGIHFAMEFLEGWQKKQAGGGTGKLPALSAKDKNVLVIGGGDTGCDCIATSLRQGAKSITTFEILPEPKPTRTKENPWPQWPRVFRVDYGHEEVKVKFGHDPRKFSTLTKEFLDDGEGNVSGVSAVEVEWTRGPGGRWEMAEKDGSKRVVPCDLVLLAMGFLGPERYVASQLGC
IAKINKKRLLGKFNERAIIRSSNTRADSADLRKANPYVVATFFACDPHSLFAVSECKN-
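
The lengths listed in the record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the helper names are hypothetical, the numbers are copied from the record, and it assumes that the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
# Values copied from the record above; helper names are hypothetical.
TRANSCRIPT_BP = 6177
PROTEIN_AA = 2058
GENOMIC_START, GENOMIC_END = 820857, 851884


def cds_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of protein_aa residues."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def genomic_span_bp(start: int, end: int) -> int:
    """Span in bp between two coordinates (assumption: 1-based, inclusive)."""
    return end - start + 1


# True if the transcript length equals the protein length plus one stop codon.
print(cds_length_bp(PROTEIN_AA) == TRANSCRIPT_BP)
# Genomic span covered by the gene model under the coordinate assumption.
print(genomic_span_bp(GENOMIC_START, GENOMIC_END))
```

If the first check holds, the listed transcript is likely the coding sequence alone, stop codon included and without untranslated regions. A genomic span larger than the transcript length would be consistent with introns in the gene model.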