Monarch geneset OGS2.0

DPOGS207944
TranscriptDPOGS207944-TA816 bp
ProteinDPOGS207944-PA271 aa
Genomic positionDPSCF300090 - 256496-262624
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0142086e-11774.63% 
BombyxBGIBMGA004280-TA2e-8954.37% 
DrosophilaGs2-PB2e-8351.89% 
EBI UniRef50UniRef50_P204774e-7550.18%Glutamine synthetase 1, mitochondrial n=42 Tax=Eukaryota RepID=GLNA1_DROME
NCBI RefSeqXP_312603.43e-8352.65%AGAP002355-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123817412e-8253.03%hypothetical protein AND_05890 [Anopheles darlingi]
NCBI nr blastxgi|3479676905e-8352.65%AGAP002355-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00038246e-31catalytic activity
GO:00068071.9e-28nitrogen compound metabolic process
GO:00043561.9e-28glutamate-ammonia ligase activity
GO:00065429.2e-14glutamine biosynthetic process
KEGG pathwayaga:AgaP_AGAP0023558e-83 
 K01915 (E6.3.1.2, glnA)maps-> Nitrogen metabolism
    Arginine and proline metabolism
    Alanine, aspartate and glutamate metabolism
    Two-component system
InterPro domain[116-259] IPR0147466e-31Glutamine synthetase/guanido kinase, catalytic domain
[125-262] IPR0081461.9e-28Glutamine synthetase, catalytic domain
[27-103] IPR0081479.2e-14Glutamine synthetase, beta-Grasp
Orthology groupMCL34724 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207944-TA
ATGAATACAGTTCCCCTACCTCTAAATAAAGCTGCTATGAGAAAGTATGAGGACCTAGAAGTGCCTTGTGAATCTGTTTTAGCGACGTACGTGTGGGTTGACGGCACAGGCATAAATTTGCGTTCCAAGGATAGAACATTCGACTTTGTGCCAAAAATCAATAAAGATCTGCCAATATGGTATTTCGACGGCAGCAACACAGCCCAGGCGGCAACAGACAATTCTGACACGTTTATCTTCCCCCAAGTGATTTATCAAGATCCTTTCCGACGAGGAAGCCACATTATGGTCCTAGCTGACACCTATCAACACAATTACCAGCCAACTGCTTCAAATTACCGCAAAGAATGTACACTGACTTGCGAGAAAGGTGAAGTCGAAGAGCCTTGGTTTGGTTTCAATCAAGAATTTTTCCTGACTACTCCTGACGGTAGACCTTTGGGTTGGCCTCCGGGAGGTTTTCCTGCGCCTCCTGGTCCTTATTACTGCGCGATAGGTGCTAACAAAATTGTTGCTCGAGATCTCATGGAAGCATTTTACAGATGTTGTCTTTATGCCGGAGTGCATATCAACGGTATTAACCCGGGGACCACGCCTTCCCAATGGAATTTTCAGGTTGGACCATCTCCTGGCATTACGGCAGCCGATGATTTGTGGATGGCTCGCTACATACTCAGTAGGTTAGCTGAAGAATACGGGACTGTTGCTACATTTGAACCACAACCAGTACCCGATTGGCCCGGCAACGGAACGTTCGCTTACTTTTCTTCCAAGGACATGAGGGAGGAAGACGGAATATTGCAAGTTGAATTTTGA

Protein sequence:

>DPOGS207944-PA
MNTVPLPLNKAAMRKYEDLEVPCESVLATYVWVDGTGINLRSKDRTFDFVPKINKDLPIWYFDGSNTAQAATDNSDTFIFPQVIYQDPFRRGSHIMVLADTYQHNYQPTASNYRKECTLTCEKGEVEEPWFGFNQEFFLTTPDGRPLGWPPGGFPAPPGPYYCAIGANKIVARDLMEAFYRCCLYAGVHINGINPGTTPSQWNFQVGPSPGITAADDLWMARYILSRLAEEYGTVATFEPQPVPDWPGNGTFAYFSSKDMREEDGILQVEF-