Monarch geneset OGS2.0

DPOGS209638
TranscriptDPOGS209638-TA1086 bp
ProteinDPOGS209638-PA361 aa
Genomic positionDPSCF300015 + 1036728-1037813
RNAseq coverage742x (Rank: top 17%)
Annotation
HeliconiusHMEL0170410.089.53% 
BombyxBGIBMGA006703-TA3e-16478.92% 
DrosophilaGs1-PC9e-11053.85% 
EBI UniRef50UniRef50_P204771e-10753.85%Glutamine synthetase 1, mitochondrial n=42 Tax=Eukaryota RepID=GLNA1_DROME
NCBI RefSeqXP_002059120.18e-11053.85%GJ16218 [Drosophila virilis]
NCBI nr blastpgi|1954010362e-10853.85%GJ16218 [Drosophila virilis]
NCBI nr blastxgi|1954010361e-10653.85%GJ16218 [Drosophila virilis]
Group
Gene OntologyGO:00068079e-49nitrogen compound metabolic process
GO:00043569e-49glutamate-ammonia ligase activity
GO:00038243.3e-44catalytic activity
GO:00065421.5e-17glutamine biosynthetic process
KEGG pathwaydvi:Dvir_GJ162182e-109 
 K01915 (E6.3.1.2, glnA)maps-> Nitrogen metabolism
    Arginine and proline metabolism
    Alanine, aspartate and glutamate metabolism
    Two-component system
InterPro domain[114-347] IPR0081469e-49Glutamine synthetase, catalytic domain
[102-348] IPR0147463.3e-44Glutamine synthetase/guanido kinase, catalytic domain
[14-91] IPR0081471.5e-17Glutamine synthetase, beta-Grasp
Orthology groupMCL17669 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209638-TA
ATGGACAGATATATACGACTCCCCGTGCCTTGTAATAAGGTTTTAGCAACGTATTGCTGGATAGACGGTTCAGGTATTAATCTAAGATGTAAAGATAGAATTTTAAATTGCACTCCTTATAGCGCTGACGTGGCGCCAGGATGGGCATTTGATGGAAGTTCCACCGGTCAAGCCACCACAGCTAATTCTGACACTTCGTTAAAACCATGTGCGGTTTATCGAGATCCGTTCAGAATGGAACCTCATGTCCTTGTCTTGTGTGAAGTATATATGGGAGATGGTTCGCCGGCTTCCACTAACCATAGAAAATTTTGTAATGATTTATGCGAATTTCATAGAGCGGAAGAACCATGGTTTGGATTGGAACAAGAATATACTATGCTAGATGTCGACGGCTGGGGCTTGGGCTGGCCCAAAGGTGGTGGATTTCCTGCTGTTAATTACGAATTCTCTTATTGTGGAATCGGAGCTAAGTATATTGCAGGTCGGGATATTTGTGAAGCTCATACTAAAAGCTGTCTTTATGCGGGTTGTGATTTCGAAGGCACAAATGCGGAAGTAATGTTCGCTTGTTGGGAATGGCAAATTGGAACTACTATAGGAATAAAAGCCGCAGATGATATGTGGATGTCGCGATATATCATGGGTAGGATAGCTGAAGACTATGGTGTTGTCATTACTTATCATCCAAAACCCATGGGACCCAAGCATCCTGGCGTGGGCATGCATCACAATTTTAGTACTAAAAGAATGCGTTCAGATGGGGGTTATAAGTTTATTGAAGAATGTATCAAACGACTTGAACAAAATCACATGAAACACATGAAGAGCTACGGAAATGATGAAATGACTAACCGAATGCGTTTGTCTGGAAAGTTCGAGACTGCGCCCTTTGACAAGTTCTCATGGGGAATAGCAAATAGAAAGAGTTCGATTCGTTTGCAAAGAAATATAAAAGAAAAGGGTAAAGGCTTTATGGAGGACAGGAGACCAGCTGGGGACTGTGACCCTTACCTAGTTTGTGGCCTATTGATGGATACCTGTTTGGGATCTGCTGGTGGTGGCAAGGGTGGAAAATGCAAATAA

Protein sequence:

>DPOGS209638-PA
MDRYIRLPVPCNKVLATYCWIDGSGINLRCKDRILNCTPYSADVAPGWAFDGSSTGQATTANSDTSLKPCAVYRDPFRMEPHVLVLCEVYMGDGSPASTNHRKFCNDLCEFHRAEEPWFGLEQEYTMLDVDGWGLGWPKGGGFPAVNYEFSYCGIGAKYIAGRDICEAHTKSCLYAGCDFEGTNAEVMFACWEWQIGTTIGIKAADDMWMSRYIMGRIAEDYGVVITYHPKPMGPKHPGVGMHHNFSTKRMRSDGGYKFIEECIKRLEQNHMKHMKSYGNDEMTNRMRLSGKFETAPFDKFSWGIANRKSSIRLQRNIKEKGKGFMEDRRPAGDCDPYLVCGLLMDTCLGSAGGGKGGKCK-