Monarch geneset OGS2.0

DPOGS200911
Transcript        DPOGS200911-TA  1371 bp
Protein           DPOGS200911-PA  456 aa
Genomic position  DPSCF300066 + 381027-385854
RNAseq coverage   5093x (Rank: top 2%)
Annotation
Heliconius      HMEL012223        3e-100  81.65%
Bombyx          BGIBMGA000680-TA  0.0     81.53%
Drosophila      Gs2-PB            2e-130  64.63%
EBI UniRef50    UniRef50_P20477   9e-123  67.31%  Glutamine synthetase 1, mitochondrial n=42 Tax=Eukaryota RepID=GLNA1_DROME
NCBI RefSeq     XP_967731.1       8e-141  74.11%  PREDICTED: similar to AGAP002355-PA [Tribolium castaneum]
NCBI nr blastp  gi|380016995      2e-144  60.19%  PREDICTED: glutamine synthetase 2 cytoplasmic-like isoform 2 [Apis florea]
NCBI nr blastx  gi|380016995      2e-143  60.19%  PREDICTED: glutamine synthetase 2 cytoplasmic-like isoform 2 [Apis florea]
Group
Gene Ontology  GO:0006807  3.1e-63  nitrogen compound metabolic process
               GO:0004356  3.1e-63  glutamate-ammonia ligase activity
               GO:0003824  1.3e-60  catalytic activity
               GO:0006542  1.4e-14  glutamine biosynthetic process
KEGG pathway  tca:656087  2e-140
              K01915 (E6.3.1.2, glnA) maps->
                  Nitrogen metabolism
                  Arginine and proline metabolism
                  Alanine, aspartate and glutamate metabolism
                  Two-component system
InterPro domain  [205-451]  IPR008146  3.1e-63  Glutamine synthetase, catalytic domain
                 [211-452]  IPR014746  1.3e-60  Glutamine synthetase/guanido kinase, catalytic domain
                 [138-197]  IPR008147  1.4e-14  Glutamine synthetase, beta-Grasp
Orthology group  MCL13090  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200911-TA
ATGAATGATGAACGTTTGGCGAATGTAATAATAGAAGGCATCACAGATCACAGGAAAGATAATCCTAAAATCCTTTCGGGGCCGGTGTTGACAAACTCTCCGAACGCAGTATTATCCAAGACCCTATTGGGACGCTACAATGATCTTCCTTTACCGGCGGACAAGATCATAGCGACTTACGTATGGATCGACGGCACAGGAGAACATCTAAGATGCAAGGACCGCACTCTTTCATTCATTCCGAAAGCACCAAAAGACAAATTGACGGAAGTTATAGAAGATCTGAATAATAAATACCGTTTGGAGTTCTATGACACATTCGAGAGAACTCACGAACATGATTACTTGCCGCCATATTTTTTTCCGAAGGATCTAGACGGAATATTTGACATACAATGCAAGTGGAGGAAACGAGTGAATAGGGAGAGCTTTGATGAGAATTTAAAAGATCTGCCGATTTGGAATTACGACGGTAGCTCTACCGGCCAAGCCGATGGTCACAACTCCGATACTTTCCTGGTGGCTCGTGCCATTTACAAAGACCCCTTCAGACGTGGCAATCATATCTTGGTTATGTGCGACACGTACAAATATAACATGGAGCCGACAGAAAGCAACCACCGGTTGAAATGCCAGGAAGCTTACGAGAAGTGCAAAGATGACGAGCCCTGGTTCGGTATCGAGCAAGAGTATATTCTCCTGGACTCGGATCTGAGGCCCTTCGGCTGGCCACCCGGCGGCTTCCCCCCACCACAGGGCCCCTACTACTGTGGAGTTGGCGCGAACAAAGTTTTCGCCAGAGATCTCGTTGAAGCTCATTACAAGTGCTGTTTGTATGCCGGCGTTCCTATTGCCGGTACTAATGCTGAGGTCATGCCCTCGCAATGGGAGTTCCAAGTGGGTCCCTCAGTTGGCGTGTCTGCTGGTGATGATCTGTGGGTAGCGAGATACATCCTTCATAGGCTGGCCGAGGAGTATGGAGTGATCGTGACTTTCGACCCTAAGCCCGTGCAAGAGTGGAACGGCTCCGGTGCTCATACCAACTTCTCCACCAAGAAAATGAGGGAGGATAATGGAATTATTGAGATAGAGAAGGCCATCGACAAGCTGTCCAAGGTCCACATGAAGCACATCAAGGTTTACGACCCGCGCGGCGGCAAAGACAACGAGCGTCGTCTGACCGGCCTTCATGAAACCGCCAGTATTAACGATTTTAGTGCTGGTGTCGCGAGCCGCACAAGCAGTATCAGAATACCGCGTGCCGTAGCTGAGGAAAAGAAGGGCTACCTCGAAGATCGCCGACCGGCGTCCAACTGCGATCCTTACGCGGTCATCGACGCTCTTATGCGCACCTGTGTACTGAACGAATAA

Protein sequence:

>DPOGS200911-PA
MNDERLANVIIEGITDHRKDNPKILSGPVLTNSPNAVLSKTLLGRYNDLPLPADKIIATYVWIDGTGEHLRCKDRTLSFIPKAPKDKLTEVIEDLNNKYRLEFYDTFERTHEHDYLPPYFFPKDLDGIFDIQCKWRKRVNRESFDENLKDLPIWNYDGSSTGQADGHNSDTFLVARAIYKDPFRRGNHILVMCDTYKYNMEPTESNHRLKCQEAYEKCKDDEPWFGIEQEYILLDSDLRPFGWPPGGFPPPQGPYYCGVGANKVFARDLVEAHYKCCLYAGVPIAGTNAEVMPSQWEFQVGPSVGVSAGDDLWVARYILHRLAEEYGVIVTFDPKPVQEWNGSGAHTNFSTKKMREDNGIIEIEKAIDKLSKVHMKHIKVYDPRGGKDNERRLTGLHETASINDFSAGVASRTSSIRIPRAVAEEKKGYLEDRRPASNCDPYAVIDALMRTCVLNE-
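
Consistency check (sketch):

The two sequence records should agree with the lengths listed for this gene: 1371 bp is 457 codons, which is 456 residues plus one stop codon. The Python sketch below shows one way to check this. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon. The function name `translate` and the short test fragments (the first and last codons of DPOGS200911-TA) are chosen here for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, which is assumed to match
    the trailing '-' used in the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First nine codons of DPOGS200911-TA
print(translate("ATGAATGATGAACGTTTGGCGAATGTA"))  # MNDERLANV
# Last four codons of DPOGS200911-TA, including the stop codon
print(translate("CTGAACGAATAA"))  # LNE-
# Listed lengths: 1371 bp should equal 3 * (456 aa + 1 stop codon)
print(1371 == 3 * (456 + 1))  # True
```

Passing the full nucleotide record to `translate` and comparing the result with the protein record would extend the same check to the whole sequence.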