Monarch geneset OGS2.0

DPOGS202178
TranscriptDPOGS202178-TA1230 bp
ProteinDPOGS202178-PA409 aa
Genomic positionDPSCF300162 + 277921-285342
RNAseq coverage741x (Rank: top 17%)
Annotation
HeliconiusHMEL0108910.085.57% 
BombyxBGIBMGA003319-TA0.083.73% 
DrosophilaGot1-PB6e-14960.60% 
EBI UniRef50UniRef50_E2BMX01e-16366.00%Aspartate aminotransferase n=3 Tax=Coelomata RepID=E2BMX0_HARSA
NCBI RefSeqXP_001601449.12e-16364.69%PREDICTED: similar to ENSANGP00000011707 [Nasonia vitripennis]
NCBI nr blastpgi|3071808003e-16562.35%Aspartate aminotransferase, cytoplasmic [Camponotus floridanus]
NCBI nr blastxgi|3072040532e-15966.00%Probable aspartate aminotransferase, cytoplasmic [Harpegnathos saltator]
Group
Gene OntologyGO:00065203.2e-255cellular amino acid metabolic process
GO:00084833.2e-255transaminase activity
GO:00038241.2e-109catalytic activity
GO:00301701.2e-109pyridoxal phosphate binding
GO:00167693.6e-84transferase activity, transferring nitrogenous groups
GO:00090583.6e-84biosynthetic process
KEGG pathwaynvi:1001206556e-163 
 K00813 (E2.6.1.1B, aspC)maps-> Novobiocin biosynthesis
    Arginine and proline metabolism
    Alanine, aspartate and glutamate metabolism
    Isoquinoline alkaloid biosynthesis
    Phenylalanine metabolism
    Cysteine and methionine metabolism
    Carbon fixation in photosynthetic organisms
    Tyrosine metabolism
    Phenylalanine, tyrosine and tryptophan biosynthesis
    Tropane, piperidine and pyridine alkaloid biosynthesis
InterPro domain[3-406] IPR0007963.2e-255Aspartate/other aminotransferase
[1-405] IPR0154242.1e-119Pyridoxal phosphate-dependent transferase, major domain
[48-321] IPR0154211.2e-109Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[29-398] IPR0048393.6e-84Aminotransferase, class I/classII
Orthology groupMCL13103 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202178-TA
ATGGGCTCCCGATTTCAAGTTGTAGAACAAGGTCCTCCTATCGAAGTTTTTCAATTAAATAAGGCTTTCACGGAGGATTCTTATAAAAATAAAGTCAATCTTAGCGTTGGAGCCTACAGAGATGAGAATGGCAAACCCTGGGTTCTGCCAGTTGTGCGGAAAATGGAGAAACAAATGGCAGAAGATGAATCTTTATTACATGAATACCTGCCCGTCCTTGGACTCGACGCTTTCACCGAAGCTTCAGTCTCTATGCTGCTTGGAAAGGACAACCCTGCGATAGCTGAGGGTCGTGCATTCGGTGTGCAAACACTGTCTGGAACAGGCGGTCTCCGGGTCGGAGCTGAACTCCTGAACAAGCATCTGAAGTACGACACGTTCTACTACTCGAATCCAACATGGGAAAACCATCACTTGGTGTTCGTGAACTCTGGGTTCACAAATCCAAGGACGTATCGCTACTGGGACGAGAAGACTCTCTCTATAGACTTCGACGGTCTCATAGAGGATCTCAAGAACGCTCCAGAGAATTCCGTCATACTGTTACACGCGTGCGCACACAACCCCACAGGCATAGACCCCTGCCATGAACAATGGGAGAAGATCGCTGATGTCATGGAGGAGAGGAAGTTGTTCCCGTTCTTCGACAGCGCGTACCAAGGCTTCGCGTCCGGAGACCTGGACCGAGACGCCTGGGCCGTGCGCTACTTCGTCAAGAGAGGCTTCGAGCTGGTCTGCGCACAGTCCTACGCCAAAAACTTCGGATTATACAACGAGCGTGTCGGTAACCTGACCGTAGTGCTGTCTGAGTCTAGTCACGTGGCTCCGCTCAAGTCACAGCTGACGTGGATCGTGCGAGGGATGTACTCCAACCCTCCAGCGCACGGGGCGAGGGTTGTCGCTCAAGTGTTGCGGAACGATGTGCTCTTTGATCTGTGGAGAGACCACATCAAGTTCATGTCCTCGAGGGTTATGCAGATGAGAGAAGCTCTGAGAGCGGAACTAATTAAGTTGGGCACCCCCGGCAACTGGGACCACATCGTCAAACAGATCGGTCTGTTCTCCTACACGGGTCTGAGCCGGCGTCAGTCGGAGCACCTGATCCAGGAACACCACATCTACCTTCTCAGAACCGGCCGCATCAACATCTGCGGCCTCAACCCCGGCAACGTGCAGTACGTGGCGCGCGCCATCAACGACGCCATCACCAAGTTCCCCACGGACCAGTAA

Protein sequence:

>DPOGS202178-PA
MGSRFQVVEQGPPIEVFQLNKAFTEDSYKNKVNLSVGAYRDENGKPWVLPVVRKMEKQMAEDESLLHEYLPVLGLDAFTEASVSMLLGKDNPAIAEGRAFGVQTLSGTGGLRVGAELLNKHLKYDTFYYSNPTWENHHLVFVNSGFTNPRTYRYWDEKTLSIDFDGLIEDLKNAPENSVILLHACAHNPTGIDPCHEQWEKIADVMEERKLFPFFDSAYQGFASGDLDRDAWAVRYFVKRGFELVCAQSYAKNFGLYNERVGNLTVVLSESSHVAPLKSQLTWIVRGMYSNPPAHGARVVAQVLRNDVLFDLWRDHIKFMSSRVMQMREALRAELIKLGTPGNWDHIVKQIGLFSYTGLSRRQSEHLIQEHHIYLLRTGRINICGLNPGNVQYVARAINDAITKFPTDQ-