Monarch geneset OGS2.0

DPOGS211865
Transcript        DPOGS211865-TA   1275 bp
Protein           DPOGS211865-PA   424 aa
Genomic position  DPSCF300011 - 1063814-1071312
RNAseq coverage   630x (Rank: top 20%)
Annotation
Heliconius       HMEL004810         3e-159   74.46%
Bombyx           BGIBMGA001237-TA   0.0      80.28%
Drosophila       CG1673-PA          2e-142   55.89%
EBI UniRef50     UniRef50_B2DBI4    0.0      77.14%   Branched-chain-amino-acid aminotransferase n=1 Tax=Papilio xuthus RepID=B2DBI4_9NEOP
NCBI RefSeq      XP_002071154.1     1e-145   58.99%   GK25289 [Drosophila willistoni]
NCBI nr blastp   gi|183979382       0.0      77.14%   branched-chain-amino-acid transaminase [Papilio xuthus]
NCBI nr blastx   gi|183979382       0.0      77.14%   branched-chain-amino-acid transaminase [Papilio xuthus]
Group
Gene Ontology    GO:0004084   1.2e-176   branched-chain-amino-acid transaminase activity
                 GO:0008152   1.2e-176   metabolic process
                 GO:0009081   1.2e-176   branched chain family amino acid metabolic process
                 GO:0003824   1.2e-176   catalytic activity
KEGG pathway     dwi:Dwil_GK25289   4e-145
                 K00826 (E2.6.1.42, ilvE) maps-> Pantothenate and CoA biosynthesis
                                                 Valine, leucine and isoleucine biosynthesis
                                                 Valine, leucine and isoleucine degradation
InterPro domain  [17-424]   IPR001544   1.2e-176   Aminotransferase, class IV
                 [17-424]   IPR005786   1.2e-176   Branched-chain amino acid aminotransferase II
Orthology group  MCL12316   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211865-TA
ATGCCTTTTCGTCGAAGCAAGGGTCTAGTGAAATGGATCTTCGAGAACCAGCACAAGATGCAGTCCGTCCGCCGCTGCAGCTCGTTGGCTCGCTACAAGGAGATGGAGGACAGCGCCGCCTCCGAACATGACGTCAGCGGACCCAAGACCCGTCCGGAGATCACGCCAGAAATATCCTTCAAGCACGACGATCTTCAAGTGAGGCTCGCGGCGCCGTACCAGTTGCAGACGAAGCCGGACGCGGTCGAGTTAGGCTTCGGGAAGTACTTCACGGATCACATGCTCAAGATCCACTACAGCAAGCACCTCGGGGGCTGGCAGAAGCCGGAGATCACGCCCTTCGAGAACCTCAGCCTCCATCCCGCCGCCAAGGCTCTACATTATGCAATACAATTATTCGAAGGTATGAAGGCGTATAGGGGTGTAGACGACAAGATACGATTGTTTCGACCGGAACTCAATATGGAGAGGATGAACCTGGCGGCTCAGAGGTCCGGCTTGCCCATGTTCGACGGCCAGGAACTGATCCGCTGCATCACGAGGCTCATACAGATAGACCAGGAGTGGGTCCCTCACTCGGAGACCTCCACGCTGTACGTCCGCCCCACGCTCATAGGCACGGAGCCCACGTTCGGCATCATGGAGCCGGAGAGCGCGCTGCTGTTCGTGATCTTGAGTCCCGTGTCCGCCTACTATCAGACCCGCGGGGACGGCGCCGTGTCCATCTTCGCAGACCCCGCCGTGGTGCGCGCCTTCCCCGGCGGAGTCGGCAACCGGAAGGTCGGCTCCAACTACGGACCCACGATCGAGGCGACCGCGCGCGCCGCCAAGCTGGGCCACCAGCAGGTGCTGTGGCTGTTCGGCCCCGACAGAGAGCTGACCGAGGTCGGCGCCATGAACATCTTCATGGTGTACATCAACGACCAGGGAGAGAGACAGCTGAGCACGCCGCCCCTCAACGGCCTCATCCTGCCGGGAGTGACGCGTCGCTCCATCCTGGAGCTGGCGTCGCAGTGGGAGGACCTGGTGGTCAAGGAGGAGGTCATCACCATGGACCGCCTCGAGGACCTCAACGATCGCGGCCGCCTGCTGGAGCTGTTCGGCGCGGGCACGGCCGTGGTGGTGACGCCCATCGCCAACGTGGGCTACCTGCACCGCAACATCCGCGTGCCCACCACCAGCCAGCCGCGGCCCGTGTACCGCCGCCTCAGGGACACGCTGCTCGCCATCCAGTACGGCCACGTCGACCACCCCTACGCCAAGGTCATCGCGTAG

Protein sequence:

>DPOGS211865-PA
MPFRRSKGLVKWIFENQHKMQSVRRCSSLARYKEMEDSAASEHDVSGPKTRPEITPEISFKHDDLQVRLAAPYQLQTKPDAVELGFGKYFTDHMLKIHYSKHLGGWQKPEITPFENLSLHPAAKALHYAIQLFEGMKAYRGVDDKIRLFRPELNMERMNLAAQRSGLPMFDGQELIRCITRLIQIDQEWVPHSETSTLYVRPTLIGTEPTFGIMEPESALLFVILSPVSAYYQTRGDGAVSIFADPAVVRAFPGGVGNRKVGSNYGPTIEATARAAKLGHQQVLWLFGPDRELTEVGAMNIFMVYINDQGERQLSTPPLNGLILPGVTRRSILELASQWEDLVVKEEVITMDRLEDLNDRGRLLELFGAGTAVVVTPIANVGYLHRNIRVPTTSQPRPVYRRLRDTLLAIQYGHVDHPYAKVIA-
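
The transcript and protein lengths listed above are consistent with each other: 1275 bp is 425 codons, that is, 424 amino acids plus one stop codon. The following is a minimal sketch of how the two sequences relate, assuming the standard genetic code applies to this transcript. The function name `translate` and the `stop_symbol` parameter are illustrative and not part of the record; the `-` default only mirrors the trailing character of the protein sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this transcript).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        # Stop codons are written with stop_symbol, as in the record above.
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six codons and last four codons of DPOGS211865-TA.
print(translate("ATGCCTTTTCGTCGAAGC"))  # MPFRRS
print(translate("GTCATCGCGTAG"))        # VIA-
# 1275 bp -> 425 codons -> 424 residues plus the stop codon.
print(1275 // 3 - 1)                    # 424
```

Passing the full nucleotide sequence to `translate` should give a string to compare against the protein sequence listed above.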