Monarch geneset OGS2.0

DPOGS210383
TranscriptDPOGS210383-TA1449 bp
ProteinDPOGS210383-PA482 aa
Genomic positionDPSCF300025 + 923222-926329
RNAseq coverage114x (Rank: top 59%)
Annotation
HeliconiusHMEL0051310.072.77% 
BombyxBGIBMGA011600-TA0.069.23% 
DrosophilaCG11241-PB1e-16057.36% 
EBI UniRef50UniRef50_A8E6R22e-15857.36%CG11241, isoform B n=9 Tax=melanogaster subgroup RepID=A8E6R2_DROME
NCBI RefSeqXP_001958577.15e-16359.57%GF23448 [Drosophila ananassae]
NCBI nr blastpgi|1947525359e-16259.57%GF23448 [Drosophila ananassae]
NCBI nr blastxgi|1947525352e-15859.57%GF23448 [Drosophila ananassae]
Group
Gene OntologyGO:00084835.5e-239transaminase activity
GO:00301705.5e-239pyridoxal phosphate binding
GO:00038249.2e-77catalytic activity
KEGG pathwaydan:Dana_GF234481e-162 
 K00827 (AGXT2)maps-> Alanine, aspartate and glutamate metabolism
    Glycine, serine and threonine metabolism
InterPro domain[21-481] IPR0058145.5e-239Aminotransferase class-III
[34-478] IPR0154243.6e-119Pyridoxal phosphate-dependent transferase, major domain
[94-371] IPR0154219.2e-77Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[372-476] IPR0154221.8e-40Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL14531 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210383-TA
ATGGCGAATATATTTGTTAGATATTGCCATAACTCAATCTTCAGGCGAAACGCAGTTACGCTACCCAAATATGACTTTACACCAAAACCATATACGGGTCCATCACTACAAAGAACTGATGAAATAAAAGCCAATCACATCCCTCCCGCAATGTACAATATATATAGGAAGCCATTGGTGCTGCATCAGGGTCGAATGCAGTGGTTGTTTGACCACGAAGGTCGGAGATACTTGGATCTTTTTGGAGGGATTGTCACAGTTTCCGTTGGACATTGCCATCCAAAAGTTACCGCCGCTTTACATGATCAAATTAACACATTATGGCACACGACGAATATATACAGACACCCAAAAATATATGAATACGTAGAAAAACTTATTTCTAAATTCCCAGAAGATTTAAAGACAGTGTACTTAGTGAATAGTGGCACGGAAGCCAATGATCTGGCCATTCTGTTGGCGCGTGCCTTCACCGGCCACGAGGACGTCGTTTCGCTCCAAAGCTGTTACCATGGTTACTCCAGCACGATAATGGCGCTCACCGCTTCACAGGCTTACCGCATGCCGCTCTCCGTACCGGCTGGTTTTCATCATGCGATATTGCCGGACCCGTACCGCGGTATCTGGGGCGGATGCAGAGACTCCTTGTCACAAGTAGCTGGCTCATGTTCGTGTGTTGGAGACTGTGTGACGTCAGAGAAATATGTTCACCAACTCTCTGAGCTCGTAGATAATTCAATACCAGCGGGTCGCGTGGCCGCATTGTTTGCGGAATCCGTGCAGGGCGTTAATGGGACTGTACAGTTCACGCGAGGCTACCTCAAGCAGGCCGCTGAGCTGATTAGAAGCAAGGGAGGCCTGTTTGTCGCTGATGAGGTACAAACAGGTTTCGGGAGGACCGGCGATGCGTTTTGGGGATTTGAAAAGCACGATGTAGTTCCCGATATAGTCACTATGGCCAAAGGAATTGGAAATGGATTTCCGATGGCAGCAGTTGTTACTAGGAAAGAGATCGCCGAAGCTCACACAAGGGCTGCCTACTTTAACACTTTCGGGGGAAACCCAATGGCGGCTACAGTTGGGAAGGCCGTGTTGGAGGTTATCGAAGAAGAAAATCTTCAACAAAATTGCAAAGATACAGGAAAATATTTTATTGAACAGCTAATGCAATTACAAAAGCAATATCCGGTGATAGGTGATGTCCGCGGTAAAGGGCTCATGTTGGGAATAGAACTCGTTGAACCTTGCACTAAAAAGCCGCTAGACAGAAGTGATGTAACAGACATCATGGAATCCATAAAGGACCTTGGAGTTTTGATCGGACGGGGCGGGCGTTGGAGCAATGTTCTGAGGATTAAGCCACCCATGTGTATAAATAAGATGGATGTAAATTACGCCATATCGGTTTTAGATCAGTCCCTAAAGGATTATTTACAAAATAAAATCTGA

Protein sequence:

>DPOGS210383-PA
MANIFVRYCHNSIFRRNAVTLPKYDFTPKPYTGPSLQRTDEIKANHIPPAMYNIYRKPLVLHQGRMQWLFDHEGRRYLDLFGGIVTVSVGHCHPKVTAALHDQINTLWHTTNIYRHPKIYEYVEKLISKFPEDLKTVYLVNSGTEANDLAILLARAFTGHEDVVSLQSCYHGYSSTIMALTASQAYRMPLSVPAGFHHAILPDPYRGIWGGCRDSLSQVAGSCSCVGDCVTSEKYVHQLSELVDNSIPAGRVAALFAESVQGVNGTVQFTRGYLKQAAELIRSKGGLFVADEVQTGFGRTGDAFWGFEKHDVVPDIVTMAKGIGNGFPMAAVVTRKEIAEAHTRAAYFNTFGGNPMAATVGKAVLEVIEEENLQQNCKDTGKYFIEQLMQLQKQYPVIGDVRGKGLMLGIELVEPCTKKPLDRSDVTDIMESIKDLGVLIGRGGRWSNVLRIKPPMCINKMDVNYAISVLDQSLKDYLQNKI-