Monarch geneset OGS2.0

DPOGS208998
TranscriptDPOGS208998-TA1437 bp
ProteinDPOGS208998-PA478 aa
Genomic positionDPSCF300009 + 2043155-2047055
RNAseq coverage508x (Rank: top 25%)
Annotation
HeliconiusHMEL0167820.079.05% 
BombyxBGIBMGA012526-TA0.086.46% 
DrosophilaCG8745-PA8e-17065.51% 
EBI UniRef50UniRef50_UPI00020637BE0.067.42%UPI00020637BE related cluster n=3 Tax=unknown RepID=UPI00020637BE
NCBI RefSeqXP_392348.10.067.42%PREDICTED: similar to CG8745-PA [Apis mellifera]
NCBI nr blastpgi|3504197090.064.54%PREDICTED: alanine--glyoxylate aminotransferase 2-like [Bombus impatiens]
NCBI nr blastxgi|3504197090.066.17%PREDICTED: alanine--glyoxylate aminotransferase 2-like [Bombus impatiens]
Group
Gene OntologyGO:00084839.8e-249transaminase activity
GO:00301709.8e-249pyridoxal phosphate binding
GO:00038241.4e-78catalytic activity
KEGG pathway 
InterPro domain[5-445] IPR0058149.8e-249Aminotransferase class-III
[7-442] IPR0154242.3e-122Pyridoxal phosphate-dependent transferase, major domain
[65-322] IPR0154211.4e-78Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[323-441] IPR0154222.7e-40Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL11345 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208998-TA
ATGGCTTACCTACAGTCAATGCCAAAATCAGAAACTATCAAATTGCGGGAAAAACACATCGGAGCCGCTTGCCAGTTGTTTTTCCGTTCATCCCCTCTAAAAATAGTGAGGGGTGTCGCCCAATTTATGTATGACGAGACAGGCGAGAGATACCTCGACTGCATTAACAATGTCGCACACGTTGGTCACTGTCACCCACACGTCGTTGAAGCTGGTCGTAATCAGATGTCATTGATATCAACCAATAATCGATATCTTCACGACGAAATCGTGATGCTCGCTGAAAGACTAGTTAAGACGATGCCCGAACCATTGTCAGTTTGTTTCTTTGTAAACTCCGGCTCTGAAGCTAATGATCTTGCTTTGAGATTAGCCAGAGTTCACACTAAGAGGAAGGATGTTATCACCCTTGACCACGCATATCATGGTCACCTTACGTCTATGATTGATGTGTCCCCCTACAAACTAAACCTACCAGGAGGCCCAGAGAAACCAGATTGGGTTCATGTGGCTCCAGTTCCTGATGTGTACAGAGGAAAATACACATATCCAAACGACTCGACATCAGAGGAAAACTTAGGAAAATTATACGCTGATGAGGTGGGAAAACTATGCGACGAAATTAAGAAGAAGGGCGGTGTATGCGCTTTTATAGCAGAAAGTCTCCAAAGTTGTGGGGGACAAATAATTCCTCCCGAAGGCTACCTAAAGAAAGTTTTTAAGCACGTGAGGGAAGCTAATGGAGTATGTATAGCTGATGAAGTTCAAGTAGGATTTGGGAGAGTCGGCACTCACATGTGGGCATTCGAGACTCAGGGCGTTGTGCCAGATATCGTTACCATGGGAAAGCCCATGGGTAACGGTCATCCTGTCGCGGCGGTCATTACTACACCAGAGATCGCAAAAAGCTTCACAGATACTGGTGTCGAATACTTTAACACATATGGAGGCAATCCTGTTTCCTGCGCTATAGCTAATGCAGTATTAGACGTAATAGAAGAAGAACGTCTAATGGAGCGTGCAGCACGTGTTGGAAACCATCTCTTGTCTCGTTGCGAGGGACTTCAACACAAGCACAAATTGATAGGAGACGTCCGCGGTAGAGGTCTTTTTGTTGGTGTCGAACTTGTGACTGATCGTGAAACTAGAAATCCCGCAACAGCAGAAGCCAAACATGTTGTTAACAGAATGCGTGAGGAAAAAATACTTATAAGTCGCGATGGACCAGACAGCAATGTACTAAAATTCAAGCCACCCATGGTATTTACAACTCAGGACGCTGATAGATTAGTGGACACTTTGGATCGAGTGTTATCCGAATTAAATGGCGGCATGACAGTTAATGTCAAACTTGAGATGATGGTCACACCAATTAGGGATGAAGTCAGTAACAGCAGTTTGCAAACTAATATGCCACAGTTGTTGAAAGCAATATGA

Protein sequence:

>DPOGS208998-PA
MAYLQSMPKSETIKLREKHIGAACQLFFRSSPLKIVRGVAQFMYDETGERYLDCINNVAHVGHCHPHVVEAGRNQMSLISTNNRYLHDEIVMLAERLVKTMPEPLSVCFFVNSGSEANDLALRLARVHTKRKDVITLDHAYHGHLTSMIDVSPYKLNLPGGPEKPDWVHVAPVPDVYRGKYTYPNDSTSEENLGKLYADEVGKLCDEIKKKGGVCAFIAESLQSCGGQIIPPEGYLKKVFKHVREANGVCIADEVQVGFGRVGTHMWAFETQGVVPDIVTMGKPMGNGHPVAAVITTPEIAKSFTDTGVEYFNTYGGNPVSCAIANAVLDVIEEERLMERAARVGNHLLSRCEGLQHKHKLIGDVRGRGLFVGVELVTDRETRNPATAEAKHVVNRMREEKILISRDGPDSNVLKFKPPMVFTTQDADRLVDTLDRVLSELNGGMTVNVKLEMMVTPIRDEVSNSSLQTNMPQLLKAI-