Monarch geneset OGS2.0

DPOGS208814
Transcript: DPOGS208814-TA | 1272 bp
Protein: DPOGS208814-PA | 423 aa
Genomic position: DPSCF300036 + 147532-151798
RNAseq coverage: 240x (Rank: top 43%)
Annotation
Heliconius: HMEL015088 | 3e-81 | 46.93%
Bombyx: BGIBMGA007921-TA | 0.0 | 79.67%
Drosophila: CG6950-PC | 5e-170 | 66.51%
EBI UniRef50: UniRef50_Q8SXC2 | 7e-168 | 66.51% | CG6950, isoform A n=47 Tax=Metazoa RepID=Q8SXC2_DROME
NCBI RefSeq: XP_001649143.1 | 7e-173 | 67.62% | kynurenine aminotransferase [Aedes aegypti]
NCBI nr blastp: gi|289740913 | 1e-173 | 68.67% | kynurenine aminotransferase [Glossina morsitans morsitans]
NCBI nr blastx: gi|242009619 | 2e-171 | 67.86% | kynurenine/oxoglutarate transaminase 1, putative [Pediculus humanus corporis]
Group
Gene Ontology
GO:0003824 | 2.6e-75 | catalytic activity
GO:0030170 | 2.6e-75 | pyridoxal phosphate binding
GO:0016769 | 8.8e-56 | transferase activity, transferring nitrogenous groups
GO:0009058 | 8.8e-56 | biosynthetic process
GO:0042218 | 2.6e-05 | 1-aminocyclopropane-1-carboxylate biosynthetic process
GO:0016847 | 2.6e-05 | 1-aminocyclopropane-1-carboxylate synthase activity
KEGG pathway 
InterPro domain
[5-421] | IPR015424 | 3.1e-100 | Pyridoxal phosphate-dependent transferase, major domain
[44-283] | IPR015421 | 2.6e-75 | Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[32-411] | IPR004839 | 8.8e-56 | Aminotransferase, class I/classII
[284-419] | IPR015422 | 5e-15 | Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology group: MCL13772 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208814-TA
ATGAGTCACAAGTTTAACTTGCCAGCTAGATATGGAACCGGCGAAAAAAGTGTGTGGGTTGAATATATACAACTGGCCGCTGAATACAAACCGGCTGTGAACCTCGGACAAGGCTTCCCAGATTACCACGCCCCTGAACATGTCACTAAAGCACTTGCAGAAATAACTACTGGAGAAAATCCATTATTCAATCAATATACAAGAGGATTTGGTCACCCCAGGTTAGTTCAGAATTTGGCAAAGCTCTACTCGCCGTTAGTAGGCAAAGAGATAGATCCAAACAATGAAATATTAGTGACGAGCGGTGCTTACGAGGCACTGTTCTCCACTATAATGGGTCACGTTGATGAGGGTGACGAAGTCATAGTGATTGAACCGTTCTTTGATTGCTACGAATTCATGATAAAGACAGCTGGTGGTATTCCCAGGTGCATCGCTTTAAAACCTAAAGCCACTAGTGGAACTATGACGTCAGCTGATTGGGTGCTAGATGAAGCCGAGCTGGTGTCATTGTTCAATAGCAAGACTAAAATGTTGATACTGAACACTCCCCACAACCCCCTGGGGAAAGTGTTCACCGCCCAGGAGTTAGAGACCATCGGCAATCTGTGCAAGAAGTACAACGTGCTGTGCGTCTCTGACGAGGTCTACGAATGGATGGTGTACGCGCCAAATAAACATATCAGAATAGCTTCAATGCCGGGTATGTGGGAGCGTACTATAACAATTGGTTCAGCGGGCAAGACCTTCTCCGTGACCGGTTGGAAGACCGGTTGGGCGTACGGTCCAGCCAACCTCATGAGGAACTTGCAGGTGGTGCATCAGAACTGCGTATACACGTGCTGCACTCCTATACAGGAAGCAGTGGCAAGATCTTTTGAATTCGAGCTGTCCAGATACAACTCACCCGAGTGCTACTTCTTCTCGCTAGCAAGAGAACTACTTTCAAAACGCGATTACCTCGTTAAAGTTTTGAGGGAGAACGGTTTCAACCCGACCGTACCTGATGGAGGGTACTTTATTGTAGCAGATTGGACTAAATTAGAAAAGAAAATAGACCTACAATCAGAATCTGATAAATACAAGGACTATCGTTTTACTAAAAAGTTCGCGAAGGAAGCTGGTGTCCTCGCCATACCGCCGTCAGCCTTCTACTCCGAGGATCACAAGCATCTGGGAGAAAACTTCGCGCGTTTCTGCTTCATTAAGAAAGATGAAAACCTTGCTCTGGCAGAAAACCTGATGAAGGAATGGAACGCTAAGAAGAAATGA

Protein sequence:

>DPOGS208814-PA
MSHKFNLPARYGTGEKSVWVEYIQLAAEYKPAVNLGQGFPDYHAPEHVTKALAEITTGENPLFNQYTRGFGHPRLVQNLAKLYSPLVGKEIDPNNEILVTSGAYEALFSTIMGHVDEGDEVIVIEPFFDCYEFMIKTAGGIPRCIALKPKATSGTMTSADWVLDEAELVSLFNSKTKMLILNTPHNPLGKVFTAQELETIGNLCKKYNVLCVSDEVYEWMVYAPNKHIRIASMPGMWERTITIGSAGKTFSVTGWKTGWAYGPANLMRNLQVVHQNCVYTCCTPIQEAVARSFEFELSRYNSPECYFFSLARELLSKRDYLVKVLRENGFNPTVPDGGYFIVADWTKLEKKIDLQSESDKYKDYRFTKKFAKEAGVLAIPPSAFYSEDHKHLGENFARFCFIKKDENLALAENLMKEWNAKKK-
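
Consistency check (illustrative):

The record lists a 1272 bp transcript and a 423 aa protein, and the protein sequence ends in "-" where the transcript ends in the stop codon TGA. The sketch below shows one way to check that the two sequences agree. It assumes the transcript is a complete coding sequence read in frame from the first base and that the standard genetic code applies. The names CODON_TABLE, translate and expected_protein_length are hypothetical helpers written for this example and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[cds[i:i + 3]].replace("*", stop_symbol)
        for i in range(0, len(cds), 3)
    )


def expected_protein_length(transcript_bp, has_stop_codon=True):
    """Residue count implied by a coding sequence of transcript_bp bases."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    codons = transcript_bp // 3
    return codons - 1 if has_stop_codon else codons


# First six and last five codons of DPOGS208814-TA, copied from the record.
first_codons = "ATGAGTCACAAGTTTAAC"
last_codons = "GCTAAGAAGAAATGA"
print(translate(first_codons))        # start of DPOGS208814-PA
print(translate(last_codons))         # end of DPOGS208814-PA, including the stop
print(expected_protein_length(1272))  # residues expected from 1272 bp
```

Passing the full DPOGS208814-TA sequence to translate and comparing the result with DPOGS208814-PA extends the same check to every residue.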