Monarch geneset OGS2.0

DPOGS214664
Transcript: DPOGS214664-TA (1281 bp)
Protein: DPOGS214664-PA (426 aa)
Genomic position: DPSCF300321 - 35532-41018
RNAseq coverage: 379x (Rank: top 32%)
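
The summary values above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the helper name `parse_position` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record does not state.

```python
import re

def parse_position(text):
    """Split a position string such as 'SCAFFOLD - START-END' into its parts.

    Hypothetical helper; the field layout is inferred from this record only.
    """
    match = re.fullmatch(r"(\S+)\s*-\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

scaffold, start, end = parse_position("DPSCF300321 - 35532-41018")

# Genomic span, ASSUMING 1-based inclusive coordinates.
span = end - start + 1

# Values copied from the record.
transcript_bp = 1281
protein_aa = 426

# A complete coding sequence has one codon per residue plus one stop codon.
codons = transcript_bp // 3

print(scaffold, span, codons)
```

Under these assumptions the transcript length equals 3 x (426 + 1), which fits a coding sequence of 426 residues plus a stop codon, and the genomic span is longer than the transcript, as expected if the gene contains introns.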
Annotation
Heliconius        HMEL007971         3e-144   69.65%
Bombyx            BGIBMGA001876-TA   0.0      79.30%
Drosophila        CG10361-PA         9e-151   62.63%
EBI UniRef50      UniRef50_O75600    5e-132   56.46%   2-amino-3-ketobutyrate coenzyme A ligase, mitochondrial n=768 Tax=root RepID=KBL_HUMAN
NCBI RefSeq       XP_001352781.1     5e-152   63.73%   GA10272 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp    gi|125977496       1e-150   63.73%   GA10272 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastx    gi|125977496       5e-144   63.89%   GA10272 [Drosophila pseudoobscura pseudoobscura]
Group
Gene Ontology     GO:0008890   7e-179    glycine C-acetyltransferase activity
                  GO:0003824   3.2e-90   catalytic activity
                  GO:0030170   3.2e-90   pyridoxal phosphate binding
                  GO:0016769   6.7e-71   transferase activity, transferring nitrogenous groups
                  GO:0009058   6.7e-71   biosynthetic process
KEGG pathway      dpo:Dpse_GA10272   2e-151
                  K00639 (E2.3.1.29, kbl) maps-> Glycine, serine and threonine metabolism
InterPro domain   [34-424]    IPR011282   7e-179     2-amino-3-ketobutyrate coenzyme A ligase
                  [27-425]    IPR015424   4.6e-119   Pyridoxal phosphate-dependent transferase, major domain
                  [90-305]    IPR015421   3.2e-90    Pyridoxal phosphate-dependent transferase, major region, subdomain 1
                  [74-413]    IPR004839   6.7e-71    Aminotransferase, class I/classII
                  [306-421]   IPR015422   2.1e-37    Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology group   MCL15621   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214664-TA
ATGGCTTTACAACCTAAACTCCTGCGCATAGTGTTCAGAGGACAACAATTTAGGCAGCTTCATGAGTTCAATGTGAAAGAAAGGGCAGGGGTGACTAAACTCCGAGAGGTCCTTGAAGATAGACTCCAAGAAATAAAGAGAGCTAAGACGTGGAAACATGAGAGAATATTGACGTCACCTCAGGACACGAAGGTGAAGGTGCAGGGAGCGGAGGGGGAGTTTTTGAATTTCTGTGCTAATAACTATCTCGGTTTGTCTAATCATCCAGAGGTTGTGGAGGCAGCGCGAGAAGCCTTAAGTAAATACGGCGCAGGTCTCAGTTCGGTACGATTTATTTGTGGAACTCAAACTATTCATAAGGAATTGGAGAGACGTCTGGCTAAATTCCATGGAAGAGAGGACGCAATACTCTATATATCCTGCTTCGATGCCAACGCCGGTCTGTTTGAGACGATGCTCACTCCTGAAGATGCTGTGTTCTCCGACGCTTTGAACCACGCCTCCATCATCGACGGAATCAGATTGTGCAAGGCCCAAAAGTTTAGATATCCGCACAGAGATCTTAAAGAATTGGAACACCTTCTAGCTCATAGCGAAGCAAGACTTAAACTGATAGTGACTGATGGAGTGTTCTCTATGGACGGTACGGTCGCCCCCATAAAGGGTCTACGAGATTTGGCTGATAAGTACAGAGCCTTGCTTGCTATAGACGATAGTCATGCTACCGGATTTTTTGGAGAAACTGGCAGAGGCACCGAAGAGTACTGCGGCGTGTTGGGTGCGGCCGACATTATCTGCTCCACGCTGGGTAAGGCGGTCAGCGGAGCCGCCGGGGGCTACACCACCGGGCCCAAGGAACTCATCACCTTACTCAGAAACGTCTCTAGACCTTACCTGTTCTCAAATTCACCGCCGCCGCCCGTGGTCGCTGCTTCAATGAAGTCCCTAGAATTAGTGGAGAACAGCTCGGATCTCCGGCGGCGTCTCCGCGAGAACACGCGCCAGTTCCGCGAGGGTCTGAAGTCTGTGGGCCTGGCGGTAGCCGGGGACGAGCATCCGATCTGCCCGGTAATGGTGGGAGACGCTGCTTTGGCTGTGGACCTAGCTGCTGGGATGTTAGAGCGCGGCATATACGTAGTAGCGTTCAGTTACCCCGTGGTGCCGCGAGGCGGCGCCCGCGTCCGCGTACAACTATCAGCGGCTCACACGCGTGATGACGTCACACGAGCCATTGACGCTTTCAAACACGTCGCTCAAAACATCGGCATCATTAACAAATGA

Protein sequence:

>DPOGS214664-PA
MALQPKLLRIVFRGQQFRQLHEFNVKERAGVTKLREVLEDRLQEIKRAKTWKHERILTSPQDTKVKVQGAEGEFLNFCANNYLGLSNHPEVVEAAREALSKYGAGLSSVRFICGTQTIHKELERRLAKFHGREDAILYISCFDANAGLFETMLTPEDAVFSDALNHASIIDGIRLCKAQKFRYPHRDLKELEHLLAHSEARLKLIVTDGVFSMDGTVAPIKGLRDLADKYRALLAIDDSHATGFFGETGRGTEEYCGVLGAADIICSTLGKAVSGAAGGYTTGPKELITLLRNVSRPYLFSNSPPPPVVAASMKSLELVENSSDLRRRLRENTRQFREGLKSVGLAVAGDEHPICPVMVGDAALAVDLAAGMLERGIYVVAFSYPVVPRGGARVRVQLSAAHTRDDVTRAIDAFKHVAQNIGIINK-
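
The two sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch that assumes the standard genetic code; the function name `translate` and the constants are illustrative and not part of the record. It writes the stop codon as `*`, whereas the record writes it as `-`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 21 and last 15 nucleotides of DPOGS214664-TA, copied from the record.
first = translate("ATGGCTTTACAACCTAAACTC")
last = translate("ATCATTAACAAATGA")
print(first, last)
```

The translated ends correspond to the start (`MALQPKL`) and the end (`IINK` followed by the stop) of DPOGS214664-PA. Passing the full nucleotide sequence to the same function gives the complete protein for comparison.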