Monarch geneset OGS2.0

DPOGS209467
TranscriptDPOGS209467-TA1800 bp
ProteinDPOGS209467-PA599 aa
Genomic positionDPSCF300275 + 210431-216648
RNAseq coverage1025x (Rank: top 12%)
Annotation
HeliconiusHMEL0044991e-12774.68% 
BombyxBGIBMGA005845-TA4e-10966.25% 
Drosophilalace-PA1e-17159.37% 
EBI UniRef50UniRef50_Q9V3F22e-16959.37%LD36009p n=52 Tax=Opisthokonta RepID=Q9V3F2_DROME
NCBI RefSeqXP_002003415.11e-17260.84%GI17899 [Drosophila mojavensis]
NCBI nr blastpgi|3838642253e-17155.74%PREDICTED: serine palmitoyltransferase 2-like [Megachile rotundata]
NCBI nr blastxgi|1951177602e-16560.84%GI17899 [Drosophila mojavensis]
Group
Gene OntologyGO:00038247.7e-79catalytic activity
GO:00301707.7e-79pyridoxal phosphate binding
GO:00167694e-56transferase activity, transferring nitrogenous groups
GO:00090584e-56biosynthetic process
KEGG pathwaydmo:Dmoj_GI178994e-172 
 K00654 (E2.3.1.50)maps-> Sphingolipid metabolism
InterPro domain[178-596] IPR0154242.1e-100Pyridoxal phosphate-dependent transferase, major domain
[257-474] IPR0154217.7e-79Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[236-585] IPR0048394e-56Aminotransferase, class I/classII
[475-592] IPR0154226.5e-34Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL11011 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209467-TA
ATGAGACAGATCAAAGTAATCAATAGTGGGTGCGGTGTCACTGAAATTAGGAGATATGAGGAGTGCGGAGGTGATCCTGTCGTGCCTCTCCGCAGTTTCGAACTGAAAGGTCTGCGCGCCGCTCGCCGGCTGCCGCTGGACGACGTCACTCAGCACGACAAACGTGCGGTTAGGACGGCGAGGATGGTGGACACCACGGTAGTGACTAGTGTTGATTTAAGAGAAACTAATTTAAACAAAAATGTACAATGTAATGGAAGTAGTGGAAAAAATATCAATGGAAAAATACATCTACAGAATGGGTCTTTAAAATCTCAAACTATAAGTAATGGATCCAATAAATGTAAGGACGAGAACGAGCGATTTCCGACGATGGACTGGTCTCAGTACGACACCTTCCCCGGCTCCTTCGAGAAGTGTTCCCTGATGACGGCGGCCCTAACTCACGTGGGCATGTACCTGCTCATGTTTCTCGGCTTCGTGAACCAGCTGCTCTTCAAACCGAAGGTGGCCACCGAGAGGAACCGGGAGGGCTACGCACCGCTGTACAACCCGTTCGAGCAGTTCTTCTCCCGCTACGTGTACCGCCGCGTGCGACACTGCTTCAACCGCCCCATCAGCTCCACGCCCGGCGCCGAATTGACGCTCAAGGAGCGCCACACGGACGATTACAACTGGTCCTTCAGGTTCACGGGCAAGGAAAAGCGCTGCATTAACCTGGGCTCCTACAACTACCTGGGCTTCGCGGAGGCGTCCGGCCCCTGCGCGGAGGCGGCGGAGGCGGCCGCCAGGCGCTACGGCCTCGCGCTTGCGTCCTCCCGCGGCGAACTCGGCTCTACGCCGCTGCACGACGAGTTGGAGCGGACCACCGCCGGCTTCCTCGGCGTGGAAGCGGCCGTGGTGTTCGGGATGGGATTCGCGACCAACGCGCTCGGCCTGCCGGGCCTGCTTGGGAGCGGCTCGCTCGTGCTCAGCGACGAGAACAACCACGCCTCGCTCATCCTGGGCCTGAGGCTCGCCCGCGTCGCCGTGCGCGTTTTCCGACACAACGACGTGCGACACTGCGAGCGACTGGCCCGAGCCGCCCTGGCTGAGGGACGTTGGACGAAGATCGTCATCGTGGTGGAGGGCGTGTACAGCATGGAGGGCTCCGTGGCGCCTCTGGCCGCTCTCGTGGCCCTCAAGAAACGTCTCGGCTTGTATCTATATCTGGACGAGGCGCACTCGGTGGGGGCCATGGGTCCACGCGGCCGCGGCGTCACCGACCACTGCGGCGTCTCCCCGCGGGACGTCGATGTCCTCATGGGCACCTTCACCAAGAGCTTCGGCGCCGCTGGGGGATACATAGCGGGCTCCGAGAAGCTGGTGGATTGGGTCCGTGCCCGCTGTCACGCGCACGGCTACGCCCACGCCATGTCTCCCCCGGTTGCGGCGCAGGTGCTGGCGGCCATGCGCGCCATATCCTCGCCCGTCGGCCTCAAGCGCGTGGCTCGTCTGAGGGACAACACTCATCACTTCCGGCGCCGGCTCCGCGAGATGGGCGCCATCACCTTCGGCCACGAGGACTCGCCCGTGGTGCCGATGCTGGTGTACACCTTCAGTAAGATGGCGGCCACCGTGGAGCGCCTCACGGAGCGCGGCCTGGCCACGGTCGGCGTGGGCTACCCCGCCACGCCCCTCAACAAGGCTCGCATCAGGTTCTGTCTGTCGGCGGCGCACTCCCGGGAACAGATCGACCGCTGCCTGGATTCCATCGAGGCCGTCGTGGAGGAGGTGGGTCTGCGGTACTCCCGCCGCTAG

Protein sequence:

>DPOGS209467-PA
MRQIKVINSGCGVTEIRRYEECGGDPVVPLRSFELKGLRAARRLPLDDVTQHDKRAVRTARMVDTTVVTSVDLRETNLNKNVQCNGSSGKNINGKIHLQNGSLKSQTISNGSNKCKDENERFPTMDWSQYDTFPGSFEKCSLMTAALTHVGMYLLMFLGFVNQLLFKPKVATERNREGYAPLYNPFEQFFSRYVYRRVRHCFNRPISSTPGAELTLKERHTDDYNWSFRFTGKEKRCINLGSYNYLGFAEASGPCAEAAEAAARRYGLALASSRGELGSTPLHDELERTTAGFLGVEAAVVFGMGFATNALGLPGLLGSGSLVLSDENNHASLILGLRLARVAVRVFRHNDVRHCERLARAALAEGRWTKIVIVVEGVYSMEGSVAPLAALVALKKRLGLYLYLDEAHSVGAMGPRGRGVTDHCGVSPRDVDVLMGTFTKSFGAAGGYIAGSEKLVDWVRARCHAHGYAHAMSPPVAAQVLAAMRAISSPVGLKRVARLRDNTHHFRRRLREMGAITFGHEDSPVVPMLVYTFSKMAATVERLTERGLATVGVGYPATPLNKARIRFCLSAAHSREQIDRCLDSIEAVVEEVGLRYSRR-