Monarch geneset OGS2.0

DPOGS210326
TranscriptDPOGS210326-TA1332 bp
ProteinDPOGS210326-PA443 aa
Genomic positionDPSCF300025 - 668138-672111
RNAseq coverage706x (Rank: top 18%)
Annotation
HeliconiusHMEL0138573e-11352.98% 
BombyxBGIBMGA011964-TA0.067.95% 
DrosophilaSpt-I-PB2e-11646.71% 
EBI UniRef50UniRef50_Q6NR463e-11446.71%RE58623p n=15 Tax=Coelomata RepID=Q6NR46_DROME
NCBI RefSeqXP_001847608.11e-12350.22%serine palmitoyltransferase 1 [Culex quinquefasciatus]
NCBI nr blastpgi|1700395803e-12250.22%serine palmitoyltransferase 1 [Culex quinquefasciatus]
NCBI nr blastxgi|1700395802e-11950.66%serine palmitoyltransferase 1 [Culex quinquefasciatus]
Group
Gene OntologyGO:00038241.5e-74catalytic activity
GO:00301701.5e-74pyridoxal phosphate binding
GO:00167694.5e-40transferase activity, transferring nitrogenous groups
GO:00090584.5e-40biosynthetic process
KEGG pathwaycqu:CpipJ_CPIJ0061644e-123 
 K00654 (E2.3.1.50)maps-> Sphingolipid metabolism
InterPro domain[26-441] IPR0154243e-76Pyridoxal phosphate-dependent transferase, major domain
[89-316] IPR0154211.5e-74Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[73-431] IPR0048394.5e-40Aminotransferase, class I/classII
Orthology groupMCL12298 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210326-TA
ATGATTTTAATGGACATATACGATACTGCGAGTTGGTGGACATTAGTAATTGCAGCCCAAGTGGTTATCGGCAGCGCGTACTTCTTGTGGACTAAAAAGAAGAGAGAACCGGAAGATAAGTTGACTTATAAAAAGTTCAATGAATGGAAACCTTTACCTCTTGTAGAATACGACCTGAAAGTTGATGAGCCTCCATTGATGGGTAAAATTGACGAAAATGTTCTCAATGTTGGCGCTACTAGTTTTCTGAATTTTGATAAGGAGCCCCAAATAATGGAAAATGCTATTGCATCCCTCCATAAATATGGTGTAGGATCATGTGGGCCGAGAGGTTTTTATGGCACAATTGATGTGCATCTGGAACTTGAAGAGAGATTGGCAAAGTTTTTGCAGGTTGAAGAGACTTGTGTGTATTCATATGGATTTTCCACCATGGCCAGCGCCATACCAGCCTATGCTAAGAAAGGGGACATCATATTTGCAGATGAGAAGGTCTGGTTCGCAATCCAGAAAGGCATAGAAGCGTCCCGCAGCAATGTTAAATACTTCAAACACAACGACATGAAGGATTTAGAAAGGTTGCTCGAGGAGGGCGTCCAGAAAAAGGAATTGCATCCTAAGAGACGTGCGTTCCTGATCGTTGAAGCAATATACTTTAATTCTGGTAAAATGTGCCCTTTGAGGAAGGCTGTGGAACTGGCGAGGAGGTACAAATTAAGAATCATGCTGGACGAGAGCTTGAGCATTGGTGTAATAGGTAAGAACGGCCGCGGTCTGACCGAACACTTGAACGTGCCTCGTGATGAAGTGGATCTCATCATGGGCTCGCTGGAACACTCGTTCGCCACCATCGGCGGCTTCTGCGCTGGGACGCACTTCATCGTAGAACACCAAAGGCTATCAGGCCTAGGGTACTGTTTCAGCGCGTCCCTTCCACCTATGCTAACAGAGGCGGCCATCACCGCCCTGGATATACTCGAGTCCCAACCAGTCGTGGTTAAGGAGTTGGACGAAGTGTCCGTTAAGTTGAACAAGGCCCTCGACAATCTGAAACACTACACATATAGCGGGGATGAATTGTCACCTGTAAAACATGTTTACCTCAAAGATGATATGTCTCACAATGATAAAGAATCATATCTCAAGAAGATAACCAAGTACTGTATGGAGAAAGGCATAGCTATGACGACAGCTGCCTACCTTAAAGACCAGGAGGTCAAATGTCCGGAACCTTCCATACGCTTGGCCTCAAATAGAAAATTAACTGATGACAATATAAAGCAGATCTGCGAACTTTTGGATGAGGCCTATGAAAGAGTTGGTCCTAAATAG

Protein sequence:

>DPOGS210326-PA
MILMDIYDTASWWTLVIAAQVVIGSAYFLWTKKKREPEDKLTYKKFNEWKPLPLVEYDLKVDEPPLMGKIDENVLNVGATSFLNFDKEPQIMENAIASLHKYGVGSCGPRGFYGTIDVHLELEERLAKFLQVEETCVYSYGFSTMASAIPAYAKKGDIIFADEKVWFAIQKGIEASRSNVKYFKHNDMKDLERLLEEGVQKKELHPKRRAFLIVEAIYFNSGKMCPLRKAVELARRYKLRIMLDESLSIGVIGKNGRGLTEHLNVPRDEVDLIMGSLEHSFATIGGFCAGTHFIVEHQRLSGLGYCFSASLPPMLTEAAITALDILESQPVVVKELDEVSVKLNKALDNLKHYTYSGDELSPVKHVYLKDDMSHNDKESYLKKITKYCMEKGIAMTTAAYLKDQEVKCPEPSIRLASNRKLTDDNIKQICELLDEAYERVGPK-