Monarch geneset OGS2.0

DPOGS203745
TranscriptDPOGS203745-TA999 bp
ProteinDPOGS203745-PA332 aa
Genomic positionDPSCF300010 - 88320-90626
RNAseq coverage209x (Rank: top 46%)
Annotation
HeliconiusHMEL0025612e-14677.16% 
BombyxBGIBMGA011491-TA4e-11263.11% 
DrosophilaCG10425-PA1e-6541.87% 
EBI UniRef50UniRef50_F4WRR81e-6643.43%3-ketodihydrosphingosine reductase n=7 Tax=Formicidae RepID=F4WRR8_ACREC
NCBI RefSeqXP_001657877.12e-7645.15%short-chain dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|3838555928e-7948.16%PREDICTED: 3-ketodihydrosphingosine reductase-like [Megachile rotundata]
NCBI nr blastxgi|3838555924e-7648.46%PREDICTED: 3-ketodihydrosphingosine reductase-like [Megachile rotundata]
Group
Gene OntologyGO:00054883.3e-48binding
GO:00081521.1e-23metabolic process
GO:00164911.1e-23oxidoreductase activity
KEGG pathwaydre:3941148e-66 
 K04708 (E1.1.1.102)maps-> Sphingolipid metabolism
InterPro domain[31-270] IPR0160403.3e-48NAD(P)-binding domain
[37-204] IPR0021981.1e-23Short-chain dehydrogenase/reductase SDR
[37-54] IPR0023477.5e-14Glucose/ribitol dehydrogenase
Orthology groupMCL14038 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203745-TA
ATGTTTTTGTATATACTGTCGTTGTTTGTTATTTTATTGTTTATATTGTTCGTATTTTTGTACAATGTATTGGATAAAAAAATTGTGTATAAAGACTTGAAAAACCGGCATGTTGTTATCACTGGAGGTTCAAGCGGTATCGGCAAGGCGGCTGCGATTGAAGCAGCGAAATTGGGGGCTAACGTTACTATAATTGGACGTGATCTTCAAAAACTTATTTTGGCTGTTGGTGACATAACAAACCATTGTAAACCTTACAGCGGCCAGAAAATACAATATGTGATATTAGATGTTACGTCTGATTACGATTTAATAGAGAAATGTTTGAAAAAAGTTGAGTCAGACGTCGGTCCTATATTTATGCTTATTAACTGTGCTGGAACGTGTATTTGTGGGGAGTTCGATGGAGTTGATGTTAAAGTATTAAAACAAATGATAGATCTCAACTATTTTGGCACTGCTTATCCTACTAGATGTGTTCTACCTGGTATGAAGAAACGGGATGTGGGGATCATAATTTTTGTTTCAACCGAAGCAGCTTTACTAGGTATATATGGCTACAGTGCTTATGGTGCAGCTAAATGGGCCGTCCGTGGTTTAGCAGAATCAGTATTCATGGAGCTTACGGGAACAAATGTGAGATTGACCCTTGCATTTCCACCTGATACTGACACTCCTGGATTCGAAAAGGAGGAGTTGACTAAACCGAAGGAGACCAAATTAATATCTGGTTCTGGCGGATTACATACAGCTGAGGATGTCGGCAAGAAAATGATTCAAGATGCACTGAACGGCAAGATATATTCTGTGTTCGGCTTCAGCGGTCATTTGCTGTCTACCCTCTACTGTGCTACTATAGATGGGCCGTTTCAAATTATTGTACAAATACTTTCATTGGGTTTACTACGGGCTGTGATGGTTGTACCTCAGCTGTCGTTCCAGAAAATAGTCAAGGATGGTCTCAAAGAGAAGTGTTTAGAAAATAGTAAGGATAAATAG

Protein sequence:

>DPOGS203745-PA
MFLYILSLFVILLFILFVFLYNVLDKKIVYKDLKNRHVVITGGSSGIGKAAAIEAAKLGANVTIIGRDLQKLILAVGDITNHCKPYSGQKIQYVILDVTSDYDLIEKCLKKVESDVGPIFMLINCAGTCICGEFDGVDVKVLKQMIDLNYFGTAYPTRCVLPGMKKRDVGIIIFVSTEAALLGIYGYSAYGAAKWAVRGLAESVFMELTGTNVRLTLAFPPDTDTPGFEKEELTKPKETKLISGSGGLHTAEDVGKKMIQDALNGKIYSVFGFSGHLLSTLYCATIDGPFQIIVQILSLGLLRAVMVVPQLSFQKIVKDGLKEKCLENSKDK-