Monarch geneset OGS2.0

DPOGS207584
TranscriptDPOGS207584-TA1557 bp
ProteinDPOGS207584-PA518 aa
Genomic positionDPSCF300072 + 814537-820839
RNAseq coverage308x (Rank: top 37%)
Annotation
HeliconiusHMEL0164476e-4533.81% 
BombyxBGIBMGA004692-TA0.083.72% 
DrosophilaSMSr-PA4e-12948.37% 
EBI UniRef50UniRef50_E2ADP21e-13762.92%Sphingomyelin synthase-related 1 n=7 Tax=Pancrustacea RepID=E2ADP2_CAMFO
NCBI RefSeqXP_396152.33e-14058.81%PREDICTED: similar to SMSr CG32380-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3812179630.082.95%sphingomyelin synthase-related protein [Chilo suppressalis]
NCBI nr blastxgi|3812179630.082.95%sphingomyelin synthase-related protein [Chilo suppressalis]
Group
Gene OntologyGO:00055154.8e-07protein binding
KEGG pathwaytgu:1002299004e-66 
 K04714 (SGMS)maps-> Sphingolipid metabolism
InterPro domain[1-72] IPR0109934.8e-07Sterile alpha motif homology
Orthology groupMCL12702 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207584-TA
ATGGAAATGTGTATTGTTAAGTGGAACAACAAAGAAGTTTTAGAATTATTACATAAAGAGAAAATATCAAGCTTCATAATTGAAATTTGCAAGACTCAAGATATTGATGGACAGTGTTTACTTTCTTTTGTCGACCGCGATTTCTATGACTATCCATTTGATCAGTTAAAACTGGGTGAAAGAAAAAGGTTTATACTGTTAGTGAAAAAATTACAAAGAAATAATAGAAGTGCAATGTATGAACTTGGATTATACGATGATCATACATCAAACAATCCTGCTACAAATATTAATTTCGTTGGAACAAATTTATCACATTTAAGCTACAATCTCCATAATCAATTACAAAATGAACTGTATGCAAGAAATACAGAATGCATCTCAAATTTTACACCAGATGTGAAAGCTTCCAAATTAAAGCCAGAAGTGTGGAAAACTGCAATTGCCTTAGGGTATGTGTTTCTTGTGACATGGGTGACGGCAGTAGTAATGGTGATTGTCCACGACAAAGTTCCGGACATGAAAAAATATCCACCTCTGCCTGATCTCTTCCTAGATAATGTACCACATATACCCTGGGCTTTTGATATGTGTGAAATAACTGGCTCATTCCTAATGGCTATCTGGCTAGTTGTTCTGTTCTTCCATAAACATAGGTTTATAATCCTGAGAAGATTTTTCGCACTGGCTGGTACGGTGTTCCTGTTGCGTTGTTTTACTATGTTGATAACGTCGCTCTCTGTGCCGGGATCACATCTCAAATGTGAGCCTCGGTTCTATCCGCCTGCTGACGATCTCACGGTTTGGGGGCGCCGCCTCCGACAGGCCTACGACATCTGGAGCGGTGCTGGCATGTCAGTACGCGGAGTCCGTACTTGCGGAGACTATATGTTCTCAGGGCACACCGTGGCCTTAACGCTCCTAAACTTTTTTATCACCGAATACACATCCCGTAGTCTATACCTCCTTCATATACTGACATGGGTAATGAATATGTTCGGTATCTTTTTCATTCTGGCGGCTCATGAACACTACTCCATAGACGTGTTCATAGCCTTCTATATTACATCGAGACTCTTCCTTTACTACCACACGCTCTCAAACAACCAAGCTTTGATGCAATCTGATTCTTCCAGAACTAGAATATGGTTCCCGCTGCTGTCGTTCTTCGAGTCCGAAGTGGACGGCATCGTTCCTAACGAGTACGAAGGGCCGATGGAGATCGTGATCAACCTGAAGCAATGGCTGGTGCAGCTAGTGTTGGATGTGAAGGCGTCGTCTGTGGCCAGGGTCGCGGGCAGCAAGCTACAGGAGGGCGCCGCTATGGGAGAGTACTCGATGGTGAGGCTCGTGGACGGCATCAAACGGAACATCAGCATTATGGAGGAGTATAAAAACTCGCGCCAGAGGCTCGCCACACTAGACAAGAACGTGCAAGCCGGCCACGACCTGCCCGAGGCCGATTTACGACATGGGAAAATAGCTGAGCTTACCAACGACGCCCTCTTCAGAGATTTCAGCAAACCACCCTCACCGGCCATCAAGAAGAATATCTGA

Protein sequence:

>DPOGS207584-PA
MEMCIVKWNNKEVLELLHKEKISSFIIEICKTQDIDGQCLLSFVDRDFYDYPFDQLKLGERKRFILLVKKLQRNNRSAMYELGLYDDHTSNNPATNINFVGTNLSHLSYNLHNQLQNELYARNTECISNFTPDVKASKLKPEVWKTAIALGYVFLVTWVTAVVMVIVHDKVPDMKKYPPLPDLFLDNVPHIPWAFDMCEITGSFLMAIWLVVLFFHKHRFIILRRFFALAGTVFLLRCFTMLITSLSVPGSHLKCEPRFYPPADDLTVWGRRLRQAYDIWSGAGMSVRGVRTCGDYMFSGHTVALTLLNFFITEYTSRSLYLLHILTWVMNMFGIFFILAAHEHYSIDVFIAFYITSRLFLYYHTLSNNQALMQSDSSRTRIWFPLLSFFESEVDGIVPNEYEGPMEIVINLKQWLVQLVLDVKASSVARVAGSKLQEGAAMGEYSMVRLVDGIKRNISIMEEYKNSRQRLATLDKNVQAGHDLPEADLRHGKIAELTNDALFRDFSKPPSPAIKKNI-