Monarch geneset OGS2.0

DPOGS209802
TranscriptDPOGS209802-TA1350 bp
ProteinDPOGS209802-PA449 aa
Genomic positionDPSCF300117 - 427099-437481
RNAseq coverage25x (Rank: top 77%)
Annotation
HeliconiusHMEL0060211e-8851.00% 
BombyxBGIBMGA008045-TA1e-13758.95% 
DrosophilaCG32052-PB4e-9043.54% 
EBI UniRef50UniRef50_Q172E76e-9140.21%Sphingomyelin phosphodiesterase n=6 Tax=Diptera RepID=Q172E7_AEDAE
NCBI RefSeqXP_969606.12e-9646.92%PREDICTED: similar to AGAP005806-PA [Tribolium castaneum]
NCBI nr blastpgi|910918764e-9546.92%PREDICTED: similar to AGAP005806-PA [Tribolium castaneum]
NCBI nr blastxgi|910918761e-9747.62%PREDICTED: similar to AGAP005806-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL10824 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209802-TA
ATGTACACTGTGATGTTTAAAGATATGGTTAGGGGTTGGTTGGTTACTTCTGGCACATTACTGACTTTCATTACGACCCGTTATACACAGCACAGGGCGACACTAGAAGACGTAGATTGTCGAAGAGCTGACGAGCGAGGGAGCTCGGGCCACCACAGAGCCCTCGGTAGACTTGGGGATTACTCCTGCGATAGTTCTTTGGAACTGATACAATCTGCCTTAAGATATATGCGGACACGACATTCAGAAAATGTTGAGTTTGTTTTATGGACCGGGGACATCGTAGCAGTACAATACTCTGACAATGAAGACAATAGGTACCAGGCGATCAGAAATATAACCGAGCTTCTAAGGATGACATTCAGTTCTCATTTCGTTTTTCCTGTACTTGGTCATACGGATCCAGCGCCTTCTGAGAGACTTACAAATTTATGGAGCCATTGGCTACCGCTGGAAGCGCTGCAAACATTAAAAATGTATGGGTATTACACAATAGAACAGTCACACAGTAAGTTACGTATCGTGGCGTTAAACACAAACCTATTCAGCCACAGTCAAGCGAACAGTGTTCAGGCTAAGCGGCAGTGGGAATGGTTGGATGCTGTTCTTGATAAGGCAACGGCTAATAGTGAGATGGTTTATATAGTGGGACATTCAGCCCCCGGTTCAGGATCTCGTTATAACGCCTACTCGGTTGATGCAAACGTTAAATTCCTGAACACGATCAGACGACACGCTGGTATTATAGCGGGACAGTTCTTTGGGCACCTACACGTGGATACGTTTCGAGTAATATATGATAAAGAACTGCCGGTATCCTGGGCCTTCCTGGCGCCATCTCTTAGCCCTCATCACGATCCCGCCGGTTCCTCAAACCCAGGATTAAGACTTTACAAATTTGACTCAGATACAGGAAAGGTTTTAGATTACACTCAGTTTTACTTGGATTTGGCCGTGGCGAATCGAGTTGGGGACAGTGGTGCAACGGTTGTGGGAGGTGACTGGGTGGCTGAGTACAATTTGACACAATACTACGCCATTAGAGACGTCTCTGCGGAATCCTTGCATCACTTGGCTGACAAGCTAAGGATTGGCACGGCTCACGAAACTACCATGTTCAATAAATATTTACGCTCTTATAACGTTAAACGGGATAACTCTGACAACTGCGATGGGGCGTGTGCACATCAACACTATTGTGCTATAACCTGTTTGGAGCACATCGCGTATAGGCAATGCGTCGAGGCAGCAGCCAGCGCTCTCGCAGCTTCTGGACGATCCGCTCCGCTCGTCGTTCCGCTCATTAGTCTGATCCTCACCCTTATTTTAGTATGTATAGCTGTGCTTTAA

Protein sequence:

>DPOGS209802-PA
MYTVMFKDMVRGWLVTSGTLLTFITTRYTQHRATLEDVDCRRADERGSSGHHRALGRLGDYSCDSSLELIQSALRYMRTRHSENVEFVLWTGDIVAVQYSDNEDNRYQAIRNITELLRMTFSSHFVFPVLGHTDPAPSERLTNLWSHWLPLEALQTLKMYGYYTIEQSHSKLRIVALNTNLFSHSQANSVQAKRQWEWLDAVLDKATANSEMVYIVGHSAPGSGSRYNAYSVDANVKFLNTIRRHAGIIAGQFFGHLHVDTFRVIYDKELPVSWAFLAPSLSPHHDPAGSSNPGLRLYKFDSDTGKVLDYTQFYLDLAVANRVGDSGATVVGGDWVAEYNLTQYYAIRDVSAESLHHLADKLRIGTAHETTMFNKYLRSYNVKRDNSDNCDGACAHQHYCAITCLEHIAYRQCVEAAASALAASGRSAPLVVPLISLILTLILVCIAVL-