Monarch geneset OGS2.0

DPOGS214052
TranscriptDPOGS214052-TA2739 bp
ProteinDPOGS214052-PA912 aa
Genomic positionDPSCF300171 - 304005-311976
RNAseq coverage51x (Rank: top 70%)
Annotation
HeliconiusHMEL0128656e-13576.47% 
BombyxBGIBMGA010385-TA0.063.05% 
DrosophilaCG33090-PB0.043.32% 
EBI UniRef50UniRef50_B4MZB30.044.01%Non-lysosomal glucosylceramidase n=9 Tax=Diptera RepID=B4MZB3_DROWI
NCBI RefSeqXP_319575.40.046.10%AGAP008830-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582994370.046.10%AGAP008830-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582994370.045.84%AGAP008830-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00066653.4e-139sphingolipid metabolic process
GO:00160213.4e-139integral to membrane
GO:00043483.4e-139glucosylceramidase activity
GO:00038243.5e-20catalytic activity
KEGG pathway 
InterPro domain[415-887] IPR0067753.4e-139Glucosylceramidase
[638-890] IPR0089283.5e-20Six-hairpin glycosidase-like
Orthology groupMCL14549 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214052-TA
ATGCCCCTCACAGCAAGATATTTTAAATACTACCTGGAAAGAAAATTTCAAAAAAGACGGCCCATAATGGATTATATTAATATAATCTCAGCTCAACGTATGTATGGTTGTCCTATTGGAGGCATTGGGGGTGGCACAATCGGCAGGGGATTTAAAGGGGAATTCTGTAGATTCCAACTGTATCCTGGCTTGTATGAATATGTCACAGTGCCTGAATGCCAATTCATTGTCAATATTAGGGATGCCAAGAAGGAAACTATTTTTCAATCTGTCCTATCCACTTACAGCAAACCAAAGAAAGTACTTCCTTCATGGGAATGGAATATAAAGGGAGCGGATTGTGAATACACGGCCTTATATCCAAGAGCATGGACTACCTATGATTTAACTAAGTATGGCATCAAATTGATATGTCGACAGATTTCACCCGTAATACCTCACAATTATAAGGACAGCAGCTTGCCCTGTGCTGTGTTTGTATTCTCCGCTAAAAATATCAGCAATGAACAGAGAGATGTCAGTATAACTTTCACATGGACAGAATGTTTGGGTGAATCGAAAAAGAAAGGTGGATGTAAATTGGACTTGGAAAGATATTCAGCGGAAAATACCTGCGGTGTTACATTAGAGCAGAAGATAGCGGACACGCCATGTACATTCACCATAGCTAAGAATAAACCGGACAACGAGACCATACACGAGGGGTATTGTCTATGGAACTCGACCAAAACATCCGGCTACGTGTGGGAGTGCCTTAAGAAACATGGCAGGCTTGAACCGGATCCGAGCCAAACACCGCCGCCGCCGAAAGTCGACAAGACTGACAAAGATAACAAGGAGGACAAAGAAAAAGCAAAGAAAATATACAAGAATGATTCCATAGCATTATCGAGCACTATATCCCTGGACCCGAGAGGCAGCGGTTCCACTGAATTCTGTCTCGTATGGGACATGCCTGTCATTAAATACAAAAAAGATAGCAAAGTCCATAAAAGGTATTATACTAAATATTTTGGTAGCGATGGCATAGCTGGTGCGAGTATAGCATCGTACGCGCTCAGGAGTTACAGGAGGTGGGAGAAGCAACTCGCTGAGTGGCAGGATCCCATATTAAATAATAGTTGTATACCAGACTGGTTGAAAAGTGCCTTAATGAATGAGCTTTATTTTGTTGCTGACGGTGGTACTATATGGTTCGACGTCGCCGAAGATTATCCAGATACCGATCCGAGACATGAATTCGGCGTTTTCGGTTACCTGGAGGGTCACGAGTATAGAATGTACAACACATACGACGTACATTTCTACGCCTCCTTCGCTTTAGCACAGCTGTGGCCTAATTTACAAGTTGTATTACAATACATGTTCAGAGAATCGATATCCGTTGAAATAAACAAACTGCGACCGTCGCTGTACGACGGGAAAAGTTGCAAGTTTAAGACTAAAGACTCGATACCACACGACTTGGGCGAACCAGATGGCGTGCCATTCTCTCATATTAACGCGTACAACATCCACGACGTGTCGGAGTGGCGCGACCTCAACCTGAAGTTCATACTGCAGGTGATGAGGGACTATCGCCTGCTCCGAGGCCACAACGCGCCCTTCAACAACGAGAGGTACCACTCGATGGCGTACATAGACGATATCAGAGACGGTACCGAGGACGATCTGTATTCAAGATCCACCGGCCAGGTCGACGACCCGACGCCCGTCCACTCGCCGGATCTGTCGACCAACCTGTCCACCGACATATCTACTAACCTGTCGAGCGACGCGTCCAACGATCCGTCCGCGTCGGAGAGGCCGAGCGGCATGGCTAATATATTGGAACTGCTGTATCGCAGCACGGAAGTGGTAGAGGATTGTGGGGAGAAATTACAACCGCAGGAATATGCGGCGGTTGTGTGGAACTCTAAGCAGTATCTGTCCGACATGTACCCGTCCTGTGTGACGCTGCTGAGGCGCGGCCTGGACTGGGACAGAGACGGAGACGGCCTCATAGAGAACGGCGGCTTCCCAGACCAGACCTACGACGCTTGGGTGATGACCGGCCCCAGTGCGTATTGCGGCGGGCTGTGGGTAGCGTCGGTGAGTGCGGTTCATGCGATGGCAAAAATACTCGGCTTTACGGACGATGAGAAGGAATTCAGCACATTGTTGGAGAAAGCAAGGGATTCGTACGAGAGGAAGTTATGGAACGGGTCGTATTACAAGTTTGATACAAAGCCCTGTAACAGCGAGGTCGTGATGGCCGATCAGCTCGCTGGCCAATGGTTCCTAAGAGCATCTGGTTGGACGGAGCCTGTGTTTCCTGAGGCGAACGTGAAGAAGGCTCTCCACACTATATACGAGAACAATGTTCAGAGGTTTCTAAATGGACGCATGGGTGCCGTGAACGGGTTCGTTAGGGGTCCGCGACCAGGGATCGATACTACCGCCATACAGAGCGAGGAGGTGTGGACGGGGGTCACCTACGGCCTGGCAGCACTCATGATATACGAAGGTATGCACGAACAGGCCTTCTCAACCGCGGGCGGTTTATACAACACCCTCATGAAGATGGGCCTCGCGTTTGAAACCCCTGAGGCGCTCTACGAAAACGGGAACCATCGCTCCGTGGCCTACATGAGACCTCTCTCTATATGGTCGATGTATCACGCGATCATAACAAAACCACCCCAACACGTGTCAACTAGTAATGGCCAACCTCATCAAGCAAAGGAAAATCATTTATAG

Protein sequence:

>DPOGS214052-PA
MPLTARYFKYYLERKFQKRRPIMDYINIISAQRMYGCPIGGIGGGTIGRGFKGEFCRFQLYPGLYEYVTVPECQFIVNIRDAKKETIFQSVLSTYSKPKKVLPSWEWNIKGADCEYTALYPRAWTTYDLTKYGIKLICRQISPVIPHNYKDSSLPCAVFVFSAKNISNEQRDVSITFTWTECLGESKKKGGCKLDLERYSAENTCGVTLEQKIADTPCTFTIAKNKPDNETIHEGYCLWNSTKTSGYVWECLKKHGRLEPDPSQTPPPPKVDKTDKDNKEDKEKAKKIYKNDSIALSSTISLDPRGSGSTEFCLVWDMPVIKYKKDSKVHKRYYTKYFGSDGIAGASIASYALRSYRRWEKQLAEWQDPILNNSCIPDWLKSALMNELYFVADGGTIWFDVAEDYPDTDPRHEFGVFGYLEGHEYRMYNTYDVHFYASFALAQLWPNLQVVLQYMFRESISVEINKLRPSLYDGKSCKFKTKDSIPHDLGEPDGVPFSHINAYNIHDVSEWRDLNLKFILQVMRDYRLLRGHNAPFNNERYHSMAYIDDIRDGTEDDLYSRSTGQVDDPTPVHSPDLSTNLSTDISTNLSSDASNDPSASERPSGMANILELLYRSTEVVEDCGEKLQPQEYAAVVWNSKQYLSDMYPSCVTLLRRGLDWDRDGDGLIENGGFPDQTYDAWVMTGPSAYCGGLWVASVSAVHAMAKILGFTDDEKEFSTLLEKARDSYERKLWNGSYYKFDTKPCNSEVVMADQLAGQWFLRASGWTEPVFPEANVKKALHTIYENNVQRFLNGRMGAVNGFVRGPRPGIDTTAIQSEEVWTGVTYGLAALMIYEGMHEQAFSTAGGLYNTLMKMGLAFETPEALYENGNHRSVAYMRPLSIWSMYHAIITKPPQHVSTSNGQPHQAKENHL-