Monarch geneset OGS2.0

DPOGS214218
TranscriptDPOGS214218-TA2307 bp
ProteinDPOGS214218-PA768 aa
Genomic positionDPSCF300014 + 765923-770772
RNAseq coverage603x (Rank: top 21%)
Annotation
HeliconiusHMEL0022500.077.97% 
Bombyx% 
DrosophilaCG2818-PA0.050.31% 
EBI UniRef50UniRef50_D6WL750.051.36%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WL75_TRICA
NCBI RefSeqXP_973461.10.051.79%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910829490.051.79%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910829490.049.86%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00060713.6e-69glycerol metabolic process
GO:00088893.6e-69glycerophosphodiester phosphodiesterase activity
GO:00080811.4e-56phosphoric diester hydrolase activity
GO:00066291.4e-56lipid metabolic process
GO:00302462.7e-17carbohydrate binding
GO:00059751.1e-10carbohydrate metabolic process
GO:00038241.1e-10catalytic activity
KEGG pathway 
InterPro domain[345-628] IPR0041293.6e-69Glycerophosphoryl diester phosphodiesterase
[343-630] IPR0179461.4e-56PLC-like phosphodiesterase, TIM beta/alpha-barrel domain
[30-150] IPR0137842.7e-17Carbohydrate-binding-like fold
[37-107] IPR0137834.7e-12Immunoglobulin-like fold
[39-116] IPR0020441.1e-10Glycoside hydrolase, carbohydrate-binding
Orthology groupMCL15382 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214218-TA
ATGCAACGCTGGTTTTTCTTGAAAGAAACTAAGAAGTTGCCACAAAAAATATCTAAAAAAATGTCGACGAAACAAATTACTCAAGAAAAGGAAATTGAATCACAAGAATGGGTGTTCACTGTGTCTGTGCCAGAAATTGCTCCGAGGGAAAAAGTTTATTTAACGGGAAGTACAACTGAACTTGGCGAATGGGATTACAAAAAAGCCGTCCCTCTCGAAAATGTAGAAAACACTGATATGTGGACTAAAACAGTAATTATACCAAATACTTGTGATGTATACTATCGCTATGCTATTTGTCTTGTTAATATTGACACAAATGATATCATAGTGCGCCGCTGGGAAACTCACATACACCCAAGAGTTATAAAGGAAACAATGCTTCATCCTCATACTGATATATTTGGTAATAATGAAGGGATATACAAGGTTAATAAAGGCTGGCTTACCTTTCAAACATTAGTCCAGTTTAAATTTATGCACAACCCTTTAAAGTTAAAAAGTCGCTTAACTGGCCGCCTCATGAATATTAAAGTAACACCTGTTAAGCTATCTTTTGGAGCTGAGCCACAAATTGAAGAATCCTCCTTAAGTACAGACACAATGGAAGCAGAAGTTCCTTGTGGGGTTTTTGTGGAAGTAGCTACTCTTGAAAATGATCCTATGGTTTGTCACCTACAACCTCAAGAACAATTTGGAAGAGAATACAAGCCGAATGATGTATTACTAGTTAATTTGTTTGCCCCAAATCCTAAAGGCCTAGCATATCTTATAGATTTTTATTCATACAGTACTAGGGCTTCTGTAGAAGATCCCCCTTGTCATGTTGGTTACACTTATGTGTTACCTAACATGTTTAAGCCATCTGAGGGCTCATTAGAATTACCAGTTACTTGTAATGTGAAACATCGGCCATTAGGCACTATTAATTTTGAATACCTTATAATTTCTCCAATGGAAGATGGTTTGTGTAATTTAGAAGTTTCCTATACAAAACACTGGGATCCTTCATGGACTGGATTGGAAGTTGGTCACAGAGGTTTAGGTGCCAGTTTCAAAACCAAAGAGGGAAATGCCATTCGTGAAAATACTATAGCTTCTCTTAAAAAAGCAGCTGCCAGTGGGGCAGACATGCTGGAATTTGATGTGCAACTAAGTAAAGACATGATACCTGTTATTTATCATGACTTTCATGTGTGTATTTCTATGAAAAGAAAAAAGGAAGTTGACTTCACTGAAATGCTTGAATTGCCAGTGAAAGACTTAACCCTGGAGCATCTGCAGAAATTGAAGGTTTATCATTTAGTAGAAGGACGAAATCACGAAATTCTTTTCTTCGATGAAGATCTTGAGGAGCATCAGCCATTTCCAACTCTTGAAGAAGCCCTAAAGAGTTTAGATGAGCATGTTGGCTTCAACATTGAACTTAAATGGACCATGGAACTCAATGATGGCACATTTGAATTGAACAACCCATTTGATATGAACACATATGTAGATAAGGTTTTGGAGGTGGTACTAAAACATGCTGGACAAAGAAGAATTGTACTTTCCTGTTTTAATCCTGATATATGTACAATGGTCCGTTACAAACAAAACAAATATCCTGTAATGTTTCTCACAGTGGGAGTAACAGAGAAATACCAGCCTTATCGCGACCCACGTTGTCTATCCATACCTGCTGCAGTACAGAATGCTATCAGCTCTGACATTCTTGGCATTGTGGTACACACAGAGGATTTACTTAGAGATCCCACACAGGTCAAATTAGCCACCGACGCCGGTCTAGTCATATTCTGTTGGGGCGATGAAAATAATGACAAAAATACTATAAAGAAACTCAAAGAAATGGGTTTGCATGCTGTGATATATGATAAACTGGACCAGTACACCACCAAAGAAGTTAAGGAAAGTATATTCCTGGTGGAGGCGCGCGATTCGCAGCGTGATATCATGAGGTTGGCTGCATTGGACGCGTTGCCAACCGATAGCACGTCGAGCCGTTCCCGCTCCCCCTCTCGCCAGTTGTCGTCGGCTCAATTCTCCGGGGACGGACAGCTCTATCTAGATCTAAAAGCACGCCAAAAAGCGACCCGCTCAACTGTCACGTCGCTGGAATCCCTCGCCTCGTCTATAGACATCCGCGACGAACCCGAACGTAATCTTAAGCGCAACAAGGATCTCATTATGAGCATAGACAAGGAGTCCCAAATTAAAGAGCAGAGGAATTCGTTTAAGGGCATCTTCCCCGCTCCCAATACGACTACGGCGTCGCCGAAAAAATCCCGTAAAAGCGACTTATCTTGA

Protein sequence:

>DPOGS214218-PA
MQRWFFLKETKKLPQKISKKMSTKQITQEKEIESQEWVFTVSVPEIAPREKVYLTGSTTELGEWDYKKAVPLENVENTDMWTKTVIIPNTCDVYYRYAICLVNIDTNDIIVRRWETHIHPRVIKETMLHPHTDIFGNNEGIYKVNKGWLTFQTLVQFKFMHNPLKLKSRLTGRLMNIKVTPVKLSFGAEPQIEESSLSTDTMEAEVPCGVFVEVATLENDPMVCHLQPQEQFGREYKPNDVLLVNLFAPNPKGLAYLIDFYSYSTRASVEDPPCHVGYTYVLPNMFKPSEGSLELPVTCNVKHRPLGTINFEYLIISPMEDGLCNLEVSYTKHWDPSWTGLEVGHRGLGASFKTKEGNAIRENTIASLKKAAASGADMLEFDVQLSKDMIPVIYHDFHVCISMKRKKEVDFTEMLELPVKDLTLEHLQKLKVYHLVEGRNHEILFFDEDLEEHQPFPTLEEALKSLDEHVGFNIELKWTMELNDGTFELNNPFDMNTYVDKVLEVVLKHAGQRRIVLSCFNPDICTMVRYKQNKYPVMFLTVGVTEKYQPYRDPRCLSIPAAVQNAISSDILGIVVHTEDLLRDPTQVKLATDAGLVIFCWGDENNDKNTIKKLKEMGLHAVIYDKLDQYTTKEVKESIFLVEARDSQRDIMRLAALDALPTDSTSSRSRSPSRQLSSAQFSGDGQLYLDLKARQKATRSTVTSLESLASSIDIRDEPERNLKRNKDLIMSIDKESQIKEQRNSFKGIFPAPNTTTASPKKSRKSDLS-