Monarch geneset OGS2.0

DPOGS210647
TranscriptDPOGS210647-TA2064 bp
ProteinDPOGS210647-PA687 aa
Genomic positionDPSCF300401 - 34023-43096
RNAseq coverage741x (Rank: top 17%)
Annotation
HeliconiusHMEL0107940.066.61% 
BombyxBGIBMGA001646-TA0.061.90% 
DrosophilaCG8552-PA6e-15044.87% 
EBI UniRef50UniRef50_C5I9W80.062.10%Triglyceride lipase n=2 Tax=Obtectomera RepID=C5I9W8_MANSE
NCBI RefSeqXP_001812899.17e-16550.52%PREDICTED: similar to sec-23 interacting protein P125 [Tribolium castaneum]
NCBI nr blastpgi|2388464080.062.10%triglyceride lipase [Manduca sexta]
NCBI nr blastxgi|2388464080.062.99%triglyceride lipase [Manduca sexta]
Group
Gene OntologyGO:00468721.1e-49metal ion binding
KEGG pathway 
InterPro domain[449-642] IPR0041771.1e-49DDHD
Orthology groupMCL10844 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210647-TA
ATGGCTATATATTCTGATACAGTTAAACCAGTCGCTGTTGAGCCTTCTGATGTACTGCCAATATCCTGGCATTCAAGTCTACATTCCGGTGAGACGGGCGTGGATAGGAGACTGGCTGCAGTTACCCTCGAGAGTATACCCCGACTAAGGAACTTCACCAACGACACCATACTGGACGTACTGTTCTATACCAGCCCAGTCTACTGTCAGACCATAGTTGATACGGTCTGCAGAGAACTGAACAGAATATATGAGTTGTTTAAGTCGAGGAACCCCGAATTTAAGGGCGGCGTCTCATTGGGGGGTCATTCCCTGGGGAGTGTCATTCTATACGACCTGCTCTGCCACCAGACGGCTCAGGAATTGGATATCAACTCCAGTAAGCAGTACGTCCAGGGTTGCGCTGGAACAGGCCAGCCGACTGTCAAATATCCCCGGCTGGTTTTCTATCCGGACGCTCTCTACGCGCTCGGCAGTCCCATAGCTGACCTAAACGAAAATACAGTTGTAGCAACAGACGGGGGTAGATTTGACGTGAACGTAGTTGGACGCCTTAGAATTCCAGTATACTGGTCGGAGAAACCAACGAACGTTATGAGATGTAGCTGGTTTTACAAAGGGAACATAGACGCGAGATACGTGCCGTATGCCGAGTCGGTGGCTGAGAAATTAGAGGAGGAATACCACCACGGTGTAACGACGGGGGAGTGGCACAGGCGTCTGATGCTGCCGAACAACGAAATGGTGGTGATGCACGGTCCCGCGGTCATGGTACACTTCCTGCAGACCAGCCCCACAGACTCGTTCGGCTCGACACCGCTGTTTACCTTTGGTAAGCGAGTGTACAGGAACCCTCCAAAAATTAAGGTAGCCGGCGATCAGCTCAACAAAATAGACCATCTCATCTTCCTCGTACACGGCGCTGGTGAATACAGAAACCAGAAGGAGAAAATTGTACTGCCAATATCCTGGCATTCAAGTCTACATTCCGGTGAGACGGGCGTGGATAGGAGACTGGCTGCAGTTACCCTCGAGAGTATACCCCGACTAAGGAACTTCACCAACGACACCATACTGGACGTACTGTTCTATACCAGCCCAGTCTACTGTCAGACCATAGTTGATACGGTCTGCAGAGAACTGAACAGAATATATGAGTTGTTTAAGTCGAGGAACCCCGAATTTAAGGGCGGCGTCTCATTGGGGGGTCATTCCCTGGGGAGTGTCATTCTATACGACCTGCTCTGCCACCAGACGGCTCAGGAATTGGATATCAACTCCAGTAAGCAGTACGTCCAGGGTTGCGCTGGAACAGGCCAGCCGACTGTCAAATATCCCCGGCTGGTTTTCTATCCGGACGCTCTCTACGCGCTCGGCAGTCCCATAGCTATATTCGAATGCATAAGGGGCGTGGAGACCCTGGGCAAGGACTTCTGTCTGCCGACCTGCAAGAATTTCTTCAACATCTTCCATCCCTACGACCCGATCGCTTACAGGTTAGAGCCGATGATAAATCCGTCACTAAGGAATGTTAAGCCTTATCTGATACCACATCACAAGGGAAGAAAGAGAATGCATTTAGAGTTGAAGGACACTATGGCGAGGGTTGGAGCAGACATCAAACAGAAGCTGATAGAGTCGATACGGAACACGTGGACCAGTATGTGGAGGAAGGAGGCGGGCCAAGCGGAGGTTGGGCAAGAGGAGCCGGAGAAGGAGGAGCTGACGTCAGACGTCAAGGAGGAGGACGACCGCTATGAGGCTACTCCTGAGGAGTTGGGGAAGCTGAACGGCGGCCGGCGCGTGGACCACGTGCTGCAGGAGGCGCCCTTCGAGATATTCAATGAGTATCTGTTCGCTATGACCAGTCATGTGTGCTATTGGGAATCCGAGGACACGATGTTGGTGATGCTGCGTGAGATATACAACGCCCTGGGCGTCACGCCCGACTGCAGCCTGCCGCAGAACACCATGACCGTGGAGAGGAGCCGCGTCACGGAAGCTCAGGACGCCAAGAGCGACATGTCTCCAGAATTCCCATCAACAAGCCGCGGTTCAATGTGA

Protein sequence:

>DPOGS210647-PA
MAIYSDTVKPVAVEPSDVLPISWHSSLHSGETGVDRRLAAVTLESIPRLRNFTNDTILDVLFYTSPVYCQTIVDTVCRELNRIYELFKSRNPEFKGGVSLGGHSLGSVILYDLLCHQTAQELDINSSKQYVQGCAGTGQPTVKYPRLVFYPDALYALGSPIADLNENTVVATDGGRFDVNVVGRLRIPVYWSEKPTNVMRCSWFYKGNIDARYVPYAESVAEKLEEEYHHGVTTGEWHRRLMLPNNEMVVMHGPAVMVHFLQTSPTDSFGSTPLFTFGKRVYRNPPKIKVAGDQLNKIDHLIFLVHGAGEYRNQKEKIVLPISWHSSLHSGETGVDRRLAAVTLESIPRLRNFTNDTILDVLFYTSPVYCQTIVDTVCRELNRIYELFKSRNPEFKGGVSLGGHSLGSVILYDLLCHQTAQELDINSSKQYVQGCAGTGQPTVKYPRLVFYPDALYALGSPIAIFECIRGVETLGKDFCLPTCKNFFNIFHPYDPIAYRLEPMINPSLRNVKPYLIPHHKGRKRMHLELKDTMARVGADIKQKLIESIRNTWTSMWRKEAGQAEVGQEEPEKEELTSDVKEEDDRYEATPEELGKLNGGRRVDHVLQEAPFEIFNEYLFAMTSHVCYWESEDTMLVMLREIYNALGVTPDCSLPQNTMTVERSRVTEAQDAKSDMSPEFPSTSRGSM-