Monarch geneset OGS2.0

DPOGS202754
Transcript: DPOGS202754-TA, 3777 bp
Protein: DPOGS202754-PA, 1258 aa
Genomic position: DPSCF300335 + 79942-102019
RNAseq coverage: 98x (Rank: top 61%)
Annotation
Heliconius: HMEL017989, 4e-160, 43.10%
Bombyx: BGIBMGA010531-TA, 4e-91, 46.81%
Drosophila: CG14204-PA, 2e-68, 30.95%
EBI UniRef50: UniRef50_C3XYM8, 8e-92, 25.47%, Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3XYM8_BRAFL
NCBI RefSeq: XP_001651084.1, 7e-74, 33.21%, hypothetical protein AaeL_AAEL005543 [Aedes aegypti]
NCBI nr blastp: gi|157110394, 1e-72, 33.21%, hypothetical protein AaeL_AAEL005543 [Aedes aegypti]
NCBI nr blastx: gi|157110394, 1e-75, 34.10%, hypothetical protein AaeL_AAEL005543 [Aedes aegypti]
Group
Gene Ontology: GO:0016747, 3.4e-23, transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathway: dme:Dmel_CG33337, 2e-52
    K00680 (E2.3.1.-) maps to:
        Benzoate degradation via CoA ligation
        Limonene and pinene degradation
        Ethylbenzene degradation
        Tyrosine metabolism
        1- and 2-Methylnaphthalene degradation
InterPro domain: [158-542] IPR002656, 3.4e-23, Acyltransferase 3
Orthology group: MCL10225, Patchy

Nucleotide sequence:

>DPOGS202754-TA
ATGGATTTGGATGAATGTGTTAAGTCCAAATTAAGGGAATATGAGCATATGAAAATCACAGGTTTTAAATTATTAGACGGAATGGACACTTATATTGGTGACCGAAAAGTAGGAGATGATAATCCACTCTCAGATATAAATTTTCGTTTAGCTTTATGTTTACCTAAATCGTGTACCACAAAAGAAGCTCTTAATGCCTTCCTTTTTGATATTGCTGATAGAGGCTTTATATACAAAGATGACTTTTGTCGCTTTGCCAATGATAAACAGTGGCTTGTTGGTGATACAGTAGCAGTTGTTATTTTATCCTTTTTGGGATTTATCGTACTTATTTGCACAAGTTATGACATTTGGTATAACATCATACTTAAGAAAGATTCCAAAAACTACAACTTAGTGCTTGGTTCTTTCTCAGCATACACTAATACAAGATATGTTTTAACTTTTAATTCCACTTCTAATTCGTTACTTTGTTTGGATGGAATCAGAGCGTTATCCATGATTTGGATTATTTTTTCCCATTCATTTTCAACCCAAATGTTTATTATGAATCCTATTGATAGCCTTAAGTATTTTTTGGTGCAGTGGGTGACGTCTTATAAAGCCATATTAATAGTTGCTGGTACAATTTCAGTAGACACTTTCTTTATGCTAAGTGGATTGTTAATAGTGTATACGGCAGCTAGGAAGTTAAGCGCCCAACTCCTGAAAAATCTTCACTGGTTTTATTTTAATCGTCTCCTTCGAATGTTTCCTATTCTTGCGCTTTGTGTGCTCTTAGATGCTACACTTTTCAATCACGTTGCTGATGGACCGTTTTGGGGGCAGGTGGCTGGAAATGCTGATAGGTGTAAATCATTTTGGTGGACTGCACTCTTACATATACAGAACTATCTAAATTGCTACAATTTATGTTTAGGTCATTCTTGGTATTTATCAGTGGATGTTCAGTTACATATTTTATCACCCCTCATATTATTTTGGGTGTTAAATAAAGAAAGAACAGTAGCATGGTCAGCTCTTATATTTGCAATGCTCGGAATACTAATTGCTGCAACAGCATATAATTTTATGAAGGAACTTCCTTCTAGCACATTTATGCCCACGAGAACACATGAAGCTATGAACTATGTAGTTTTTTACTATACAAATACACTGACTAGAGCTCCTCCTTTTATAGTTGGTATGATATTTGGGTACTTGTTGCATACTCTCAAAGGAAGACAAATAAAGATTCATTCGGTTGTAAACGTATTCCTTTGGCTGTGTTCTTTGTGCGGTCTTGGTGTTATTCTTTATTCTGTTAACCCTACTCTGCAATTGGATTACTCCAATCAGATTGCTGACAGTTGTATAAATTCTTTCATGAGACCGATGTGGGCTTTAGGCTTAGGGTGGATTATTTTTGCCTGTGTTCATGACTACGGAGGTCCAATAAATTGGTTTCTGTCTTTGCGTATATGGAAGCTACCAGCACGACTGTCATACGCATTGTATCTATTTCACTACTCACTTATGGTTGTCATTAATTCCACAAAGTTACAACCAACTTACTTTACGGTTGAAACAATCGTATTTGAGTTTATAAGTTTCTTTTCTCTATCTCTAATTGTTGCTTTCGCCGCAACAGTGTTAGTAGATGAGCCATTTGCAAATTTGACTAAATTATTTTTGGTTCTAGGTGAACCCGCCAAAAAAGAAAGAATATTGGAAAATACCCAAGAATCAAGGGGTGATTATTTGGATGCTCTGGATCGTGATTTACACCAACGTGTTTTAGATCCAGAACAATGCCAAAATCAATTGCAATACATCCGTAGATCTGATCCATTTCTTTCTGCACAGTTTGTGGATGCCGGTTTAAGAATTCCGAAAGGAATTTTACAAGGAAATCTTAAAGATTATGGGAACTATCACCAATGCCTTGGTATACACCAAGATGTATCAGAGGATATGCAAATACAAGGAAAATTTTGTTTGATATCCGTACCAGTTAATTT
AAATTCAAGAAAAAATCCACAATCGACATTAGAATTTGATCCAAGCCTATTACAATTACCTCTACAAACTAAAAGGAAGCTTGAAGAAAGAAATATGTATCTGGACAAAATGCGTTCCATATTCGGCAATACAATAGAATATCAGAGGATGGATCCAAATAATACACTGCCTGATATTGATTTTCAACTTGGTATATGTATACCAAAAGTATGTACAACAGAAGAAGCGCTATTATTTAACTTATTTAATTACACATTAGAACATTCTAATATGTACTGTCGCCTTCCAAACGATAAACACTGGGTTACTGCCGATTATATTGCTATAGTTATCTTTTCTCTACTTGGTTTGTTGATTATATTGAGTACTAGCTACGATGTATATTACACAGTTTTCTCTAAAAAAGATCCCAAAAATATGAACGTTCTCCTAAGTGCATTTTCGGCGTACACCAATACTAAACGCGTTATATCAACTAAGCCTCAACCAGGTATTATCGAATGTCTGGATGGTATACGATCGCTAGCACTATTTTGGGTGGTGTTTGGACACGTTTTCTTCATATTTAATTTTTTTCCCGCAAATACGATAGAAACGATGGAATGGTCACTATCTTACGATTCAATAATGATACGGACGGCTCTCCTTTCGGTTGACACTTTCTTTTTGATGAGTGGAATTCTTTTAGTGTACACTACTGCTCATAAACTAACTGGATGTCAATTGCTGAAAAATATTGTGCCTTTTTACCTTAACCGTTTGCTTCGTATGTTTCCTGTATTGGCCTTTGTAATTTTGTTCCAAGTGTCTATATATAATCGTATAGGAGACGGTCCTATGTGGACAGTTGTTTTACGCCACGTCCAGAACTGTAGAACAGTATGGTGGTCAACTTTACTACATTTACAAAACTTCCTCAATCCCGAAGAATCTTGCATTCCTGTGACATGGTATCTAGCAATTGATGTGCAATTACATATTCTTTCGCCAATCGTTCTTTTTTGGGTTTTGGGCAGAGAAAGACGACTAGCGTGGTCAGCTCTGTTTCTTGCTCTATTTTTGTCTCTGACTGCTTCTACAGCTTATATTTTTGCAAATACTTTCTCAGATGAAGGTTTGAAATACTTCGTTTATGTTTATGTCAATATTTTAACAAGAGCCCCTCCTTTCTTTGTGGGAATGGTGTTTGGATATGCTCTTCATTTGGCACGAGAAAAAAAAAAGAAAATGCCGATGCATTTACATGTCGCTTTAACTGCATTTTCTATTTCACTCTTGGCCCTTATCCTATACTGCCATGATAAGCGTGATGATCCAAACTTTGATAACCAAATTGTTAAAGATCTTTTTCAATCATTTTTACGACCATTTTGGGCTGCCGGTCTTGGTTGGATTATTTATGCTTGCTATAATGGCTATGCAGGCCCAATCAACTGGTTACTTAGCTTGAACCTTTGGAAAATCCCATCTCGGATATCTTATGCAATGTACCTGTCACACTTGTCACTAATCTTGGTGATTTACTCAAATGCTTTACAGCCACTGTACTTTTCAGTTCAAAGGGTGATGTTTGATGCCATGGGATTCATAGCAATAGCACTGGTTACTTCATTTTTGATCACAATATTTGTTGACTTACCGTTTTCAAACCTGATTAAGCTATGCCTCTCATTGATTACAAGAAAGAGGACAAACCAAAATGGCGTTAGTAATGTACAAATTACAAATGGCAATTTACCGACCCATCAAATTCCAGAAAAGAAACAAGACTCATAA

Protein sequence:

>DPOGS202754-PA
MDLDECVKSKLREYEHMKITGFKLLDGMDTYIGDRKVGDDNPLSDINFRLALCLPKSCTTKEALNAFLFDIADRGFIYKDDFCRFANDKQWLVGDTVAVVILSFLGFIVLICTSYDIWYNIILKKDSKNYNLVLGSFSAYTNTRYVLTFNSTSNSLLCLDGIRALSMIWIIFSHSFSTQMFIMNPIDSLKYFLVQWVTSYKAILIVAGTISVDTFFMLSGLLIVYTAARKLSAQLLKNLHWFYFNRLLRMFPILALCVLLDATLFNHVADGPFWGQVAGNADRCKSFWWTALLHIQNYLNCYNLCLGHSWYLSVDVQLHILSPLILFWVLNKERTVAWSALIFAMLGILIAATAYNFMKELPSSTFMPTRTHEAMNYVVFYYTNTLTRAPPFIVGMIFGYLLHTLKGRQIKIHSVVNVFLWLCSLCGLGVILYSVNPTLQLDYSNQIADSCINSFMRPMWALGLGWIIFACVHDYGGPINWFLSLRIWKLPARLSYALYLFHYSLMVVINSTKLQPTYFTVETIVFEFISFFSLSLIVAFAATVLVDEPFANLTKLFLVLGEPAKKERILENTQESRGDYLDALDRDLHQRVLDPEQCQNQLQYIRRSDPFLSAQFVDAGLRIPKGILQGNLKDYGNYHQCLGIHQDVSEDMQIQGKFCLISVPVNLNSRKNPQSTLEFDPSLLQLPLQTKRKLEERNMYLDKMRSIFGNTIEYQRMDPNNTLPDIDFQLGICIPKVCTTEEALLFNLFNYTLEHSNMYCRLPNDKHWVTADYIAIVIFSLLGLLIILSTSYDVYYTVFSKKDPKNMNVLLSAFSAYTNTKRVISTKPQPGIIECLDGIRSLALFWVVFGHVFFIFNFFPANTIETMEWSLSYDSIMIRTALLSVDTFFLMSGILLVYTTAHKLTGCQLLKNIVPFYLNRLLRMFPVLAFVILFQVSIYNRIGDGPMWTVVLRHVQNCRTVWWSTLLHLQNFLNPEESCIPVTWYLAIDVQLHILSPIVLFWVLGRERRLAWSALFLALFLSLTASTAYIFANTFSDEGLKYFVYVYVNILTRAPPFFVGMVFGYALHLAREKKKKMPMHLHVALTAFSISLLALILYCHDKRDDPNFDNQIVKDLFQSFLRPFWAAGLGWIIYACYNGYAGPINWLLSLNLWKIPSRISYAMYLSHLSLILVIYSNALQPLYFSVQRVMFDAMGFIAIALVTSFLITIFVDLPFSNLIKLCLSLITRKRTNQNGVSNVQITNGNLPTHQIPEKKQDS-
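
The two lengths listed for this record agree with each other if the transcript is read as a coding sequence that includes its stop codon. That reading is an assumption, based on the sequence starting with ATG and ending with TAA: 3777 / 3 = 1259 codons, which is 1258 amino acids plus one stop, matching the trailing "-" in the protein record. A minimal Python sketch of that check follows; the helper names are hypothetical and not part of any database tooling.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_bp nucleotides.

    Assumes the sequence is a complete CDS in frame 0 (an assumption about
    this record, not something the record states explicitly).
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon is counted in the nucleotide length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


def read_fasta_sequence(text: str) -> str:
    """Join the sequence lines of a single-record FASTA string, skipping the header."""
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )


# Lengths as listed for DPOGS202754-TA (3777 bp) and DPOGS202754-PA (1258 aa).
print(expected_protein_length(3777) == 1258)
```

Passing the pasted nucleotide FASTA block through `read_fasta_sequence` and taking `len()` of the result gives the value to feed into `expected_protein_length`, which is useful because the sequence is wrapped over more than one line.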