Monarch geneset OGS2.0

DPOGS206815
TranscriptDPOGS206815-TA1791 bp
ProteinDPOGS206815-PA596 aa
Genomic positionDPSCF300001 - 3775094-3782040
RNAseq coverage513x (Rank: top 24%)
Annotation
HeliconiusHMEL0100096e-7248.06% 
BombyxBGIBMGA013102-TA9e-10939.43% 
DrosophilaCG9447-PB2e-3723.79% 
EBI UniRef50UniRef50_Q7PTG87e-4324.81%AGAP007079-PA n=3 Tax=Culicidae RepID=Q7PTG8_ANOGA
NCBI RefSeqXP_001661710.19e-4726.50%hypothetical protein AaeL_AAEL011513 [Aedes aegypti]
NCBI nr blastpgi|1571295292e-4526.50%hypothetical protein AaeL_AAEL011513 [Aedes aegypti]
NCBI nr blastxgi|1571295293e-5126.39%hypothetical protein AaeL_AAEL011513 [Aedes aegypti]
Group
Gene OntologyGO:00167472.7e-10transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathwaydme:Dmel_CG333375e-13 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain[194-577] IPR0026562.7e-10Acyltransferase 3
Orthology groupMCL34680 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206815-TA
ATGGCCGTGGGGGCGGTTGCTCTTGAACTCACCGAGGAGCAGTATTATAAATTGCCGCAGCTTTTCGAATTGGACAACTACGAGCAGTGCTTGTCTCAACATGGGACCTTCTGCCTCGGTTCCTTCCACCTACATTCCTCTCGATCCAATCAGCTGATTCGCTTGCTCCAGGAAATATCAGCTGAGAAACATCGATTCAACCACACGGTTATCCACCGAGGTTATTGTCTCAAAAGCACGTGTGCACATATTCTCGGCGAGTCTCATCCGCAAATCTTCAAAGATTGTGTTAATAATATAACAAAACTCAAATATGACCTTGGGGCTGACCTTATTCACCTGGAGTACTGTAAATCCGAGGAGATATCGAAACCTATACATACTTTGGATATTATAATTGCTTCTGTTCTTACGCTAATATTATTAGCAAATATCATTGGCACAGCCTATGACTTCTTCCGGGACTCGAACAGCAAACCTAACCGTTATCTAATGACATTCTCATTATTTGCAAACTGGAACAAGCTGACAGCCACTTACCAGAAGGAAGACCAGAGACTCTCTGCACTAAACCCTATACATGGAATGAAGGTGTTGACTCTAATGGCAGTTGTGTTAGCGCATTCTATAATGGCTTATCATATGACATATCTTTATAATCCTTCGTTTTTTGAAAAGGCCAATCTCCATCCACTATCGGCGATATTCAACAACGGCACGGCGGCTGTACAAACATTCATTCTCTTCTCAACATTCCTACTGGCTTACAACTTGTTGCTCTTACTAGAACGTGAGAAGGAAAAGAAACTAAGTTTTAGTTTCTGGTGCAAGATAATTCTTCATCGGATAATCAGTCCAATCTATCTAGTAGTGTTGGGTATAACAGCTACATGGCGTTTCCATTTCGGCAACGGTCCGCTGTGGTGGTTAGCAGAAAACGAGGGAGAGAAGTGTCGACGCTCTGGCTGGACTAATGCACTGTACATTAATAACTTTTTGCGGTTTGATGACTCATGCCTCATACAAAGCTGGTTCTTGGCAGTAGATATGCAGCTCTATGTAATATCATCACTGTTACTACTATTTCTGGCACGGAGGCCGCGAACAGCTATCACTGTTTTGGGAGGACTCTTCGTTATATCTGTTATTGGCAACTTTTTAGCGGCTTATTACTTGGATCTCAAGACACTCGTTTACATAGCTCATCCTGAATACATCCGTATTCAATATAGCGGAGTCATTTCTTTCTGGCGGCACTACGCTGCACCGAGCTCGTCAGCACCGGCAGCACTGCTGGGTCTGTTGCTGGCTTTCCTCTATCACCTTCTGAAGGAGAATGGGTTCGACGCCCGTAACAGCACTACTCTGCACATTCTCTATCGTCTGTCGGTTCCTTCCATGCTGGCATGGATCCTAAGCGGCCACTTTCTCAAGGATGCCACATCACCAGTTATGGTTTCCCTGTACACAGCCCTTGATCGACCAGTGTTCACAATACTAGTTACTGTGGCTAGCATTGGTTTTTTCTTCAAAATAGATAAAATATGGTGGCAGTTTCTTTCATGGCGTGGTTGGCATCCTCTGTCCCGCATGTCACTGTGTGTACTTCTGACACACTGGGACCTAAGTCTGGCGCTAATCGCACTACGCACCACCCTATCACAGGCTTCTATACTAGAGATTGGTTGTCACTGGCTGGGTTCGTTATTCCTCACATATTGTTTCTCATTGCCTCTACATCTCATGGTTGAACTTCCAATGCAAAAATTCTTGCAGGCTGTATTTTTATGA

Protein sequence:

>DPOGS206815-PA
MAVGAVALELTEEQYYKLPQLFELDNYEQCLSQHGTFCLGSFHLHSSRSNQLIRLLQEISAEKHRFNHTVIHRGYCLKSTCAHILGESHPQIFKDCVNNITKLKYDLGADLIHLEYCKSEEISKPIHTLDIIIASVLTLILLANIIGTAYDFFRDSNSKPNRYLMTFSLFANWNKLTATYQKEDQRLSALNPIHGMKVLTLMAVVLAHSIMAYHMTYLYNPSFFEKANLHPLSAIFNNGTAAVQTFILFSTFLLAYNLLLLLEREKEKKLSFSFWCKIILHRIISPIYLVVLGITATWRFHFGNGPLWWLAENEGEKCRRSGWTNALYINNFLRFDDSCLIQSWFLAVDMQLYVISSLLLLFLARRPRTAITVLGGLFVISVIGNFLAAYYLDLKTLVYIAHPEYIRIQYSGVISFWRHYAAPSSSAPAALLGLLLAFLYHLLKENGFDARNSTTLHILYRLSVPSMLAWILSGHFLKDATSPVMVSLYTALDRPVFTILVTVASIGFFFKIDKIWWQFLSWRGWHPLSRMSLCVLLTHWDLSLALIALRTTLSQASILEIGCHWLGSLFLTYCFSLPLHLMVELPMQKFLQAVFL-