Monarch geneset OGS2.0

DPOGS207137
Transcript        DPOGS207137-TA   2052 bp
Protein           DPOGS207137-PA   683 aa
Genomic position  DPSCF300001 + 3812859-3821138
RNAseq coverage   44x (Rank: top 72%)
Annotation
Source           Hit               E-value  Identity  Description
Heliconius       HMEL017609        4e-140   60.47%
Bombyx           BGIBMGA013102-TA  1e-77    34.93%
Drosophila       CG4576-PA         7e-42    25.50%
EBI UniRef50     UniRef50_Q16FG4   2e-42    24.80%    Putative uncharacterized protein n=5 Tax=Culicinae RepID=Q16FG4_AEDAE
NCBI RefSeq      XP_001649634.1    2e-44    25.51%    hypothetical protein AaeL_AAEL014801 [Aedes aegypti]
NCBI nr blastp   gi|157107125      3e-43    25.51%    hypothetical protein AaeL_AAEL014801 [Aedes aegypti]
NCBI nr blastx   gi|157129529      3e-45    24.46%    hypothetical protein AaeL_AAEL011513 [Aedes aegypti]
Group
Gene Ontology    GO:0016747           7.8e-20  transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathway     dme:Dmel_CG33337     1e-06
                 K00680 (E2.3.1.-) maps-> Benzoate degradation via CoA ligation
                                          Limonene and pinene degradation
                                          Ethylbenzene degradation
                                          Tyrosine metabolism
                                          1- and 2-Methylnaphthalene degradation
InterPro domain  [252-638] IPR002656  7.8e-20  Acyltransferase 3
Orthology group  MCL30984             Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207137-TA
ATGTCGGTGCGCTCGTTCTACTTCGTACTCGCGTGCGCCGCCTGCGCCCACTGTGACGACCAAAACACAGAAGATATAATACTAAATAAAGACATAAAACCGGCGAGTCTGTACGCAGCAAATAGCACGATAAATACTGATGGTGAAGAAAATACTCTAGCAACAAAAAAAGAAGCTGTGGGGACAAATGGAGATATTATATACGATCTCAGTGATGAAGAATACTACGCCATGCCGCGTCTTTTTGACCTTGAGGAGTTCCCAGGATGTCTTGCAACACGCGGAGTTTACTGCCTCGGCAGCTTTGAGCTCACCGCCACCACCGATCATCCTCTATTTCGGACCATGAAGCAATACTCAGCTAATTGGGTGGACAACTTTAACCATACTCGCCTTCACCGCGGTGTTTGTCTGCCGCGATCATGTCAACAACATCAGACTTCTTCCTTGGAGATGTGGTTCGAGACTTGTATAAATGCAAGCACCACATCTTCCTACAACCTATCGGCTAGGTTGTATAAATTGGAGTACTGCACTAGAGGAACTGAATATACACCTCTAAGTGGGAATGAACAGGCGTTCGCTGCTGTGTTAGCGGCTCTCTTAGCCTTTGCGATCATCAGTACTGTACTTGACCTAACACTCTCAGCTCATGTCAAGAAAGGTTGCGGTTGGGCTTTGTCGTGGTCATTGCGTCAATCGTGGCAATCCCTAGTAACACCTGCCCCAGCGACCAGGGAACTGGACCTTCGCTCCTTTGACGGTCTAAGGGTCTTCTGTATGCTATGCGTCATCATTGAACACGTTTGTTGGCTCGGCACTATTTCTTACATTGAAAATACGAGGATATTTGAACAGTTGCGTCATGAGGGTGATGTGATCCTGATGACAAACAGCACACTGGTAGTGCAGATATTCTTCTTGATGGCTAGCTTCCTTCTCGCCCACAAAATATTGGAGCAAAAGAACCATCTACCACCATTCAGCACATTTTTCGATACTATGATTAACAGGATAATTAGGGTAAGTCCCAGTTACTTCATGGTTTTATGGTTCGCGTCGTCGTGGTGGTGGCGTCTGGGACAGGGCCCAATGTGGACGCCACTTGTCACCGCTGAAGCTGACATCTGTCGCAGAAAGTGGTGGACTCACCTTCTATACCTCAACAACGTTCTCATCAAGGATGATAAGTGCCTTATACAGACCTGGTATCTGGCAGCCGACATGCAGTTGTATGCTCTGTCACTGGCATTGACACTACTATTACGAGGTCGACGCTGTGCTGTTCGATTGCTGATTGTACTCTTCGGTGTCGTCTCTGCAACGTTGACGGTACTAGCATATATGTGGAAACTCGTGCCCACCTTCGTCCTACATAATCCAGAGTCTGTTCGTCTGCAGTACAGCGGAGAAGCATCTTTCAACTGGTTGTATCAGTCTCCTTTGGGTAATGCGCCAGGAGCGCTAGCCGGCCTGCTGTTGGCACATGCTCAACGCCGACTGCCACGCTCCGGCCTACTCGAACACCAGTTATTTCGATGGGTGTCGGTGGCGGGGGCACCGGCTGCTCTGTGCTGGGCGGCGCTTTCTCCAATGGCTTTAGGAGCTGGTCCGCCTAACAGACTCGTAGCGGCGTTGCTGGCAGCTTTGGAGCGACCAATCTTTCTACTCTGCGTCACTTTAGCACTCCTCGGAGCCATTCACGGAATCAAGTCTCCATGGCGAACGTGGCTGTCTCACTCCTCGGCTGCGTTAGCTCGTCTGTCTTTCGGAGCGCTGCTACTACACATGCCGCTTAACAAAGCTCTGTTGGGCTCACGGCTCACGCCCACTCAACTCGACAGACAGTATGCGATATATGGGTGGTTTGGTGTGGCGGTGGTGTCTTATGCAGCTGCATTACCTCTAGCTCTGCTGGTAGAGCTGCCAGTGCAACGACTCTACAAAGAATTGACCCTCATATTCCGGAAGAAGCCACCGAGTACACACAATAACAAAAT
ACCATCCTTAGATAAGACAGAGTTTTGTTCTACAGAACATTGCAGTAAATAA

Protein sequence:

>DPOGS207137-PA
MSVRSFYFVLACAACAHCDDQNTEDIILNKDIKPASLYAANSTINTDGEENTLATKKEAVGTNGDIIYDLSDEEYYAMPRLFDLEEFPGCLATRGVYCLGSFELTATTDHPLFRTMKQYSANWVDNFNHTRLHRGVCLPRSCQQHQTSSLEMWFETCINASTTSSYNLSARLYKLEYCTRGTEYTPLSGNEQAFAAVLAALLAFAIISTVLDLTLSAHVKKGCGWALSWSLRQSWQSLVTPAPATRELDLRSFDGLRVFCMLCVIIEHVCWLGTISYIENTRIFEQLRHEGDVILMTNSTLVVQIFFLMASFLLAHKILEQKNHLPPFSTFFDTMINRIIRVSPSYFMVLWFASSWWWRLGQGPMWTPLVTAEADICRRKWWTHLLYLNNVLIKDDKCLIQTWYLAADMQLYALSLALTLLLRGRRCAVRLLIVLFGVVSATLTVLAYMWKLVPTFVLHNPESVRLQYSGEASFNWLYQSPLGNAPGALAGLLLAHAQRRLPRSGLLEHQLFRWVSVAGAPAALCWAALSPMALGAGPPNRLVAALLAALERPIFLLCVTLALLGAIHGIKSPWRTWLSHSSAALARLSFGALLLHMPLNKALLGSRLTPTQLDRQYAIYGWFGVAVVSYAAALPLALLVELPVQRLYKELTLIFRKKPPSTHNNKIPSLDKTEFCSTEHCSK-
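
The two sequences above can be cross-checked against each other and against the lengths given in the record (2052 bp, 683 aa). The following is a minimal Python sketch, not part of the original record. It assumes the transcript is coding sequence only (no UTRs) and that the standard genetic code applies. The helper names `translate` and `expected_protein_length` are hypothetical, and the trailing `-` in the protein sequence is read here as the stop codon.

```python
# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_char="-"):
    """Translate an in-frame coding sequence; stop codons become stop_char.

    Whitespace and line breaks are ignored. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_char)


def expected_protein_length(transcript_bp):
    """Residue count for a CDS-only transcript ending in one stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1


# First ten codons and last six codons of DPOGS207137-TA.
print(translate("ATGTCGGTGCGCTCGTTCTACTTCGTACTC"))  # MSVRSFYFVL
print(translate("GAACATTGCAGTAAATAA"))              # EHCSK-
print(expected_protein_length(2052))                # 683
```

Both fragments agree with the start (`MSVRSFYFVL`) and end (`EHCSK-`) of DPOGS207137-PA, and 2052 bp corresponds to 684 codons, which is 683 residues plus one stop codon.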