Monarch geneset OGS2.0

DPOGS213974
TranscriptDPOGS213974-TA2091 bp
ProteinDPOGS213974-PA696 aa
Genomic positionDPSCF300306 + 40283-58703
RNAseq coverage354x (Rank: top 33%)
Annotation
HeliconiusHMEL0153760.075.75% 
BombyxBGIBMGA013740-TA0.072.98% 
DrosophilaCG9447-PB1e-4726.17% 
EBI UniRef50UniRef50_Q7PTG83e-5825.45%AGAP007079-PA n=3 Tax=Culicidae RepID=Q7PTG8_ANOGA
NCBI RefSeqXP_001688057.12e-6327.67%AGAP007074-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582863353e-6227.67%AGAP007074-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571295295e-6325.03%hypothetical protein AaeL_AAEL011513 [Aedes aegypti]
Group
Gene OntologyGO:00167474e-11transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathwaydme:Dmel_CG333379e-10 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain[212-648] IPR0026564e-11Acyltransferase 3
Orthology groupMCL22336 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213974-TA
ATGAGGTGGATTGTGGTGTTTTGTACAGTGAACATAGTTTTTGCCCAAAATATTAGTGAAAAGGATGGAACAACAGAAGCAACGTTCGACTATTATTCAATGCCTGCACTCTCTCAGTTGGATGATTTCGATTTGTGCCTTCGAAAACCGAAATCAATCTACTGTATCGTAGACTTTGAACTTCTGGAAGATGAAACACCCCTCTACAAATATATACAGAACTTTTCGTCTTTAAGTTATAAGAACTACAAGCATTCAAAATTGCACAGAGGAGTCTGTGGGGATAAGTGCGGTCTTAACTTTACCCATGTAAACTTTACCGATACACTTGTAGGGAACTTGAAGAAGTGTTTCAATGAGACCATCCACGACAAATATGGCTTACAGGTCAATACATTATCATTAAGATACTGTAAAACTCAAAAGGACTTTATACCGGTGGATTCACTAGATTTTGTTTTCGGTTGCATTTTATTATTAATACTTTTGGTCAACCTTGGATGTACTGTGTATTATTTCTTTTGGCGTCCACAGAATGGGAAAGAAAACAGATACATGTTAGCTTTTTGCGTACAAAAAAATTGGAAGGCTTTAAAATATGGCGGCAGCGCTGAAGGTGGAATCTTCAAATGTTTTCAGGCAATGAGGTTTTACACAATGGTTATGATTCTTGGCCTTCACTCTATGATTTTCATCGGTTACGGCTACACCGAAAATCCTGAGTTCATAGAAAATTCGTATGATGACTTCTTCAAGTCCCTTCTTTTCAATGCACGTGTCATTGTACAAATTTTCTTCGTCATGGGCGGTTTTCTAATGGCTTACAAAATGCTCCTTTACGCTGAAAGTCATCCGTTCACACTCAAAACCGTTCCCATGGCTATTGTCAATAGATGGCTCAGGTTAATGCCAGCAGTATTAGTAGTGATGGCTCTAGCCATGACTTGGGTACCCCACATGGGGTCGGGGCCGATGTGGGATGCAGTAGTGAAACGCGAACGTGACTTATGCAGAAAGAATTGGTGGCAGTTAGTAATCCTGATGCCAAATTTGTTCCCATTTGAAAACCTTTGTTTGCCACAGGCTTGGTATCTTGGTACCGACACCCAGCTGTTTTTCGTAACACTCGTCGTTTTGCTCATAATCTGGAAGTGGCCGAGATTCGGAGCTCCTGTTCTCAGCGGCGTGATGATAATTAGTCTTATTATTCCATTTCTGCAAAGCTATTTTATGAATCTATTACCAATACGAGTTAGCATTTTCCCAGAGTCCATAAGAGACATTTTCGGCTACAACGACACTTTCTACCATGCGTACGTGTCTGCGCAAGGAAATTGGGCTGGGTATCATCTTGGAGTGCTCACTGCCTGGTTTTACCATAAAGCACAGACTAAGAAATGGGATTTAGGCGAATCTTGGCTTCTCAAAGTCCTCTTCGTCATCTCTATCCCAGTAGCGATGGGAACTGTCCTCATGGGGTGGGATCTTCATCATCGGGAGGCATCAGCATTTGAAGCGGCTGTCTTCAATGCTTTGAATCAAAACTTCTTCGCGCTTGCTATCTGTGTATTCGTCATCGGATATTTCTATAGATGTAACATAGCAATGGGAACTGTCCTCATGGGGTGGGATCTTCATCATCGGGAGGCATCAGCATTTGAAGCGGCTGTCTTCAATGCTTTGAATCAAAATTTCTTCGCGCTTGCTATCTGTGTATTCGTCATCGGATATTTCTATAGATGTAACAGGATCTACGTTGGTGCAGTTGAATGGGGCCCATTACAACCTCTGGGCAGGTTATCCTATTGTGCTATGTTGCTGCACGCGACCGTTCTCAGAACTTACGGTGGCCAAATGAGAAGGTCTTTTTATGCAACTGATTACACTGCTATTATGTTATTCTGTGGTATTGTTTGCTCCACTTATCTTCTGTCACTGCCTCTTCATCTGTTTGTCGAGGCTCCGGCCTGCCAAATACAGAAGATACTGTTTGGTTCTAGAAGACAAAAGAAGCACGAGGAACATAACACGGTGACCGTTAAACCTGGCATTTCTAATGTCTCAGTTTCCACCGTCTCCACACATATTTAG

Protein sequence:

>DPOGS213974-PA
MRWIVVFCTVNIVFAQNISEKDGTTEATFDYYSMPALSQLDDFDLCLRKPKSIYCIVDFELLEDETPLYKYIQNFSSLSYKNYKHSKLHRGVCGDKCGLNFTHVNFTDTLVGNLKKCFNETIHDKYGLQVNTLSLRYCKTQKDFIPVDSLDFVFGCILLLILLVNLGCTVYYFFWRPQNGKENRYMLAFCVQKNWKALKYGGSAEGGIFKCFQAMRFYTMVMILGLHSMIFIGYGYTENPEFIENSYDDFFKSLLFNARVIVQIFFVMGGFLMAYKMLLYAESHPFTLKTVPMAIVNRWLRLMPAVLVVMALAMTWVPHMGSGPMWDAVVKRERDLCRKNWWQLVILMPNLFPFENLCLPQAWYLGTDTQLFFVTLVVLLIIWKWPRFGAPVLSGVMIISLIIPFLQSYFMNLLPIRVSIFPESIRDIFGYNDTFYHAYVSAQGNWAGYHLGVLTAWFYHKAQTKKWDLGESWLLKVLFVISIPVAMGTVLMGWDLHHREASAFEAAVFNALNQNFFALAICVFVIGYFYRCNIAMGTVLMGWDLHHREASAFEAAVFNALNQNFFALAICVFVIGYFYRCNRIYVGAVEWGPLQPLGRLSYCAMLLHATVLRTYGGQMRRSFYATDYTAIMLFCGIVCSTYLLSLPLHLFVEAPACQIQKILFGSRRQKKHEEHNTVTVKPGISNVSVSTVSTHI-