Monarch geneset OGS2.0

DPOGS202750
TranscriptDPOGS202750-TA1302 bp
ProteinDPOGS202750-PA433 aa
Genomic positionDPSCF300335 - 118005-128183
RNAseq coverage4x (Rank: top 89%)
Annotation
HeliconiusHMEL0179892e-10949.74% 
BombyxBGIBMGA010531-TA9e-8743.33% 
DrosophilaCG14205-PA2e-4931.59% 
EBI UniRef50UniRef50_Q7QFJ97e-6234.57%AGAP000535-PA n=1 Tax=Anopheles gambiae RepID=Q7QFJ9_ANOGA
NCBI RefSeqXP_310550.41e-5935.73%AGAP000535-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479640043e-6134.57%AGAP000535-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479640047e-6635.38%AGAP000535-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00167478.9e-19transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathwaydme:Dmel_CG333371e-37 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain[44-391] IPR0026568.9e-19Acyltransferase 3
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202750-TA
ATGACAGTGACTCCTGTAGATATCTCAAATTGCGTAGGCGTAGTTCTCATGGGTCGCTTGTATCAGGACGTCACTACAGCAGCTTTACCCCAGCTTGCCAAGCACCAAGTCGGTCTATGGATGTCTTCACTGGAAGCCATTTGGATAATGGCAGCACCTATTTGCGTTGACACATTTTTTCTTTTAAGTGGATTATTAATTGTTTATGTGACAGCTGCTAAATACAATTACATGCAACTTCTAAAAAATCTTCATTTATTTTATCTAAACCGTTTGTTGAGAATGTTTCCACTCCTTGCACTTACGGTTCTTTTGGAAGCAACGCTTTTTAATCATGTAGCTGATGGGCCGTTTTGGAACAAATATGCGGAAAATATTCATAAGTGTAAAACATATTGGTGGAGCACTCTTTTATATATACAAAATTATGTAAACCCAGAGAACTTGTGTTTAACTGCAACCTGGTACCTGGCGATAGATGTTCAACTTCATATTTTGTCTCCGTTGTTATTGTTCTGGGTTCTAGGCAGACAAAGAAAAGTTGCCTGGGCGGCCCTCACATGTGTATGGTTAGCATCGCTTACTGCAGCTACAATTTATAATTTCATTAAAGAATTTCCTTCAAGTCCATTGATGCCAAGTCGATTACTGGAAAATGTATACTACATGACATATTATTACATGAACACTTTAACAAGAGCTAACGTTTTTATTATTGGAATGATGCTGGGTTATCTATTATATATTTGGAAAGGACATCAATTAAAAATATCAAAAGTATTAAACATGTTTCTTTGGATAATCAGCCTTTCCATCTGTGCTTTAATTATATATTGTATTCATCCAACTCTGCAAATAGATTATGAAAATCAAATTTTGGACAGCATTATAAATTCTTTTATGAGGCCTTTGTGGGCTTTGGCTTTATCCTGGATAATTTTTGCTTGCGTCAAAGGTTACGGAGGTCCAATCAATTGGTTATTTTGTTGGAGCTTGTGGAAATTGCCAGCTCGAATATCGTTTTCCTTGTATCTACTGCACATGCCATTTATAGAAATAATAAATGCCACAACAATATTTCCCCTCTATTTTGACGATCGAGCTATAATATTTAAATATATGGGTATATTATTTTTAACAATGATGGTTGGTTTTATTGCTACCGTGTTAATTGAACAGCCTTTCAATAATCTCATCAAGTTGGTTCTTGAACCAGCTGCACGTAAATCTGCAAAGTCTCAAGAAGTGCTTAAACCAGCCAATACAGCCGCTGGCCACAACAGACGGACGGACGGACAATAA

Protein sequence:

>DPOGS202750-PA
MTVTPVDISNCVGVVLMGRLYQDVTTAALPQLAKHQVGLWMSSLEAIWIMAAPICVDTFFLLSGLLIVYVTAAKYNYMQLLKNLHLFYLNRLLRMFPLLALTVLLEATLFNHVADGPFWNKYAENIHKCKTYWWSTLLYIQNYVNPENLCLTATWYLAIDVQLHILSPLLLFWVLGRQRKVAWAALTCVWLASLTAATIYNFIKEFPSSPLMPSRLLENVYYMTYYYMNTLTRANVFIIGMMLGYLLYIWKGHQLKISKVLNMFLWIISLSICALIIYCIHPTLQIDYENQILDSIINSFMRPLWALALSWIIFACVKGYGGPINWLFCWSLWKLPARISFSLYLLHMPFIEIINATTIFPLYFDDRAIIFKYMGILFLTMMVGFIATVLIEQPFNNLIKLVLEPAARKSAKSQEVLKPANTAAGHNRRTDGQ-