Monarch geneset OGS2.0

DPOGS215307
Transcript: DPOGS215307-TA; 2664 bp
Protein: DPOGS215307-PA; 887 aa
Genomic position: DPSCF300120 - 231911-253544
RNAseq coverage: 269x (Rank: top 40%)
Annotation
Heliconius: HMEL010017; 2e-164; 69.68%
Bombyx: BGIBMGA007617-TA; 0.0; 74.36%
Drosophila: CG32645-PB; 0.0; 43.85%
EBI UniRef50: UniRef50_Q7PXM9; 0.0; 47.37%; AGAP001485-PA n=2 Tax=Culicidae RepID=Q7PXM9_ANOGA
NCBI RefSeq: XP_321637.4; 0.0; 47.96%; AGAP001485-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|347966010; 0.0; 47.37%; AGAP001485-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|347966010; 0.0; 47.50%; AGAP001485-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0016747; 1.4e-21; transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathway: dme:Dmel_CG33337; 2e-25
 K00680 (E2.3.1.-) maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain: [483-860] IPR002656; 1.4e-21; Acyltransferase 3
 [189-341] IPR006621; 6.5e-06; Nose resistant-to-fluoxetine protein, N-terminal
Orthology group: MCL15859; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215307-TA
ATGTATCGTGTGTTAATTTTTTTAGTTTTTACAGCCCATGTTTTTGGTGTCACCCCCGAACCCGATACTAGGGAGAGTGCACGAGAAGAGAAATTAAGAAGCGCTCTGATAGACGCTTTCCAGAATATAAAAAAAGCATCCGGCAAAGGTGGACTGGACGACGACCTGGCGTCGGATCTCATCGGCTCCAGGATCGCCGTGCCGCATCTGACCAAGACGGAAAAAACGACGGCTGACAGTTTTGACTTATTCGCTGATGACGACGGCCTCACGTTCTCTGAGGTCTTTGGCGATGTCGAGACGGAAAAGAGAGTGTCAAGTACGGAGAAGTCGCACAACGTACAACATATAGTTAAAACGACGAAACCAATGGAAATAACGACAATTAAACCGAACGCCGTGGTCAAAACTACGATCCCCGTCCGACATGTAGTAGAGACTGTCGAACCGAAAATACAGCCTGCGGGTCTAAACGACACGGTGATGCTGGGCCTGGAGAGCATGTTGCGAATTTTGGATAGCGAAATTCTGATGCAGCAGTGGGATAAACTCAAAAATTCCGTGACAGGAGACTGTAGGAAGAATTTACAGGAGTACGTTCAAGGACTACAAGAGAGACGGCTCTGGGCTTTGAAAATGGAGGACTCCAGCGGCAGATACACGTCCATGTTCTACTGGGGTAACAACTACTGGACCGGCTCAGCCGAGCTGTGCAGCATCCTCAACCAGCACCACAGCACGCAGCACGCACACACCACACAGAATAGCAGTGGATCCCAGATCTTCAGCGAGTGGCGCCAGGACGTGTCGGTAACGGGAGATGGACCGCCCTTCAGCACCGCCTTCTTCAACGTGAGGATGACAGTCTCCACGGACGTCGCCGAGATAATCAAGACTAAACGTACTCTCCACCTGGGTCTGTGCTTACCCCGCACGTGCTCCCGTGAGCAAGTACGACAACTCGTAGATAATGTCCGTGCTCCGCTGTTACGTCACAAGGTGGTGGCCGTGCGCTCTCCGCAGCTCGGAGGATACTCCTACATAGAAGACCCCACCTTCCAGATACTGCTTGGAGTGTCGTGCGTGGTGGGCGCGCTCCTGATAGCGGCCACGGCCTACGACATGAAGATAGCGAGCGACGTGCGCGCGCGCAGGCGACGCGTCAACAACATGGCCGCCGCCGAGAGCGGGCACGCTGACCTTAGGCTCAACATGAACGGACTGGACGCCATCACCGTGTCCAAAGGCAAGTCGATGTACAACGTGAACAACAACATCGCGATCAACACCAACAACTCTGAGGAGCGCCTCACCGCCGAGACCGCCTCCACTGACGAAGAGCTCAAACTCAGTATCTGGTCGGAGCTGCTGCTGTCGTTCTCCATGAAGGCCAACATCCTTCAGATCTTCGACCAGAGCGTGGGCTCGGACACGGTGCCCGTCGTGCACGGCCTCCGGACCTTCTCGATGCTGTGGATCATATTCGGACACACCTGCATCGTGGTCTTCAAGTACGCGGACAACACAGCCTTACGAGCCGTCTTGGAGAAGAGTTTTTGGTTCCAACTGATCCTGAGCGCTGTCTACAGCGTCGACACGTTTTTCTGTCTCGGCGGTCTCCTATTTTCGTTCCTGTACTTCCGTACTAACGCTAAAGGTAAGCTGGAGCGCCTCACGAAGGGCCGGCCGAAGATCACCGCCGGCCTGTTCCAGTTCCTTGGACTTATTGGTTATAGATTTGCGAGACTGACCGCGCCCTACCTGTTCATGCTGGGAGTGGTAGAAGTCACCATGAAGTGGTTCGCCTACAACGCGGTGTTCGAGCCTCCCGCCCACGACCACGAGACCTGCCCCAGCTACTGGTGGCGCAATGTACTCTACATCAACACGCTCTTCCCAGTCGAGCAAATGTGCATGTTGTGGAGCTGGTATCTGTCCGACGACACTCAGTTCTACGCAGTCAGCGCCGTCCTCCTTATCTTAGCGACCAGCCACTTCAA
GCTGTCAGCGATCCTGACGAGTGTCTTCTTCGTGTCGTCGCTGTTCACGACGGGCTACGTGTCGTACAGCAGTCAGCACGTACCCAACGGAGAAGATCCCTTCACGCAGTTCGATAAGATCTACGACAAGCCCTGGACGAGATTGGGACCCTACCTCGTGGGAATCGCCACGGGGTGGATACTGTTCAAGACTAACTTGAAAATCAACATGAGTAGGGTGTGGTGGTGTGTGGGGTGGGCGGTGTGCGCGGGCGTGCTGCTGTCGCTGGTGTTCGGTCTGCACGGCGCTCGCCTGGCCGGCGTGACGGCGGCCGTGTTCAGCGCTCTCTCACACTCGCTGTGGGCGGCCTGTGTGGGCTGGGTCATCATCGCCTGTTCCACTGGACACGGAGGTTGGGTTCGTCCGCTGCTGTCATCCCCTGTGTTATATCCGTTCTCCCGCGTCACGTACTGCGCCTACCTCGTGCACCCCGTGGTCCTGCGCTACGTGGCCATGCACCTCACCCACCCGATACATCTGGGAGAACTGCTCGTGTTCGTGTTGTTCCTGGGCCTGGCGGTGATATCGTTCTTCCTGGCGTTCGTCATCTCCGTGGCGTTCGAGGCTCCGATCGTGACGATGCTGAAGATCGTATCTCCTCAGAAGAAGCCCCACCGCGTGTAG

Protein sequence:

>DPOGS215307-PA
MYRVLIFLVFTAHVFGVTPEPDTRESAREEKLRSALIDAFQNIKKASGKGGLDDDLASDLIGSRIAVPHLTKTEKTTADSFDLFADDDGLTFSEVFGDVETEKRVSSTEKSHNVQHIVKTTKPMEITTIKPNAVVKTTIPVRHVVETVEPKIQPAGLNDTVMLGLESMLRILDSEILMQQWDKLKNSVTGDCRKNLQEYVQGLQERRLWALKMEDSSGRYTSMFYWGNNYWTGSAELCSILNQHHSTQHAHTTQNSSGSQIFSEWRQDVSVTGDGPPFSTAFFNVRMTVSTDVAEIIKTKRTLHLGLCLPRTCSREQVRQLVDNVRAPLLRHKVVAVRSPQLGGYSYIEDPTFQILLGVSCVVGALLIAATAYDMKIASDVRARRRRVNNMAAAESGHADLRLNMNGLDAITVSKGKSMYNVNNNIAINTNNSEERLTAETASTDEELKLSIWSELLLSFSMKANILQIFDQSVGSDTVPVVHGLRTFSMLWIIFGHTCIVVFKYADNTALRAVLEKSFWFQLILSAVYSVDTFFCLGGLLFSFLYFRTNAKGKLERLTKGRPKITAGLFQFLGLIGYRFARLTAPYLFMLGVVEVTMKWFAYNAVFEPPAHDHETCPSYWWRNVLYINTLFPVEQMCMLWSWYLSDDTQFYAVSAVLLILATSHFKLSAILTSVFFVSSLFTTGYVSYSSQHVPNGEDPFTQFDKIYDKPWTRLGPYLVGIATGWILFKTNLKINMSRVWWCVGWAVCAGVLLSLVFGLHGARLAGVTAAVFSALSHSLWAACVGWVIIACSTGHGGWVRPLLSSPVLYPFSRVTYCAYLVHPVVLRYVAMHLTHPIHLGELLVFVLFLGLAVISFFLAFVISVAFEAPIVTMLKIVSPQKKPHRV-
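
Consistency check (illustrative):

The transcript and protein lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is not part of the database entry. It assumes the transcript consists only of coding sequence (it starts with ATG and ends with the stop codon TAG, with no UTRs) and that the trailing "-" in the protein sequence marks that stop codon. The function name `expected_cds_length` is hypothetical.

```python
CODON_LENGTH = 3  # nucleotides per codon


def expected_cds_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp implied by a protein length.

    Assumption: the transcript is CDS-only, so its length is one codon per
    amino acid plus, optionally, one terminal stop codon.
    """
    codons = protein_length_aa + (1 if include_stop else 0)
    return codons * CODON_LENGTH


# Values copied from the record above.
transcript_bp = 2664
protein_aa = 887

# Compare the reported transcript length with the length implied by the protein.
is_consistent = expected_cds_length(protein_aa) == transcript_bp
print(is_consistent)
```

Under these assumptions, 887 amino acids plus one stop codon give 888 codons, which matches the reported 2664 bp.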