Monarch geneset OGS2.0

DPOGS204385
TranscriptDPOGS204385-TA2148 bp
ProteinDPOGS204385-PA715 aa
Genomic positionDPSCF300002 - 1651413-1668677
RNAseq coverage74x (Rank: top 65%)
Annotation
HeliconiusHMEL0130803e-15056.47% 
BombyxBGIBMGA007617-TA2e-11035.43% 
DrosophilaCG32645-PB2e-12832.77% 
EBI UniRef50UniRef50_UPI0002060E617e-18045.71%UPI0002060E61 related cluster n=1 Tax=unknown RepID=UPI0002060E61
NCBI RefSeqXP_001946627.10.045.85%PREDICTED: similar to CG32645 CG32645-PB [Acyrthosiphon pisum]
NCBI nr blastpgi|3287028273e-17945.71%PREDICTED: nose resistant to fluoxetine protein 6-like [Acyrthosiphon pisum]
NCBI nr blastxgi|2700008920.046.35%hypothetical protein TcasGA2_TC011151 [Tribolium castaneum]
Group
Gene OntologyGO:00167476e-16transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathwaydme:Dmel_CG333379e-31 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain[308-693] IPR0026566e-16Acyltransferase 3
[55-181] IPR0066211e-07Nose resistant-to-fluoxetine protein, N-terminal
Orthology groupMCL16668 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204385-TA
ATGTATAAAATTAATTTATTTGTGACATTGTTTGTGTGTGTGAGTTACGTAAGTGGGAAAGTGAACTCTACGGAGGATGGGACAAGCTCTCGTATGGCTTGGATGAAGAGTTTATTGGACCATCATGACTGGATTAACGTACTGAATGGAAGCCATAATGTGTCCGAAAAGTGTGAATCGGATTTGAGGCAGTACTTGACGGCGTTGAATGATGGACTGCTGTGGGCTTCCAAAATTTATGATGCATCAGGCCAGTATAGTCCAAACATGCTATTTGGGAACGAATTCTGGCTCGGATCAATAAACGCATGTCGTGATTTGCAACACAAAGAGTACTATGCGCAAACGCCGCCTTTCCCTTGTACCTTCTATGTTGCCAAAATTAATCTCACCGTCGACGGAGATCACATACCACAGACAAGGCCGATGCTTGTCGGTCAGTGCGTGCCGGCGTCTTGCGATAAAGATGATTTGAAATCTGTATTGGATGCTGCGGAGACGACAATCGTTGAAAGGGCCGTAGCCAATGGCTTCGTCGCTTCCTTCACTACCCTCTACGTCCGACCTGTCCCTGGCACTTATGATGTTTTAAAAGATTTTAAATTTCATATATTAACGACCGTTATTTTGACGGTGTCAGTGTGGATGTTGGTAGCTTCTGCCTACGAAGGTTACTTGGAGAGAAAATACCGCAATAAAGAACCGAAGGATCTAGAAGTAGCCAACAACAATCACAAACCAACTGCGAATAACAATAACACACCACAAGCGTCTGCAAAAAATGTGGACAACGATATCAAAGAAAAAGACATAAGAAGGGATGTTTGCGGAGTATGGTCTGAAATTCTGCTGTCATTTTCAATACTCTCTAATGGGCGAGCCATTCTGAGCACACAAAAGCCGAGCGACGGAGCACTAACTTGCTTACACGGCATGAGATTCCTTTCCGTGTTGTGGGTCATCATGGTGCACACATATTTGACAGTTTTCTACATAGCAGATAACAAGACTATGAGAGTGGTCACTGAAAGGAATTTTCTTTATCAATCAGTCGGTAATGCATCCTACTGTGTGGACACATTTTTCTTTATCAGCGGTCTGCTCGTCACTGTGCTTTTTTTGAGAACAGAGGAGAATTTACTTGACAAGCCGGAGGTTAGGGTTTACAGCAAACGAGAAGTCTTCGGTATGACGAAGTCTTTTCTCGTCCTCCTATCATACCGCGTGGTGAGGCTGACGCCGGCGTATGCGTTCGTCATCGGTTTGAACGAGCTGGCGCTTCGGTACACCTACGACCACACGGTGTTCGAGCCGGCTATCTTCGACCACATCAACTGCAACCATTACTGGTGGCGTAACTTGCTCTACATAAATAATTTATTTCCTCAAAAAGACATGTGCATGGTCTGGTCCTGGTACATGGCTAATGACACGCAATTTTATGCTGTCGGTATAATACTGCTGTTGATATCCATCAAGCATACGAGATTCGCGATGGTGTCCCTGATCCTGGTGTTGGTTAGTTCCTGGGCAACCACCATCTACGTGTCAGTGTGGCACCAGTACAAAGCTCGCATTCAAGAGCCGTTTGAAATGTTTGATCCACTTTATGACAAACCGTGGTCCCGCATCGGACCTTATTTGGTTGGAATGATCGTAGGGTGGTATTTACATAAAACTAAATGTCAAATAAAAATGCCATATTGGCTGGTAGCGGTTGGCTGGCCGGCCTCCCTCATTATTATTGCCAGCCTCATCTTTGGTATGGTGGACGGATACTTTGAAGTCTGGCCAACCGCCTTTTACGTCAGTGTTGGTCATACAGGGTGGGGCGTGGCTCTCGCATGGATTTCAATAGCGTGTTGCTGTGGTTACGGAGGACTTATCAAATCAGGGCTGTCCTACCGTGGACTGTTACCACTCAGCCGACTCACGTACTGCGCGTACCTCGTGCATCCAACCATCATGATGTATACCTCCTTCTTGCTAGACGGGCCTCTGCATCTGGAAAACTCTATGGTGCTCGTCATATACTCGGGGTACGCCGTCATGGCATTCCTGGCTTCGTTCGCTATTTCACTGGCATTTGAGGCGCCCGCAGTGAGACTGTTGAAGATTATCACTGGAGGAAGCAAGAGCGAAAAATAG

Protein sequence:

>DPOGS204385-PA
MYKINLFVTLFVCVSYVSGKVNSTEDGTSSRMAWMKSLLDHHDWINVLNGSHNVSEKCESDLRQYLTALNDGLLWASKIYDASGQYSPNMLFGNEFWLGSINACRDLQHKEYYAQTPPFPCTFYVAKINLTVDGDHIPQTRPMLVGQCVPASCDKDDLKSVLDAAETTIVERAVANGFVASFTTLYVRPVPGTYDVLKDFKFHILTTVILTVSVWMLVASAYEGYLERKYRNKEPKDLEVANNNHKPTANNNNTPQASAKNVDNDIKEKDIRRDVCGVWSEILLSFSILSNGRAILSTQKPSDGALTCLHGMRFLSVLWVIMVHTYLTVFYIADNKTMRVVTERNFLYQSVGNASYCVDTFFFISGLLVTVLFLRTEENLLDKPEVRVYSKREVFGMTKSFLVLLSYRVVRLTPAYAFVIGLNELALRYTYDHTVFEPAIFDHINCNHYWWRNLLYINNLFPQKDMCMVWSWYMANDTQFYAVGIILLLISIKHTRFAMVSLILVLVSSWATTIYVSVWHQYKARIQEPFEMFDPLYDKPWSRIGPYLVGMIVGWYLHKTKCQIKMPYWLVAVGWPASLIIIASLIFGMVDGYFEVWPTAFYVSVGHTGWGVALAWISIACCCGYGGLIKSGLSYRGLLPLSRLTYCAYLVHPTIMMYTSFLLDGPLHLENSMVLVIYSGYAVMAFLASFAISLAFEAPAVRLLKIITGGSKSEK-