Monarch geneset OGS2.0

DPOGS202659
Transcript | DPOGS202659-TA | 5316 bp
Protein | DPOGS202659-PA | 1771 aa
Genomic position | DPSCF300039 - 40789-74974
RNAseq coverage | 1187x (Rank: top 11%)
Annotation
Heliconius | HMEL003729 | 0.0 | 78.82%
Bombyx | BGIBMGA000858-TA | 0.0 | 48.26%
Drosophila | drd-PA | 8e-95 | 31.81%
EBI UniRef50 | UniRef50_F4WQ68 | 2e-150 | 34.86% | Nose resistant to fluoxetine protein 6 n=2 Tax=Myrmicinae RepID=F4WQ68_ACREC
NCBI RefSeq | XP_001599124.1 | 2e-144 | 41.11% | PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp | gi|332023448 | 9e-150 | 34.86% | Nose resistant to fluoxetine protein 6 [Acromyrmex echinatior]
NCBI nr blastx | gi|332023448 | 1e-155 | 32.20% | Nose resistant to fluoxetine protein 6 [Acromyrmex echinatior]
Group
Gene Ontology | GO:0016747 | 2.1e-19 | transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathway | dme:Dmel_CG33337 | 7e-53
    K00680 (E2.3.1.-) maps to:
        Benzoate degradation via CoA ligation
        Limonene and pinene degradation
        Ethylbenzene degradation
        Tyrosine metabolism
        1- and 2-Methylnaphthalene degradation
InterPro domain | [1338-1728] | IPR002656 | 2.1e-19 | Acyltransferase 3
InterPro domain | [393-545] | IPR006621 | 2.2e-13 | Nose resistant-to-fluoxetine protein, N-terminal
Orthology group | MCL17431 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202659-TA
ATGGTGTGCCACAGATGGACTTTAGCTCTAATTTTATTCGGCCTTCCTATATTAGCTACAGTAGGCGCATTCCGCGTGCGCTTACCTGGAAGTGTTTCTGAAGTTAGGCCCGATTTAAAAAAACACAAATGTGAGAATTGTGATATTGGTGACCTTAACTTTGACCGAGTAGATAGTGTTAGAAAGAGTGAGAAGATTCCGTACGATAACGTGAAAAGTATTCTTAGTGAAAAACGAGATTTGCTTAGTGAGAAGTTAAAAAAAATAAGCCATAAAACCATAAAAACTCAAGATGATCTTGATGAAACTGAACAGAATAAGGTTATGGGCAAAAATGAACACGATATTTCAGTCCACTTAAAAAAAGATAAAGAATCGAAAACAAAAGTAAAGAAACCGTCAATAATCTTAGACTTTTCTGATGACGATGATAGTGATATTAACGAAGACGACGATGACGAGGACGAAGATGACATAAAAAAAGTAGTGAAAGAAGATTACAAGAAGAAAGTAGATATAGTTCCAAAATTTACACCTAAAGTAGAGAAATTAGTACAGCTACCAAAATCAAAGGACAAACTGATTGTAATAAATGAAAAAATATCTGACAAAAAAGAACCAAGTAAGAATGAACCAACGGCACAAATTCAAGAAGAATCGTTTAAAGATATCAACATAAAAAATGTTAAAACTGTTGCTAAAGATAATAAAGTAAGCGATGACATCGAAAAAAAAAGTAAAAGTAAGGAAAATGATTTAAGCAAAAGAAAAATTGAGATCAAAATAGAAGAAAAAGATATCGACAAAAATAATGTAAAAAATACTAAAAAGTTGTCAGAAGATAATACAAGTATAAAACACGGTGACAAAAAACACCTAAGCAGTAAGACTTCACAAAAAAAGGATGAGTTAATGAATGAGAGTGCTAAAGAGAAAAAAAATGAACAGGCGTCGAAAAAGCTATCTGATCTCAAAGAAAGTATTCATGAGGAATCTAAAAAAGATAAACCTTTAAAAGTAGTTGATGCTACAAAAAAAGATATTGTAGATCAGAGTACTTCGAAAGACATTCGTATAATCTCAGATGCATTACAAAGAAAAAATTTGTTACATAGTGAGTTTGAAGACTTCTATTCCTTCTTACCTACTTTTGCACCAAATTTTACACGAATACACAATCCAGAATGTCGACGTCATGGACAAATACTTTTAAGACAGCTACGTGGAAGTAAGCTTTGGGCTTTAAAGATGCTAGATGCTACAGCAAAGTTTCCCTCGGGGTTTCTTCAAGGGAACGGAATCCAGTTGGGTGATTTTGACCAGTGTTTGGGGGTACGAGCAAGAGTACAACTGGACACTGGCAGCGTTGTTAGACTACAAGGAAAATATTGTCTAGCCATGATCGATGTCAAAGCTGAACACCCAGAACTTGAAATACCAGTTCATTTGGCACAGGGCAAGAATTTGTTTAAAAGTCGCATTGATGATCCAGGTCACTTCGTGCCTAGGTTTTCAACCTTGAGTTGGGGAGTGTGCATACCATCGGCTTGTACTTCAGAAGATGTGGAGGTGGTTCTCAGAGATGCTGTTAAACATTATCAATACAAGACTGGTATTAGTACCCGGATCAGGGTCGATGAACATGATTGTTTTACCAGGAAAGGTAGTAATTGGTGGAAGGAATGGATAGAGTTACCGACTGTTGTAACTTTATCCTTGTACGCGTTGGTTATCCTGATAGTATTGATAGCGACAGTCCAAGATATGTCTGCTAGAAATGATTCTAACCACGAGGAACATGAAGAGGAGAATAAACAGATGAAAAAGCAAGAGACCAAAACCAGTTCAGGTGGCTTCCTGAGCTGTTTTTCTTTGTACCATACGCTGAATAAGCTTATAGCACCAGCAAGCGAGGATGAGATAGCTTGCATCCATGGCATCCGAGCTGTTGTAACTGTGGCACTCATAGTCGCTCATAAGTTCCTACCGATGGCTTTAAC
ACCGTACACAAACCGGATACGTTTGAGTGAGATCTTAAGCTCCTCTTTGTTGTCCTGGTGCAGAGCTGGTTGGATCTTCACAGATTGTTTCCTCCTGATAAGTGGTACTCTCACTTCATATAGGAAGTCTCCAAGTGATAATGTCGCAACAAAATTATTAACGCCGGCTCTTTTAGCAATAGTTTTGTTTTATGCTTATGTTTGGGATAATATATCCTCCGGTCCCATGTGGGGGACTCTCGTTTGGAAAAATTCTCAGCTGTGTCGCGATGGTTGGTGGTGGAATATACTATACGTGCAGAACTACTTTGGATTTGAAGATATGTGTGCTCCTCAAACCCACCACATGGCTATGGATTTTCAACTGACGATCGTAGGAAGTATAATTGTGTGGATGATCCAATCTGAGGTTCCATTTGCTGGGTCCTTGCTGCCAACGTTACATATACTGTCTGCATACTCCAGATACACGACTGTACGGGATCATCGATTGACACTCTTGGCATATCAAGGTGTCAGTGTCAGCCAACTATACAGGACAGGTCGATTGAGCTACACGTCTATTTATCATCGATGCTCGCCCTACTTGATCGGACTAAGCTTAGGGCTTTATCTTAGAAACCGCTCACGTCTCACTAAGCCCTTAGTATACTTAGGTTGGTTCATCTGTGGCACTTTATGGGGGGCAATATTTTGGGCGGGCTATGATTCCGGATACCTTGATTACCGGTATGACCTCACATACGCCGCGCAATATGCAACGCTTGCACCCATCGCCGCTGCGTTAGCCTTTGCCTGGATAATCTATGCTGCTCAAAATGGACACTCTGAAACTCTATCTCGAATGCTTTCCGGTCGTCCACTTCTATTTATTAGTAGAATATCATATGCGCTTTATCTTGTACAATTTGTTGTCTACCTTACAAACACCGCCTCTATTAAGGCTTCAAGGGAGTTTTCTCTTACATCGCTAATTGATCTTCAAGAGATCGTTACTATTCTAGTATCTTCCATAATTCTCACCGTGACCTTGGTACTTCCACTGCAGTCGCTACCAAAGTTATTTGAAGCTCCAACAATCGATAACTTGGAAAATAACGATAATAAGGATGTTTCGGAATATAACAATTTGCAAGAAAAGCTAGAATCGAAATATAAAACAGATGAGAAGGAATTAATTTCAGAATCTCATCAACCAAGGCGATCATTTTTAGCTCATCGAGAAGTTTTGGAAGAAATACCAGAAGTCGAAGTAGAATATGAGGTACAAAGAGATTCACATGACGGTTTAGAAGAAATTTTAGAAGAAGAAGATGACGAAATGATGGATCGAGAAGAAGATGATTTAGAAATTATTGAGGAAGAACAAGGAGGTGAAGAAGATTTTTGGGCAGACAGGGAAGAATATTCATCAAGTTATTTAAGAAATGGCGATCAAGAAGTTGACGAGTGGGAGTGGACTGCAAATGTGTTCGACGCATCAGCCAAGTCTCCACAAGGTCTTCTGTTCGGTTCTTCTTACCATCTCGGTAATTTCGATGAATGTGTAGGGATCGACGAGCCAGGAGAAGGTCTGACCGTGGAAGGTCAATATTGTCTGGCTACTATCAAGTGGAGGCAGTCAGAGGAAACGAAAAAGATAAGAACTGGTCGCGGCGAGACTCTTCGTTGGGCTGTATGCGTTCCGAGCGCTTGTGATGCAAAAGCTGTGGCTGGCTTTGTAGGAGATGTGTTGTCTCATACAGTCGGAAATTCAACTGGAGTGGAAGTTACCGAGAGGGATTGTTACACACGGAAACCTATAACTGTTACGAAACTTGATATCGCTTTCCTTGGCATACTATTTTTCTTTGGTTTGCTGTGTCTGTTTACAACGTCGTATGAATTATACATTATGAAGTATCCACGGAAGAAAAACAGTCCTGTCCAGGATCTGATAATTGCTTTTTCCTTGATAAACAATATGAAAAAGATTCTATCAACAAAACAAAATAATAGCT
TGGGGCTAGAATGTATTAACGGCATCAAAGCGTTAGCTATGATCTTTATTATAGCGGGACACGCCTGTCTTTTTATTGGCAGTGGACCCGTTATGGACGCTGAAGCTTGGGACAGACTGATCCGAGATCCAATAAATGCCTTCATGTTAAACAACACGCTCCTTGTCGATACATTCTTGTTTCTCAGTGCATTTCTCTTCAGCCGATTGCTCCTTATTGAGTTAGATAAGCGCCGAGGGAGGCTTAATGTCCTACCCATATTAATATTTCGATATGTCAGAGTAACTCCAGCCTACCTCATTATTATACTATTTTATATGACTTGGTTACCAAAGATCGGAGAAGGTCCGTTATGGGAAGGAAGGTTGCAGCTGGAACAAGAACGTTGTATGGAAGTTTGGTGGGCCAATATACTATATATCAACAATTATATTAATACCGATAAATTGTGTATGTTCCAGTCATGGTATTTGGCAGTAGACACACAGCTTTTCTTTGTCGCACCCATATTTATCTACAGCCTGTGGCACTGGAGACGATTTGGAGCCATATTTACGTCGGCGGCGACTTTCATATCCCTCGTCATCCCATCTGTCATCACATATAAAGAACGGCTGGATCCCACTTTGTTGTTTTATGCAAAAGAATTCACGGATTTTGCCACCAATAATTATTTTGTGGGCGCTTATATAAGAACGCACATGAAGATGACGCCATATTTCATGGGAATAATAACAGGGTATATGTTACATAGAATTCAGTTGGAAAAATACCAGTTTTCAACGATTCTCAAAACTCTTGGATGGACAATAAGTATAATACTTGGTACAGTGACAACGCTCTCTGTCAGTCTATTCTATCAGGACTGGTATCAGTACAGCGAGCTTGAAGCCGCTGCATACATATCACTCCACAAATTTGCATGGAGTATAGCCAATGGCTGGCTGGTTGTTGCTTGCGCATCTGGTAACGGAGGTGTTCTCGGCAAACTTTTGAATTGGAAGTTCTTGGTGCCTATTGCTAGATTAACATTCTGCGCGTATCTCGTCAATGGTATAGTTGAGTTATACTATGTCGGCCAGTTGCGACACCCTCTTCACATAACATTCTTTACTGTGGCGGCGAACGCGATATCTCACATAGTGCTCACATTCTTTCTTGCTTTAATACTCTGTATTATATTCGAGTCTCCCTTACATGGAATAGAAAAGATTCTCCTCAGAATGTTTGCTCGTCCCGTATTAAGTGACAATGCCACGCCACCCGAATTACGTGAAACATCACGGAATACTAGTCAAACAACATTGGATAATTAA

Protein sequence:

>DPOGS202659-PA
MVCHRWTLALILFGLPILATVGAFRVRLPGSVSEVRPDLKKHKCENCDIGDLNFDRVDSVRKSEKIPYDNVKSILSEKRDLLSEKLKKISHKTIKTQDDLDETEQNKVMGKNEHDISVHLKKDKESKTKVKKPSIILDFSDDDDSDINEDDDDEDEDDIKKVVKEDYKKKVDIVPKFTPKVEKLVQLPKSKDKLIVINEKISDKKEPSKNEPTAQIQEESFKDINIKNVKTVAKDNKVSDDIEKKSKSKENDLSKRKIEIKIEEKDIDKNNVKNTKKLSEDNTSIKHGDKKHLSSKTSQKKDELMNESAKEKKNEQASKKLSDLKESIHEESKKDKPLKVVDATKKDIVDQSTSKDIRIISDALQRKNLLHSEFEDFYSFLPTFAPNFTRIHNPECRRHGQILLRQLRGSKLWALKMLDATAKFPSGFLQGNGIQLGDFDQCLGVRARVQLDTGSVVRLQGKYCLAMIDVKAEHPELEIPVHLAQGKNLFKSRIDDPGHFVPRFSTLSWGVCIPSACTSEDVEVVLRDAVKHYQYKTGISTRIRVDEHDCFTRKGSNWWKEWIELPTVVTLSLYALVILIVLIATVQDMSARNDSNHEEHEEENKQMKKQETKTSSGGFLSCFSLYHTLNKLIAPASEDEIACIHGIRAVVTVALIVAHKFLPMALTPYTNRIRLSEILSSSLLSWCRAGWIFTDCFLLISGTLTSYRKSPSDNVATKLLTPALLAIVLFYAYVWDNISSGPMWGTLVWKNSQLCRDGWWWNILYVQNYFGFEDMCAPQTHHMAMDFQLTIVGSIIVWMIQSEVPFAGSLLPTLHILSAYSRYTTVRDHRLTLLAYQGVSVSQLYRTGRLSYTSIYHRCSPYLIGLSLGLYLRNRSRLTKPLVYLGWFICGTLWGAIFWAGYDSGYLDYRYDLTYAAQYATLAPIAAALAFAWIIYAAQNGHSETLSRMLSGRPLLFISRISYALYLVQFVVYLTNTASIKASREFSLTSLIDLQEIVTILVSSIILTVTLVLPLQSLPKLFEAPTIDNLENNDNKDVSEYNNLQEKLESKYKTDEKELISESHQPRRSFLAHREVLEEIPEVEVEYEVQRDSHDGLEEILEEEDDEMMDREEDDLEIIEEEQGGEEDFWADREEYSSSYLRNGDQEVDEWEWTANVFDASAKSPQGLLFGSSYHLGNFDECVGIDEPGEGLTVEGQYCLATIKWRQSEETKKIRTGRGETLRWAVCVPSACDAKAVAGFVGDVLSHTVGNSTGVEVTERDCYTRKPITVTKLDIAFLGILFFFGLLCLFTTSYELYIMKYPRKKNSPVQDLIIAFSLINNMKKILSTKQNNSLGLECINGIKALAMIFIIAGHACLFIGSGPVMDAEAWDRLIRDPINAFMLNNTLLVDTFLFLSAFLFSRLLLIELDKRRGRLNVLPILIFRYVRVTPAYLIIILFYMTWLPKIGEGPLWEGRLQLEQERCMEVWWANILYINNYINTDKLCMFQSWYLAVDTQLFFVAPIFIYSLWHWRRFGAIFTSAATFISLVIPSVITYKERLDPTLLFYAKEFTDFATNNYFVGAYIRTHMKMTPYFMGIITGYMLHRIQLEKYQFSTILKTLGWTISIILGTVTTLSVSLFYQDWYQYSELEAAAYISLHKFAWSIANGWLVVACASGNGGVLGKLLNWKFLVPIARLTFCAYLVNGIVELYYVGQLRHPLHITFFTVAANAISHIVLTFFLALILCIIFESPLHGIEKILLRMFARPVLSDNATPPELRETSRNTSQTTLDN-
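
The lengths reported for this record (5316 bp transcript, 1771 aa protein) can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database. It assumes the transcript is coding sequence only (no UTRs) and that the trailing "-" in the protein sequence marks the stop codon. The function names `fasta_sequence` and `cds_length_matches` are hypothetical helpers introduced here.

```python
def fasta_sequence(text: str) -> str:
    """Join the sequence lines of a single-record FASTA string.

    Header lines (starting with '>') are dropped, and a trailing '-' or '*'
    is removed (assumed here to be a stop marker).
    """
    lines = text.strip().splitlines()
    seq = "".join(line.strip() for line in lines if not line.startswith(">"))
    return seq.rstrip("-*")


def cds_length_matches(transcript_bp: int, protein_aa: int,
                       has_stop: bool = True) -> bool:
    """Check that a coding sequence length agrees with a protein length.

    Each residue uses one codon (3 bases); a stop codon adds one more codon
    when has_stop is True.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Values as reported in the record above: 3 * (1771 + 1) = 5316.
print(cds_length_matches(5316, 1771))
```

With the full sequences pasted into strings, `len(fasta_sequence(...))` would give the values to pass to `cds_length_matches`; the nucleotide sequence here is wrapped over several lines, which is why the helper joins lines before counting.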