Monarch geneset OGS2.0

DPOGS215327
TranscriptDPOGS215327-TA2331 bp
ProteinDPOGS215327-PA776 aa
Genomic positionDPSCF300120 + 222434-231420
RNAseq coverage14x (Rank: top 82%)
Annotation
HeliconiusHMEL0100200.050.92% 
BombyxBGIBMGA007966-TA8e-9441.81% 
DrosophilaCG17707-PB1e-4222.86% 
EBI UniRef50UniRef50_E2B9T33e-4522.49%Nose resistant to fluoxetine protein 6 n=9 Tax=Formicidae RepID=E2B9T3_HARSA
NCBI RefSeqXP_001605014.11e-4523.61%PREDICTED: similar to ENSANGP00000011489 [Nasonia vitripennis]
NCBI nr blastpgi|3072113931e-4422.49%Nose resistant to fluoxetine protein 6 [Harpegnathos saltator]
NCBI nr blastxgi|1935756796e-4823.85%PREDICTED: nose resistant to fluoxetine protein 6-like [Acyrthosiphon pisum]
Group
KEGG pathwaydme:Dmel_CG333378e-11 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
Orthology groupMCL25411 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215327-TA
ATGGAGAATTTGTTTATAGTTTTCGATCCGTCTTATTTGTCGTATGTTTGGCCGAAAATAAAAAATGGGGTTCATTTAAATTTAGGACATTACTGTTGGGAGGATGTGAGTGTGTTTTTAAAAGATTTAGCCGAAGGTAGAGCCTGGGCGTACAAGGAACGAGTGCTAAAAAATTTATATGAACTACAAAATCGATTGGGAAAAGACGACAGCTATAAGTCTCAAAACAACGAGACGTTCGGTGAGTTCTACAATGGGGACTACCTCAACACACTCATGAGAAGAGAGAAATTTGGAGGGAATGTAGCTGAGTGGTCCTCAATGGTTCAGCGTGACGAGCTGATGAGAAAAATGGTCCGTTCAGACAACGACGCCGCCGTCTCGCTAGCATACACCGCGCTGCAGGTTGACTTGAATATTACCAAGATTACGCTGCTGAAGTCGTACAAAGTAACTCTCGGCCTATGTTTGCCGCGGTCCTGTGTTCCCCAAGACGTGTTGTCCATTATAAACTTCTCTATAATGCTGAACGACAATTTGAAAACCAACAAAACTGTCTCGAGGACCATAAAGATAACATCTCTGAGACACGTCGAGAGCTTCTATGATATCAGAACTGATTTCGGAGCCGTCTCCCTACTGATGGTCACCCTCACCCTGATAGTAATCGCCTTCATAGCCACAATAATCGACTTCGAGTTAATAAAGTTCAAGCCTTACGTGAAGACTTCTAGTTTCGATATAGAACAGTACAACGACAATCAGAACAGAAACGGACACCACAATAATCACTTTTCAAAGCAGACGCATCGGATCGACGAGACATTGGCCGCGAACTTGAACGAAATCAAGAAGGCGATGCGGGAGAAGGGGGTGCCTCCGTCCGTTACGCTGGACGTAATGACGCAAGCGAGGAAGGAAGGCATCACGAGCTGCAGGCGGTGCGGCAAGTACAAGAAGCAGTGCCCTAACCTGACAAAACAAAACGTGGAACCTTGTGCTACAGTGAAATACAACTCCAGCGCCAGTCTCACCACCGAGTACAAAATGGAGAACACGGTGTGCAAGAACTTGTTGTTGAGCTTCTCCTTCAAGCACAGCTGGGCGAGGATCTTCAACACTAATATAGCGAATAAGAACCTAGCGCTGGTCCATGCGATGAAGATAACAGCTGTCCTGTGGATCGTGTTAGTACATGTGGCGGCCGTAGTCTGGTACAACGCAGACAACAGTATGGACATCAATGATGACAGCACTATGTTTTATGTGCTGTGTTCTGGAACTTTAGCATTCGATTTGTTGTTCTTCGTAAGCGGCGTGTTCAGCTCACAGCATTTCTTCTATTTGAAGAGTCGTTATTCAGCTCAGGAGCTGGTGAGTATGGGAGGCGTCTGCGGGGAGATGCTGCAGTTGATCTGCTTCATCACCAACAGAGCTGTGCGGTTGCTTCCTCCGTATTTGTTTACGATCTTCCTAACGTCCGTGTTAGCGCGTGTGGCTCGCGACACCGCGGTGCTGTCGACACCGGAGCGAGACTACGACACCTGCGACAGCTACTGGTGGAGAAACATCTTGTATCTCAATAATCTATATCCTCAACAAGAGCAGTGCATGCAGGTCTCCTGGTATCTGTCCAGCGAGAGCCAGCTCCACGGAGCGATGTCGCTGGCGTGTCTGCTGGTGGCGTTGTACCGGCGGCGGACGGCGGCGCTGCTCGTGTTGGTCGTAGTCCTCACAGCTGTCACTGTGGACGTCACGCGAGCCATCAGCGACCTCGGTCAACGCGTGTCGTCATCTTTCTCCCTGTACTCCCTGATGGTGTCTCGTCCGTGGGGCGGCGTGCCGTCCTACTGTCTGGGAGCCCTCGTGGGCTGGCTGCTGCATGTGACCGAGGGTCGGTGTGCGGTCAACAGAACCACGAGCTTGTGTCTGTGGTCATCTTCTGCGCTGTGCGTGGTCTCGTCCCTGTTGGTGAGTGGTCGTGGCCCTCGCTGGACGGCGGCGGCCCACCTCGCCTGGCCGGTCGGACTCGTGTGGCCGATGCTGGCAGGTGCTACTACATATTCCGACGTGACCCGCGGCCTGGTGTCCAGTCCAGTGGTGGCGGGTGTCAGTCGTCTGTGCTACACCACGCTCGTGTGTCACGGAGCTGTGTGCCGCGCTCTCCTGCTCTCGGCGGGCACCGCTCTCTGCTCTGATCTATCATGTTTGTTCTGTTACTTCGCTGGTTGTGCGCTCGTGTCTCTGTGCGCGTCTCTGTGTCTCTCTCTACTGGTCGAGATGCCGAGCTGCTGTTTATTGAGACGGATCTCAGATTATACTTACCGGTGA

Protein sequence:

>DPOGS215327-PA
MENLFIVFDPSYLSYVWPKIKNGVHLNLGHYCWEDVSVFLKDLAEGRAWAYKERVLKNLYELQNRLGKDDSYKSQNNETFGEFYNGDYLNTLMRREKFGGNVAEWSSMVQRDELMRKMVRSDNDAAVSLAYTALQVDLNITKITLLKSYKVTLGLCLPRSCVPQDVLSIINFSIMLNDNLKTNKTVSRTIKITSLRHVESFYDIRTDFGAVSLLMVTLTLIVIAFIATIIDFELIKFKPYVKTSSFDIEQYNDNQNRNGHHNNHFSKQTHRIDETLAANLNEIKKAMREKGVPPSVTLDVMTQARKEGITSCRRCGKYKKQCPNLTKQNVEPCATVKYNSSASLTTEYKMENTVCKNLLLSFSFKHSWARIFNTNIANKNLALVHAMKITAVLWIVLVHVAAVVWYNADNSMDINDDSTMFYVLCSGTLAFDLLFFVSGVFSSQHFFYLKSRYSAQELVSMGGVCGEMLQLICFITNRAVRLLPPYLFTIFLTSVLARVARDTAVLSTPERDYDTCDSYWWRNILYLNNLYPQQEQCMQVSWYLSSESQLHGAMSLACLLVALYRRRTAALLVLVVVLTAVTVDVTRAISDLGQRVSSSFSLYSLMVSRPWGGVPSYCLGALVGWLLHVTEGRCAVNRTTSLCLWSSSALCVVSSLLVSGRGPRWTAAAHLAWPVGLVWPMLAGATTYSDVTRGLVSSPVVAGVSRLCYTTLVCHGAVCRALLLSAGTALCSDLSCLFCYFAGCALVSLCASLCLSLLVEMPSCCLLRRISDYTYR-