Monarch geneset OGS2.0

DPOGS214433
Transcript: DPOGS214433-TA, 1833 bp
Protein: DPOGS214433-PA, 610 aa
Genomic position: DPSCF300069 + 624829-629252
RNAseq coverage: 77x (Rank: top 65%)
Annotation
Heliconius: HMEL006485, 3e-19, 44.25%
Bombyx: BGIBMGA011352-TA, 1e-46, 52.20%
Drosophila: RanGap-PA, 1e-09, 28.29%
EBI UniRef50: UniRef50_D6WMN8, 1e-61, 38.53%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WMN8_TRICA
NCBI RefSeq: XP_966731.1, 2e-62, 38.53%, PREDICTED: similar to leucine rich repeat containing 34 [Tribolium castaneum]
NCBI nr blastp: gi|91083649, 4e-61, 38.53%, PREDICTED: similar to leucine rich repeat containing 34 [Tribolium castaneum]
NCBI nr blastx: gi|91083649, 2e-59, 38.35%, PREDICTED: similar to leucine rich repeat containing 34 [Tribolium castaneum]
Group
KEGG pathway: ecb:100059123, 4e-08
 K10165 (NOD2, CARD15) maps-> Shigellosis
    NOD-like receptor signaling pathway
InterPro domain: [23-167] IPR016181, 6.4e-08, Acyl-CoA N-acyltransferase
Orthology group: MCL18902, Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214433-TA
ATGGTGTTCCGCCGGCCCGAGTCCCTGCCCCTGCCGTCCACGTGGCGCAGGTTCACCGTATGTAACACTAACCTGGTCGTCAGAGACCTCACTGAGGACCTGCGGGAGACGGCTGTCCAGCTTCTGGTCAAGTACTTCACGGCTCATGAACCGCCGTGCAAATATATCGAAATCAACAAACATCCAACAGCTTTGGGGGAATTGGAGAAGTTGTGGAGGAAAACCATAGACGACCAGCTCTCGATAGTCTGCGTGAAAGAAGACGACCCCACTGACGTCATCGGAGTCAACGTCCTCACGGTCTCTGACCAAAACGACAAGGAGGAAGAGTTTAAAACGGAGGACAAGATCTGGGCAAAACTGTTTGGAGCCGTGGACCTGGTGACGCGAGCCGTGGACGTGTACCAGACCTTCGGCGTGGAGAGATATCTGACGGCATACGGATTGGTCGTGGATCCGCAATGGAGAGGCTGGGGCATCGGCAAGGAGATGCTGTTGGCTAGGAAATGCATCTGCGATACTGTGGCCATTTTGAAAAAGATAACAGGAAATCCGGAACCATTGAGTTCGACGGAAATTAACAGCATACGCCTGAATCTTTTCACTGAACGTAACTCGGACGGCACCGGGTACTTGGTGCTGAGAGGTAAAGACATATATGAAAAATATAACAGAAGAATTTGTGACAGCGACGTGAGAGCTATATGTCTTTACTTAAAACATTCACCAAGAATAATAACAAAAGTCGACCTGAGTTACAATTCCATAACGGATACAGGATTTTTTAAATTGTTAAAAAACGTTCTAATAAAGGGCAGATCCAGCGTTACCAACCTCAATATTATGAATAACAATATCACAGAGTTGTCCATATTGAACCTGTCGAAGTATGCGAAGTTTTTGAAACTTAAATATCTAAGAATTAATGGAAACGATTTCGGGACTAAAGGCGGCGAATATTTCGCCGATCTATTGTCGAATAACAGGAGTATTGAATGTTGCGACATCGGGGAGACGGGACAGACGTTAACCAGTGTTGCCCACATCATCACCGCCCTGCGCTACGATCACGCTGGCAACACTACTTTAAGAGTTTTCGATTTCAGCCGAATTATTCCATTGTTCAATAGATACTCGTATGAAACAAAGTGGCTCGCTTATCACATCGAATATTTATTAGAACGAAACGATACCATCGTAGAGTTACATTTACAGAAAAACGAATTAATTGGACACGACGTAGAATACTTAGTGAGAGGTCTGCGGAACAACAAGACGTTGCTGTACTTGGATATTGGATATAATAAGATAGGGGAATACGGCGCTGAATTATTCGGACAATATTTATCTGAGAAGCCGCAACTGATATTATTGAACTTAGCGGGAAACGGCATCAGGGACACAGGCGCCAGAGCTCTCAGCTTCGGTCTCCCATATTCAAGAATACGTGCCTTAGACCTGGGACACAATAAAATCACCGACGACGGTATCCTGTACATTCTGAACACGATCAAGAAACCATTTTACATGAGATTCTTAAACTTGTGGGGCAATGACATCGGGGAGACGACGTGCGGCGTCATCCAGAGGATGTTACTAAGTGGGGCTCTGTTCCAACACACGATAGACGTCAGAATCCCGCTGTGCAAAGCTCTCGATATAAAGGTGACAGCAACCGTATTTACCGCGGGGGCGTCACAGGCCGTGGCTAGGAAGGCCGGGTTTAAGGAGCTGTACAAGATCTCCTACCAGGAGTTGGCAGAGCAGGGCTACAGGTTCCCGGGTATCGAAGAAGACACGAAATACTCCAAACTGATGGCACTCGAAATTTAA

Protein sequence:

>DPOGS214433-PA
MVFRRPESLPLPSTWRRFTVCNTNLVVRDLTEDLRETAVQLLVKYFTAHEPPCKYIEINKHPTALGELEKLWRKTIDDQLSIVCVKEDDPTDVIGVNVLTVSDQNDKEEEFKTEDKIWAKLFGAVDLVTRAVDVYQTFGVERYLTAYGLVVDPQWRGWGIGKEMLLARKCICDTVAILKKITGNPEPLSSTEINSIRLNLFTERNSDGTGYLVLRGKDIYEKYNRRICDSDVRAICLYLKHSPRIITKVDLSYNSITDTGFFKLLKNVLIKGRSSVTNLNIMNNNITELSILNLSKYAKFLKLKYLRINGNDFGTKGGEYFADLLSNNRSIECCDIGETGQTLTSVAHIITALRYDHAGNTTLRVFDFSRIIPLFNRYSYETKWLAYHIEYLLERNDTIVELHLQKNELIGHDVEYLVRGLRNNKTLLYLDIGYNKIGEYGAELFGQYLSEKPQLILLNLAGNGIRDTGARALSFGLPYSRIRALDLGHNKITDDGILYILNTIKKPFYMRFLNLWGNDIGETTCGVIQRMLLSGALFQHTIDVRIPLCKALDIKVTATVFTAGASQAVARKAGFKELYKISYQELAEQGYRFPGIEEDTKYSKLMALEI-