Monarch geneset OGS2.0

DPOGS204648
TranscriptDPOGS204648-TA3423 bp
ProteinDPOGS204648-PA1140 aa
Genomic positionDPSCF300462 + 5922-18863
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0055300.051.83% 
BombyxBGIBMGA001848-TA6e-9748.84% 
DrosophilaIr75d-PA3e-5833.42% 
EBI UniRef50UniRef50_E5FIA94e-10440.35%Putative chemosensory ionotropic receptor IR75p n=2 Tax=Obtectomera RepID=E5FIA9_SPOLI
NCBI RefSeqXP_975640.22e-8939.53%PREDICTED: similar to ionotropic glutamate receptor-invertebrate [Tribolium castaneum]
NCBI nr blastpgi|3790700881e-16878.57%putative ionotropic receptor IR75p, partial [Cydia pomonella]
NCBI nr blastxgi|3790700889e-16378.57%putative ionotropic receptor IR75p, partial [Cydia pomonella]
Group
Gene OntologyGO:00160207.7e-10membrane
GO:00052347.7e-10extracellular-glutamate-gated ion channel activity
GO:00049707.7e-10ionotropic glutamate receptor activity
KEGG pathway 
InterPro domain[347-582] IPR0013207.7e-10Ionotropic glutamate receptor
Orthology groupMCL34558 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204648-TA
ATGCAAAGATTAAAAAATTGCTATAAATTTAGAACGCAGACGTTACAGGCGCAGACTATTATTGTGGATCTCATTAGAATAGGCAATTTCACAACAGAAGCTATAAAATCATTAAACCACAATCCCAAATGTAAACTGCAAATGCAGAGGTTTGATGCGAACCACTGCGATGCATACATTAACAAAGAGGACGTGAACCCCATACAAGAGATCGTTATAGAGGAACAGGTTAAAATTGAAAAGGAATTTATTAACGAGCAAGACACAGTCGCTATGGGCTACACAACAGACGACAGTCTGCCACTAGAGACGCAGAGAACCAAAGCCAAGAAGACAAAGAAGAAGAAAGAGAAGAAGATACAGGAACCGAAGGTAGATAGGAGGAGAAAGCCGTTCCTTAACGATGACCTGAATGAGAGTCTGTTCACTATCACCGATCTGACCTTGGAGGAACAAATAGCTGATATCCAGAAGAGACAGGAGAGTTCTAACTTCAAGAATTCAGTGTACAAGTGTATGGAGTGCTTTAAGGGTTTCCTTGATGAAGGAGCGTACAACGGACATATGACAAGGCATACTACTTCTGTGTACGGTGTTATGGAGGATTATAGAGGTGTCAGAATGTTGCCAGTATCAGCGAGACGCAGGGATTTGAGGAAGCACAATCTGACTATGGCCAACGTCATCACAGATAGCAATGAGACCAGACAGCACCTCGACGATAGGCTAAACCTGCACCAGGATTCCATAACCAAGATGTCGTATGTCGTGGCGAAGATCTGCTTCGATATGCTGAACGCTACCGAGAACAGAATCTTCACGCACACTTGGGGCTACAAGGACAAAAACGGCAACTGGCAGGGCATCATCGACCATCTGCTCAAGAAGAAGGCGGATCTTGGTACCCTGACTATATTTACCCAGGAGCGCATGAAGGCCATAGACTACATAGCCATGGTAGGTTCTACCGCGGTCCGGTTCGTGTTCAGAGAGCCCCCGCTGGCTCTATTGGAGAATATCTTCACTTTGCCATTCACATCGGCTGTATGGATAGCGATCGGCATCTGCGTGCTTGGTTGTGCTGTATTCTTATACATAACATCTAAATGGGAAGCCACGGTGGGAATGCATCCGTTACAGCTAAGCGGTTCATGGGCGGATGTGTTGATACTGATCATAGGAGCCGTCCTTCAACAAGGATGTACGCTTGAACCAAGATACGCCGCTGGTCGATGTGTGACCTTACTGTTGTTCGTGTCTCTCACCGTGTTGTTCGCGGCGTACTCCGCCAATATCGTGGTGCTGCTCCGAGCCCCCAGCAGCTCTGTGCGCTCGCTGCCCGACTTACTGAACTCACCGCTGAAGCTGGGTGCCAGCGACTTTGAATACAACCGGTATTTCTTTAAGAAACTCAACGATCCCATACGCAAGGCAATTTATAGCAAGAAAATAGCTCCGTCAGGCAAGAAGCCAAACTTCTACAGCATGAAGGAGGGAGTCGAGAAGATAAGGAAGGGATTATTCGCTTTCCACATGGAACTTAACCCCGGATATCGCTTAATCCAGGAGACTTATCAGGAAGAGGAGAAATGTGATTTGGTTGAGATCGATTACATTAATGAAATTGATCCCTGGCTGCCCGGACAGAAGCGATCGCCTTACAAGGATTTGTTTAAAATAAGCTTCATCAAGATTCGTGAGTCGGGCGTGCAGTCGTGTGTGCACCGCCGGCTGCACGTGGGTCGGCCGCGCTGCTCCGGCAGCGTGTCCACCTTCAGCAGCGTGGGCATCACGGACATGTACCCCGCGCTGCAAGCCACGCTCTACGGCGCTGTCATGTCCGTAGCCGTGCTCATGATGGAAAAAGTTCACTACAAACTTTTCATTGACAACGAGAAAAAGTCAACAATAGTTATTCTTGATAACATTTGTTGGGACAAAAGTTTTTATCGCGAGATATTTTTAAACTTTATTGTTTCAGCGGAGATGTTGAAATTGATAAAGGCTTTGTCGCGTAATAATGTTCGAGTGTCGTGTAAGACTTGGAATAAAAATAACCTACAGGATCACATGCTCTTATTTTTAACTGATTTAGATTGTCCAGGTGCTGAAGAATCGCTGAAGTTATCTCCTTATCTACGATATCCTTTCCGATGGCTAGCACTCACTAAGAGATCTGATGATATAAAATATATTTGGAAACTTCCACTGTTTGTTGATAGCGATTTTGTGCTAGCAAAAGAAATGGTAGACCACTTCTCTCTCACAGAACTGTATAAACCCTCGACATTTGGACCTATGAGCTCAATCGCTCGAGGTTATTATAATGGAAGTCTAATTGACACGAGGGAAAATAGAGAAATCTTTAGACGTAGGAAGGACATTATGGGACATCCCTTAACCATCTCCAACGTCATACAGGACAGCAACACTTCGCAATATCACATAATAAAAGAGAACAGATTGGAGCTTCATTACGATGGTACGACAAAACTATCCTATGTACACGTGCAAATAGCCTTCCAAATGCTAAACGCTACACCGAGACATGTCTTTAGTCATCGATGGGGTTACAAGAAAAACGGACAGTGGTCAGGAATGATTAATGATATAAATACAGGAAGAGCAGACCTAGGCACAAACTGCGTTCCGGCTGTCGAACGCCTCAGCGTAGTTGTCTTCACGGACTGCATCGCCAACTTCGAAGTTAAATTCATCTTTCGCCAACCACCACTCTCTTACGTGTCCAACATCTTTACTTTGCCATTTTCGAAAAGTGTTTGGATCGCCATAGCGACGTCATTTGCCATATCTACAATAACAATATATATAGCAACTAAATGGGAGGTCAGGACATTTAAAACGGCACAAAAAGATCCAATAAGGAAAGCAATATATAGGAAAATTAGTCCGGAAAAGGGCAAGGAGAATTTTTATAATTTCAATGAAGGAGTTGAACTCTTACGTCAGGGCTTATTTGCATTCCACGCAATTTTGGAACTGGTGTACTTACGCGTCGAGGAAACATTCTTGGAGAATGAGAAATGTGATTTGATGCAATTGGATTTTATTAACTCACACGACCCCTTTGTGCCAGTTTATAAACATTCCCCGTATTTGGAGCTGCTGAGAGTTGTGTTCAAACGTATCCGCGAATCAGGCATTCAGATGGCCAACCACAGGAGGTTTCAAGTTCCAAAGCCGCGATGCACCGAGAAGATATCAACCTTCAGTAGTGTGGGTATTGTTCACATGAAGCCAGTGCTGCTGTTTATAACTTACGGTTTCCTGGCGGCATTTCTCATAATGGTGGCCGAGATTTTCGTGTTTAGGATGAAAATGTTCAAGAGAAAGGAGTTGAAATACTTTTCTTTGAGGAATAGGCCCTCAAAAGAAAATTTGACCATAAAGTATCCTAATTAA

Protein sequence:

>DPOGS204648-PA
MQRLKNCYKFRTQTLQAQTIIVDLIRIGNFTTEAIKSLNHNPKCKLQMQRFDANHCDAYINKEDVNPIQEIVIEEQVKIEKEFINEQDTVAMGYTTDDSLPLETQRTKAKKTKKKKEKKIQEPKVDRRRKPFLNDDLNESLFTITDLTLEEQIADIQKRQESSNFKNSVYKCMECFKGFLDEGAYNGHMTRHTTSVYGVMEDYRGVRMLPVSARRRDLRKHNLTMANVITDSNETRQHLDDRLNLHQDSITKMSYVVAKICFDMLNATENRIFTHTWGYKDKNGNWQGIIDHLLKKKADLGTLTIFTQERMKAIDYIAMVGSTAVRFVFREPPLALLENIFTLPFTSAVWIAIGICVLGCAVFLYITSKWEATVGMHPLQLSGSWADVLILIIGAVLQQGCTLEPRYAAGRCVTLLLFVSLTVLFAAYSANIVVLLRAPSSSVRSLPDLLNSPLKLGASDFEYNRYFFKKLNDPIRKAIYSKKIAPSGKKPNFYSMKEGVEKIRKGLFAFHMELNPGYRLIQETYQEEEKCDLVEIDYINEIDPWLPGQKRSPYKDLFKISFIKIRESGVQSCVHRRLHVGRPRCSGSVSTFSSVGITDMYPALQATLYGAVMSVAVLMMEKVHYKLFIDNEKKSTIVILDNICWDKSFYREIFLNFIVSAEMLKLIKALSRNNVRVSCKTWNKNNLQDHMLLFLTDLDCPGAEESLKLSPYLRYPFRWLALTKRSDDIKYIWKLPLFVDSDFVLAKEMVDHFSLTELYKPSTFGPMSSIARGYYNGSLIDTRENREIFRRRKDIMGHPLTISNVIQDSNTSQYHIIKENRLELHYDGTTKLSYVHVQIAFQMLNATPRHVFSHRWGYKKNGQWSGMINDINTGRADLGTNCVPAVERLSVVVFTDCIANFEVKFIFRQPPLSYVSNIFTLPFSKSVWIAIATSFAISTITIYIATKWEVRTFKTAQKDPIRKAIYRKISPEKGKENFYNFNEGVELLRQGLFAFHAILELVYLRVEETFLENEKCDLMQLDFINSHDPFVPVYKHSPYLELLRVVFKRIRESGIQMANHRRFQVPKPRCTEKISTFSSVGIVHMKPVLLFITYGFLAAFLIMVAEIFVFRMKMFKRKELKYFSLRNRPSKENLTIKYPN-