Monarch geneset OGS2.0

DPOGS209174
Transcript | DPOGS209174-TA | 2793 bp
Protein | DPOGS209174-PA | 930 aa
Genomic position | DPSCF300061 + 46356-53427
RNAseq coverage | 62x (Rank: top 68%)
Annotation
Heliconius | HMEL009742 | 0.0 | 68.92%
Bombyx | BGIBMGA011528-TA | 3e-143 | 34.44%
Drosophila | CG3822-PA | 2e-141 | 33.22%
EBI UniRef50 | UniRef50_D6WAN3 | 1e-145 | 36.88% | Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WAN3_TRICA
NCBI RefSeq | XP_309276.4 | 2e-154 | 37.72% | Anopheles gambiae str. PEST AGAP012447-PA [Anopheles gambiae str. PEST]
NCBI nr blastp | gi|158287191 | 3e-153 | 37.72% | Anopheles gambiae str. PEST AGAP012447-PA [Anopheles gambiae str. PEST]
NCBI nr blastx | gi|270002209 | 3e-153 | 36.44% | hypothetical protein TcasGA2_TC001185 [Tribolium castaneum]
Group
Gene Ontology
GO:0016020 | 4e-67 | membrane
GO:0005234 | 4e-67 | extracellular-glutamate-gated ion channel activity
GO:0004970 | 4e-67 | ionotropic glutamate receptor activity
GO:0006810 | 8.8e-25 | transport
GO:0030288 | 8.8e-25 | outer membrane-bounded periplasmic space
GO:0005215 | 8.8e-25 | transporter activity
KEGG pathway
mdo:100020889 | 3e-124
K05202 (GRIK2) | maps-> Neuroactive ligand-receptor interaction
InterPro domain
[444-811] | IPR001320 | 4e-67 | Ionotropic glutamate receptor
[458-809] | IPR001638 | 8.8e-25 | Extracellular solute-binding protein, family 3
[72-390] | IPR001828 | 5.5e-23 | Extracellular ligand-binding receptor
[452-516] | IPR019594 | 7.1e-06 | Glutamate receptor, L-glutamate/glycine-binding
Orthology group | MCL34792 | Lepidoptera specific

Nucleotide sequence:

>DPOGS209174-TA
ATGTTTGTAATAATGTTGTGGTTATTTATTGTTTGTTTCGTTCAGCCGTCAACCTCTGAGATCCGTTATACGGAAACTAGACAGAGTTTTTATCAAATAGCTGGCATTTTTGAAAACAATACGCTCACACAACGGCTTGCTTTTAATGAGAGTATACGTTACGCCAATTTGGGAAAAGCTGAATTACAATCAATGTCTGATGCTCAAGGCGGCAAAGTATGTTCAAACACAACTATGCAACCGATAGCTATATTCGGTCCTCAGAATTCCGTCACCGACAACATCATACGCGATCAGTGCTACGTATCAAATATTCCGCATATTCAAGCTAACTTGCAGTTAGCGGACCCTGATATGGAGCTGAACCCAGTTGAAGCTACAGACGAAGAATCCACTGAAGATCAGGGGCTTAAGTTTAAAAAGATATCCATAAACTTTTACCCGCCGGCAGAGGATATATTGTACGCTTACGCCATGTTACTAAAGTACTACAAATGGAAAAACTTCGCAGTTCTCTACGAAGATGACTTAGGTCTGTTGAGAATACAGAAGATACTCTCTGAATATACAGAAGACTATCCCCTCACTATCAGAAGACTAGACCCGAACGCCGATAACCATAACATATTTAAAGATCTCAAGAAATTCAACGAATATCGGGTCATAATTGATTGTGATGTCGACCGCGTTGTCCAATACCTCACGGAAGCCAGAGAAGTGAACATGGTCAACCATTACGAGCATTACATTTTAATAACGATGGATGCGTCGGTGGTTGCGGAAGAACTGCGACAATTCCAATCGAATATAACTTGGCTCAGTATAACGGAATATGACAAGCTACAAAATTCCCAACATTTCCTGACGCCGAGGGTTGGAAGGTGGACGAGTGAAACGAGCGTCGTCTATCCACCTGTCACTGATATCAAAACGTCAGCGCTGCTAATGGATGACATAGCGAACCACGTATTAAAAGCGTTGCAAAAAGTGGAAATGGTGGAGAGCATTGAAAACAGAACAATGAGTGATATATGTGGACCGGAAAGCGAGCCGTGGGAGTTCGGGGCCTTGCTTCAGGATGAAATATTGAAGACAAAAACAACAGGCGTCACTGGAAATATAGAATTCAACGAATTAGGACAAAGAGTTAATTACACTCTTTATGTAAACGAAATTTATGTTTCAACGTTGGATACAATCGGCACTTGGGACTCGACAGCAAGAGGTGAAATTATTGAAGATAGACCTGAATCTGAAAATTATGATAAAAAGAAAAATGTTAAACATTTCTATATTATATCTAAAAAAGCGAAACCGTACTTTTACGATAAAATAAAATGTGCGGAAGATGATCCAGATTGTGTTGAAGAAAAAGCTGATGAAAATTACGAAGGATTTTCTGTGGACCTCGTCAAAGAAATCTTTGATACGTTGCGAAAACATAATTTTAATTTTACATATTCTTTTTTGCCAAAAACATACACTGATTACGGCAAATACAGACCGGAAGAAAAGAAATGGGATGGCTTGATAGGAGATCTCTTAGATAAGAGTGCCGATTTAGCTGTGTGCGACCTAACTATCACTGAGGAAAGAAAAAAAGTGGTCGACTTTTCAGTACCATTTATGTCTCTGGGTATTAGCATTCTGTATATTAAAGAAAAAGAAGTTGAACCAGCTATGTTCTCCTTTCTCAATCCATATACATTTGATGTTTGGATCCACACGGCAACAGCGTTTTGTGTTGTATCAATTATTCTTTTTGTGTGTTCGAGAATATCTCCAGCAGATTGGGAAAACCCACAGCCGTGTGATAAAGATCCAGAGGAATTGGAAAACATATGGAATTTCAAGAACTGTACGTGGCTCGCTATGGGGTCCATTATGTGTCAAGGATGCGACATCTTACCGAAAGCAATCGGCACACGTTGGGTTTGTTCTATGTGGTGGTTTTTCGCAGTTATCGTATGTCAGACATACATAGCACAACTTTCAGCTTC
AATGACCGAAGCTTTGGAAAATGAACCTATTACCAAGGTAGAAGACTTGTCCACACAAACCAAGGTCCTATATGGTGCGATCGATGGTGGTTCCACCCTTGGTTTTTTTAAGAATTCCAAGGATAAAATGTTCAATAAAATGTATGAAAATATGGTACAAAATTCAGCGGTTTTAGTTAAAACTAATAAAGAAGGTGTTAAGAGGGTTATAAAAGGCAACGGAAAATATGCCTTCTTTATGGAATCCACGTCCATAGAATACGAACTGAAAAGGAACTGTGACCTTAAAAAAGTTGGTGAGGAATTGGATTCTAAAGACTACGGCATTGCCATGCCCGCTAACTCTCCGTTCAGGAAGTATATCAACCGAGCTATTTTGGAACTGAAAGAATTCATGGTGTTAGATAAGATCAAACGAAAGTGGTGGGAGGAGAAGAATGTGATTCAACCGTGTGAGGTTGAAGAAGACAAAAACGATGTGGAGGGAGATCTTGAAATGAAAAATTTGAAAGGAGCTTTTGTTGTTCTCATAGTTGGGCTTGCTATCTCCATGGTAATTACTGCGTTTGAATTCATGAACGAAGTCAGAAATATTGTCGTGCGAGAACAGGTGTCTCACAAAGAAGTTTTTATTAAAGAACTGAAATCTTCGCTGAATTTCTTCCAACTTCAGAAACCGGTTATAAGAAACCCAAGTCGTGCGCCATCTGTAGCATCTTCTGGCAGTGAAAAGAAGAACAATAGAAATAATGCCATTGAGAACTTGTTAGAATTTGAAAAAGTGCAACAGTAA

Protein sequence:

>DPOGS209174-PA
MFVIMLWLFIVCFVQPSTSEIRYTETRQSFYQIAGIFENNTLTQRLAFNESIRYANLGKAELQSMSDAQGGKVCSNTTMQPIAIFGPQNSVTDNIIRDQCYVSNIPHIQANLQLADPDMELNPVEATDEESTEDQGLKFKKISINFYPPAEDILYAYAMLLKYYKWKNFAVLYEDDLGLLRIQKILSEYTEDYPLTIRRLDPNADNHNIFKDLKKFNEYRVIIDCDVDRVVQYLTEAREVNMVNHYEHYILITMDASVVAEELRQFQSNITWLSITEYDKLQNSQHFLTPRVGRWTSETSVVYPPVTDIKTSALLMDDIANHVLKALQKVEMVESIENRTMSDICGPESEPWEFGALLQDEILKTKTTGVTGNIEFNELGQRVNYTLYVNEIYVSTLDTIGTWDSTARGEIIEDRPESENYDKKKNVKHFYIISKKAKPYFYDKIKCAEDDPDCVEEKADENYEGFSVDLVKEIFDTLRKHNFNFTYSFLPKTYTDYGKYRPEEKKWDGLIGDLLDKSADLAVCDLTITEERKKVVDFSVPFMSLGISILYIKEKEVEPAMFSFLNPYTFDVWIHTATAFCVVSIILFVCSRISPADWENPQPCDKDPEELENIWNFKNCTWLAMGSIMCQGCDILPKAIGTRWVCSMWWFFAVIVCQTYIAQLSASMTEALENEPITKVEDLSTQTKVLYGAIDGGSTLGFFKNSKDKMFNKMYENMVQNSAVLVKTNKEGVKRVIKGNGKYAFFMESTSIEYELKRNCDLKKVGEELDSKDYGIAMPANSPFRKYINRAILELKEFMVLDKIKRKWWEEKNVIQPCEVEEDKNDVEGDLEMKNLKGAFVVLIVGLAISMVITAFEFMNEVRNIVVREQVSHKEVFIKELKSSLNFFQLQKPVIRNPSRAPSVASSGSEKKNNRNNAIENLLEFEKVQQ-
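
The reported lengths in this record are mutually consistent: a 2793 bp transcript corresponds to 931 codons, that is, 930 amino acids plus one stop codon. The following is a minimal sketch of how that check could be run on the two FASTA records above. It assumes the trailing "-" in the protein sequence marks the stop codon and that the transcript sequence is the coding sequence only; the helper names read_fasta and cds_matches_protein are hypothetical and not part of any geneset tooling. The demonstration uses a toy sequence rather than the real ones.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


def cds_matches_protein(cds, protein):
    """Return True if the CDS length equals 3 * (residues + 1 stop codon)."""
    # Assumption: a trailing '-' or '*' in the protein marks the stop codon.
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Lengths reported in this record: 2793 bp transcript, 930 aa protein.
assert 3 * (930 + 1) == 2793

# Toy example (not the real sequences): ATG AAA TAA encodes M, K, stop.
toy = ">toy-TA\nATGAAA\nTAA\n\n>toy-PA\nMK-\n"
seqs = read_fasta(toy)
print(cds_matches_protein(seqs["toy-TA"], seqs["toy-PA"]))
```

To apply the same check to this record, the text of the nucleotide and protein sections could be passed to read_fasta and the DPOGS209174-TA and DPOGS209174-PA entries compared with cds_matches_protein.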