Monarch geneset OGS2.0

DPOGS209941
TranscriptDPOGS209941-TA2769 bp
ProteinDPOGS209941-PA922 aa
Genomic positionDPSCF300148 - 407055-413448
RNAseq coverage338x (Rank: top 34%)
Annotation
HeliconiusHMEL0099922e-16379.88% 
BombyxBGIBMGA011486-TA5e-17640.34% 
DrosophilaCG5621-PB3e-15738.83% 
EBI UniRef50UniRef50_UPI0002060F5A4e-15637.62%UPI0002060F5A related cluster n=2 Tax=unknown RepID=UPI0002060F5A
NCBI RefSeqXP_001655460.11e-16441.38%ionotropic glutamate receptor subunit ia [Aedes aegypti]
NCBI nr blastpgi|1571296953e-16341.38%ionotropic glutamate receptor subunit ia [Aedes aegypti]
NCBI nr blastxgi|1571296958e-15941.25%ionotropic glutamate receptor subunit ia [Aedes aegypti]
Group
Gene OntologyGO:00160201.2e-68membrane
GO:00049701.2e-68ionotropic glutamate receptor activity
GO:00052341.2e-68extracellular-glutamate-gated ion channel activity
GO:00068104.8e-22transport
GO:00302884.8e-22outer membrane-bounded periplasmic space
GO:00052154.8e-22transporter activity
GO:00048727.3e-09receptor activity
GO:00068117.3e-09ion transport
GO:00052167.3e-09ion channel activity
KEGG pathway 
InterPro domain[392-760] IPR0013201.2e-68Ionotropic glutamate receptor
[402-467] IPR0195943.1e-27Glutamate receptor, L-glutamate/glycine-binding
[394-758] IPR0016384.8e-22Extracellular solute-binding protein, family 3
[50-344] IPR0018282.6e-10Extracellular ligand-binding receptor
[524-549] IPR0015087.3e-09NMDA receptor
Orthology groupMCL25826 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209941-TA
ATGGAGAGTGTATGTTCAGGAGTGGCTGAAGGCATTCCTGAAGTTGCCGTCCTTCTGGAGCAGGCCGTAGCCGCGGGTGGAGGAGCTGCGGCCGGGGAGCGCGGACAGAGCCTCGAACCCTATGAACCCTTAGCGGTACCAGAACATATCTGCACTCAGGCCAGTGAAGGCTTACTGGCCAGTGTCGGCGGTGTGGAGGCCGCGGATGCCGCGGCGCGGGCAGGCTTGCTGTTGTTGCTGGCGTCTCCTGTCGCCGTCCCGTCCCCGGCGATGACCATCGCCTCCACCGAGCCGGACCATCCCTTGTCAGCAGCTCTTGAATTTTATCCACGATATGATGTTCTCGCAGAGGCCTGCGCAGCTCTGTGTGAAGCGAAAGGTTGGAAGCACGCCGTGTTACTGCACGACGGAAGCGGCAGCGCGGCGCCGCTCATAGTGCCCGATCACGACACCCTCGCGCTCCGTGTCCGCCAGCTGCCCTCGAGAGAGGACGACGACGCCTTGAGGAATCTTCTCCTCGTACTGAAGAAGTTCGGTGCTGTAAACTTTATCGTATGGTGTTCGGCGGAGTGCTCGGTGCGAGTGCTGGACGCGGCCCAGCGCGTGGGGTTGCTTTCCGAGCGACACTCGTATTTGATGCTCTCGCTGGACCTGCATACACTGCCGCTCGAGGACTTCAGCTACGGTGGAGCCAACATCACCTCACTGCGTTTATTTGACCCTGAATCTTCTGCAGTAAATGTGTCTATGGAAAAATGGCAGCAACAGTATATAAATCTTCTAGGAAACGAAGCAAATGAAGAAATTGATAAGATTATTTCGAATCCTCCGACTTCATTACTTTTATCATATGATGCTGCAAAAATTGTATCCGAAGGAATGGAGTACCTCGACCTTCCATTCATGGAAGACTCTCCGTCTTGCCAGCAAGGCACTGCCGCCTTCCACGCCGACACACTTCTGAACTATTTACGATCGGAAGAAAACAGTGGCGCAACCGGACCTCTTTGGTGGGAAGCGACGGGAGCTCGCGGTGGTGTGCGACTACACGTGGCGGAACTGGAGCGAGGCGGGTTTCTGAGAGCCGCAGGCGACTGGTCCCGGACGGGGGGACTGACGTGGCGGCCTCGACCACCCGCACCCCCGCCACCACCCGACGCCATGACCAACCGCACCTTCACCGTCCTCATCGCTCAGAATCAACCGTACGTCATGAGACAGCAGTCCTCCGAACGACTCTCAGGCAATGCGCGTTATGAAGGCTTTTGTATAGAGCTGGTGGACCGTTTGGCTCAGCTGTTGCACTTCAACTACACGTTCATAGAGCAAGCGGATCGCGCCTACGGGTCCCTCAACAAGACCACCAAGCAATGGAACGGCATGATGAGGCGCCTCATGGATGATAAGAATGTAGACTTCGCTATAACGGATCTCACGATAACGGCGGAGCGAGAGGAAATCGTGGACTTCACGACGCCCTTCATGACATTAGGTATAAGCATCCTATTTCACAAACCTCAGCCTCCTGCACCGGAACTACTGGCCTTTCTGTTGCCCTTCTCTAACGGGGTTTGGATGTGTCTGGGGCTGGCGTACGTGGGTTCGTCCTTGGTGCTGTACGTAGTGGGTCGGCTGTGCCCCGAGGAGTGGCAGAATCCCTACCCCTGTATCGAGGAGCCATCTGCACTCGAGAATCAATTCACTTTAGCTAATGCTCTGTGGTTTAACCTGGGAGCTGTACTTCAACAAGGTTCTGAAATCGCACCGATTGCTTACGGTACTCGTGCAGTGGCCAGTATTTGGTGGATGTTCGCGTTGGTGATCACGAGTTCCTACACAGCCAACTTAGCCACGTTGTTGGCCTCTAAGACCTCCACCGAGCTCATCCGCAACGTGCGCGAACTCGCCGAAAACGACCAGGGCATCACTTACGGGGCGAAATCTAGTGGCTCAACTTACACTTTCTTTGAAATGTCAAGCAGCGAACCATACAAAAGCATGTTCCAAAAAATGAAAGACGTCACAATGCCTTCGACTAATGAGGAAGGAATAGAAAAAGTAATGAATGAAAAATATGCGTTCTTCGCGGAGTCCACGACTATCGACTACACGACGGAACGTAACTGTGAGGTCACAAGAGTGGGAGATCTCTTAGATAGTAAAGGATATGGCATTGCAATGAAAAAGAACTCACCGTACCGACAGGCGTTGAATCTAGCACTGCTGAATCTGCAAGAGGCGGGGATTCTTAGGGAGATGAAACATCGCTGGTGGAAAGAAATGCATGGGGGCGGTGCCTGTCAGGACAAGGAAGACCACGCCACCGAGAGACTAACAATCGACAACTTCAAGGGTTTGATTCTCGTGTTGACGGTGGGCTGCGCTCTGGGTATAGTCATGTCTTGTTGTGACTTAGCGTGGAGCGCCTGGCGTCATCCGCGCGATCCGACGCGGTCCTTTGCTGCGAGCTTCTGGTCTGAGCTGCGATTCGTGTTCCGTTTCGAGCAATCAGAGAAGCCAGTCCGCGGTGCCCTGACCCCAGCTCCCAGCTCGCACGATTCGCCTCCTTCGGCACACTCCGAACGCTCGGAGTTGACAACGGGGAGTGGAGTCGACGGGAGGGGTAGGGGGAGAGAGGAGGACGACAACCACGGAGAAGACGATGTGGGCTCACGCTTCTCAGCGCGCTCGCGACGGACCAGCGCACGGCGATGTTCCATGCACGCCGCCAGTTTGAGACTGGCGAGACACACCACACCCCGGCGATGA

Protein sequence:

>DPOGS209941-PA
MESVCSGVAEGIPEVAVLLEQAVAAGGGAAAGERGQSLEPYEPLAVPEHICTQASEGLLASVGGVEAADAAARAGLLLLLASPVAVPSPAMTIASTEPDHPLSAALEFYPRYDVLAEACAALCEAKGWKHAVLLHDGSGSAAPLIVPDHDTLALRVRQLPSREDDDALRNLLLVLKKFGAVNFIVWCSAECSVRVLDAAQRVGLLSERHSYLMLSLDLHTLPLEDFSYGGANITSLRLFDPESSAVNVSMEKWQQQYINLLGNEANEEIDKIISNPPTSLLLSYDAAKIVSEGMEYLDLPFMEDSPSCQQGTAAFHADTLLNYLRSEENSGATGPLWWEATGARGGVRLHVAELERGGFLRAAGDWSRTGGLTWRPRPPAPPPPPDAMTNRTFTVLIAQNQPYVMRQQSSERLSGNARYEGFCIELVDRLAQLLHFNYTFIEQADRAYGSLNKTTKQWNGMMRRLMDDKNVDFAITDLTITAEREEIVDFTTPFMTLGISILFHKPQPPAPELLAFLLPFSNGVWMCLGLAYVGSSLVLYVVGRLCPEEWQNPYPCIEEPSALENQFTLANALWFNLGAVLQQGSEIAPIAYGTRAVASIWWMFALVITSSYTANLATLLASKTSTELIRNVRELAENDQGITYGAKSSGSTYTFFEMSSSEPYKSMFQKMKDVTMPSTNEEGIEKVMNEKYAFFAESTTIDYTTERNCEVTRVGDLLDSKGYGIAMKKNSPYRQALNLALLNLQEAGILREMKHRWWKEMHGGGACQDKEDHATERLTIDNFKGLILVLTVGCALGIVMSCCDLAWSAWRHPRDPTRSFAASFWSELRFVFRFEQSEKPVRGALTPAPSSHDSPPSAHSERSELTTGSGVDGRGRGREEDDNHGEDDVGSRFSARSRRTSARRCSMHAASLRLARHTTPRR-