Monarch geneset OGS2.0

DPOGS214060
Transcript: DPOGS214060-TA, 1260 bp
Protein: DPOGS214060-PA, 419 aa
Genomic position: DPSCF300171 - 54632-57629
RNAseq coverage: 402x (Rank: top 30%)
Annotation
Heliconius: HMEL004979, 5e-179, 87.80%
Bombyx: BGIBMGA010397-TA, 0.0, 75.95%
Drosophila: CG9304-PA, 2e-128, 55.36%
EBI UniRef50: UniRef50_Q9W2B3, 2e-126, 55.36%, CG9304 n=28 Tax=Neoptera RepID=Q9W2B3_DROME
NCBI RefSeq: XP_967964.1, 1e-155, 68.38%, PREDICTED: similar to AGAP011701-PA [Tribolium castaneum]
NCBI nr blastp: gi|91078468, 2e-154, 68.38%, PREDICTED: similar to AGAP011701-PA [Tribolium castaneum]
NCBI nr blastx: gi|91078468, 2e-161, 67.25%, PREDICTED: similar to AGAP011701-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [96-349] IPR019336, 4.1e-74, Rhodopsin-like GPCR transmembrane domain
Orthology group: MCL15941, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214060-TA
ATGTTTCAACATTTAAATGGTTCAGCATACCATCCAAAGTGCAATGCATTAGGTCAGGACTTATTTCGAAGGATACCATGTCCGATAGGCAAACTATGCGTTGATGAGGATACACCATGGAATGTTATTAAAAACAACCAATTTACTTATGTCATACAAAATAACGGGCAACCAAGATATTGGTATGTGTCGATAGTATCTTGTTATTTGGATGAAGAAACGTGTTCCTGGCATCATTACTCTGGAGCTCCGTCAAAGGACAACACAACTCTCACAGATATCCCACAAACTCTAGAATATGATTTCTGGCTTGTTAACGGAAGCCCGAATCTGTCAATTTATAATTCTATGTTATATCAGTTTTCGTTTGACAGACAGAATACTTTGGAATTATATTTAATGTTCTGGCTCTGCTATATCATTTTATTACCCGTTCAGATATATGCTGTGAGAACTCAGAGACATCCTGTTACTAAATTATTTACATCAAGCTTGGTGTTAGAGTTCATAGCTCTGTGTTTTAATGTACTTCACACAGTGAAATTTGCTGCGGACGGTGAAGGCTTTGAAGGCCTGTCTGTCGCTGGTGATATTCTCGATATACTGAGTAGGACACTGTTTATGCTACTCCTGCTTTTATTAGCAAAAGGTTGGGCTGTGACACGACTTGAACTGACATACAAACCATTGGTATTTGGAGTGTGGTTAGTATATGGTGTAGTCCATATACTGTTGTATGTTTGGAACACTACTGAAGTGGATATTATAGAAGAGATAGATGAATATCAAACATGGCCGGGATGGCTTGTCCTAACTCTGAGAGTTGTTATAATGTCATGGTTTGTGCTTGAGCTGCGCAACACTATGATGTATGAACACAATATGCCGAAACTTAACTTCCTCCTGCACTTTGGTGCATCAAGTCTTGTGTGGTTCGTCTACTTACCAATTATAGCACTAATCGCCCTACAAATAAGTCCTTTGTGGAGATTTAAATTTCTTCTAGGTATAACTTACTCAGCAGACTGTTTGGCGTTCTGTGTGATGGCTCATCTCTTGTGGCCAACACGCTCCGAACAGTACTTATTGCTCACACCTTCTGACTATACAGCCGGAATTGAAGAATTGGATGAGTTCAGTGAATCCCCACATGTGGTACACAGTGACACAGTTGCTTTAACAACTGCTGAGAGTGTTAATGCTGATGAAGAAGAAGTTATATTTACAAGACCTCATAAAAACGGGGTTATCTTCTCATAG

Protein sequence:

>DPOGS214060-PA
MFQHLNGSAYHPKCNALGQDLFRRIPCPIGKLCVDEDTPWNVIKNNQFTYVIQNNGQPRYWYVSIVSCYLDEETCSWHHYSGAPSKDNTTLTDIPQTLEYDFWLVNGSPNLSIYNSMLYQFSFDRQNTLELYLMFWLCYIILLPVQIYAVRTQRHPVTKLFTSSLVLEFIALCFNVLHTVKFAADGEGFEGLSVAGDILDILSRTLFMLLLLLLAKGWAVTRLELTYKPLVFGVWLVYGVVHILLYVWNTTEVDIIEEIDEYQTWPGWLVLTLRVVIMSWFVLELRNTMMYEHNMPKLNFLLHFGASSLVWFVYLPIIALIALQISPLWRFKFLLGITYSADCLAFCVMAHLLWPTRSEQYLLLTPSDYTAGIEELDEFSESPHVVHSDTVALTTAESVNADEEEVIFTRPHKNGVIFS-