Monarch geneset OGS2.0

DPOGS210850
TranscriptDPOGS210850-TA1905 bp
ProteinDPOGS210850-PA634 aa
Genomic positionDPSCF300027 + 501137-506360
RNAseq coverage364x (Rank: top 33%)
Annotation
HeliconiusHMEL0217030.065.47% 
BombyxBGIBMGA006978-TA4e-14464.01% 
DrosophilaCG7896-PA3e-2028.34% 
EBI UniRef50UniRef50_Q178X42e-6531.82%Putative uncharacterized protein (Fragment) n=1 Tax=Aedes aegypti RepID=Q178X4_AEDAE
NCBI RefSeqXP_001651397.13e-6631.82%hypothetical protein AaeL_AAEL005739 [Aedes aegypti]
NCBI nr blastpgi|1571111156e-6531.82%hypothetical protein AaeL_AAEL005739 [Aedes aegypti]
NCBI nr blastxgi|1571111153e-6631.82%hypothetical protein AaeL_AAEL005739 [Aedes aegypti]
Group
KEGG pathwaybta:5201894e-21 
 K04308 (LGR5, GPR49)maps-> Neuroactive ligand-receptor interaction
Orthology groupMCL18852 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210850-TA
ATGCGCGCCATCAAACTTTTGACGTTTTACTTTTTAATTTGGAACCCGGTGCGGACGCAGGAGATCACAGAGGAATCCAGATTAATAAAAGTTTGTTCATATTGTACGTGCAGTGAAATACCAGAAGTGGATGGCACACATTTGGTGTTAAATATATTGTGTTCTGAATTGGATCGCATAGAAAATCTCGCTGATTTGGATAAAATTCAATGGCCGGAAAATCCTAATGGTTTGAAAATATCCGCGACTTTTGAGGGGATGGGTCTATCCACTCTAGGCAAATTACCACCTAATTCTCAAGTAGAGACGCTCAGATTCACAAATAATGCTATCAAAACATATTGGCCCGATCCATTCAGCGATGTTCCCAACCTGAAGAGAATATCGTTCACACAAAACGAACTCTCAGAGATCACTCCAGACCTCTTTACGAAGATAGAGAGCTTGGAAGATTTAGATTTGTCGTATAATAAAATCGGAGACATAAATCCTCTAGATTTTAAATTTTTACACAACCTAAAGAGGTTGAATTTGCAAAGTAATCTTCTAAAGAAAATACCAGTAGCTTCACTCGAACCTGTGACAGTGTTGGAAGACTTGGACCTCAGTAAAAATGGAATTCAGGAAGTGTTGCTCAGACGGGTGGAGAGTGTAACGTTGAAAGGAATTAAAAGGTTAAATTTAAATAGCAACAGAATACGATCAATACTTAAAGAATCTTTTCCGGATAACAACAGCATAGAACTGTTGGATTTATCTAATAATATAATTGAGATGGTCGAAGAGGATGCGTTGTCCTCGTGCATCAATTTAAGAGAATTGAACCTTGCGCAGAACAACATAACGTTTCCTTTCGCTGTTCCGCCGACACTTCAGATCGCTATATTGAAGATAAACACCTTGTACCACTGGCTGAACTTCCCCGCCGGCATCACCTACATAGATCTTTCTTATAATCGTTTATCAGCTCTGTATAATGAGGAGACTGTTGATTTTAATAACCTTGAGGTTCTAAGTATAGGCGGTAATCAATTACGAGATTTTGATATACAAAGAAAACTTCCAAAGCTATTCAGCTTAGATATATCTTATAATCTGTTACAAGAAGTGCCAAAATGTCTGAGCAGTGAAATTCTCCCGAATTTGGAAGAGTTGCGTTTAGATGGCAACCCAATGGAGAGTATTTATTTTAAGAATATAATAGCCTTAAAGTATTTATATATGAACGATCTGATTAAACTGACAGTAGTAGATGATAAGGCATTCAGTAATGTTATTGGCAGACGCGGTGACGACGATGCTAATTCAGAGAAGAGTTGTTTTTCTCTCTATCTCTCTCATAATCCATCCCTCAGTAACATCCAAGATGGAGCATTCGACGGTACAAACGTTTGTATGTTAGACATAAGTCACAACAACCTTAGCGTACTGTCTCGTTCGGTGATGTCTCCTCCGCCGACGGAGGGAGTTGATCTCCAATACAACCCATGGCGGTGTTCCTGCGAGATGCAGTGGATTGTTGACGACCTGCTGCCAGTGCTGTATAGAGACAGCCCTCGGTTATTAGATGAGCTCAGATGCGGTTCTCCACGAGCGCACGAGGGTCTTCGTCTTGTACATTGGTACAACTGGACAGGACGCGCTTTGTGTGACCAAAGAGCGCTTGGAGGCTACGAGATTGAATCCTCAACGGAGCCGAGTAAAGTGACTAATCTGACCCTCATTTTGGGAGGGTGCGTCATAGTTGCTCTCCTCATAGCCATTGCTTTGTTTGTGTACTTGGTGAAGAGTCGAAGAAGACACAGAATAAGACAGGCCGCTCTCAATCGCAAGAGACAAAGTTCTAGTGACGCTAAAAATACCAACGGGCTACACAACGAATTCGCAGCTCTGAATAAGACATGA

Protein sequence:

>DPOGS210850-PA
MRAIKLLTFYFLIWNPVRTQEITEESRLIKVCSYCTCSEIPEVDGTHLVLNILCSELDRIENLADLDKIQWPENPNGLKISATFEGMGLSTLGKLPPNSQVETLRFTNNAIKTYWPDPFSDVPNLKRISFTQNELSEITPDLFTKIESLEDLDLSYNKIGDINPLDFKFLHNLKRLNLQSNLLKKIPVASLEPVTVLEDLDLSKNGIQEVLLRRVESVTLKGIKRLNLNSNRIRSILKESFPDNNSIELLDLSNNIIEMVEEDALSSCINLRELNLAQNNITFPFAVPPTLQIAILKINTLYHWLNFPAGITYIDLSYNRLSALYNEETVDFNNLEVLSIGGNQLRDFDIQRKLPKLFSLDISYNLLQEVPKCLSSEILPNLEELRLDGNPMESIYFKNIIALKYLYMNDLIKLTVVDDKAFSNVIGRRGDDDANSEKSCFSLYLSHNPSLSNIQDGAFDGTNVCMLDISHNNLSVLSRSVMSPPPTEGVDLQYNPWRCSCEMQWIVDDLLPVLYRDSPRLLDELRCGSPRAHEGLRLVHWYNWTGRALCDQRALGGYEIESSTEPSKVTNLTLILGGCVIVALLIAIALFVYLVKSRRRHRIRQAALNRKRQSSSDAKNTNGLHNEFAALNKT-