Monarch geneset OGS2.0

DPOGS200893
TranscriptDPOGS200893-TA1989 bp
ProteinDPOGS200893-PA662 aa
Genomic positionDPSCF300066 - 414381-426671
RNAseq coverage269x (Rank: top 40%)
Annotation
HeliconiusHMEL0122250.081.84% 
BombyxBGIBMGA000544-TA0.073.30% 
Drosophilaltl-PA4e-9734.72% 
EBI UniRef50UniRef50_E0V9S52e-12748.43%Leucine-rich transmembrane protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0V9S5_PEDHC
NCBI RefSeqXP_392419.36e-12943.16%PREDICTED: similar to CG32372-PA [Apis mellifera]
NCBI nr blastpgi|3454973791e-12943.78%PREDICTED: leucine-rich repeats and immunoglobulin-like domains protein 1-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|3071735086e-13143.37%Leucine-rich repeats and immunoglobulin-like domains protein 3 [Camponotus floridanus]
Group
KEGG pathwaybta:5054232e-22 
 K04309 (LGR4, GPR48)maps-> Neuroactive ligand-receptor interaction
Orthology groupMCL15648 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200893-TA
ATGTCCGCGCTCGTGATCGCCGCTGCCCTGCTTTGCGCGTTTGCCGGCCACCTTGGTGCCGGAGCTAGTCTTCACTGCGCTATGAGAAGAGAAATTTCACCATGTACATGCAGACGCGAGGATATTGGTACCGGTGCCATCCTTGTCATCTGTCAACGTATCAGCACCTATGAGGATATCGCGAAGGCCCTCACCAACAAGTTCAGTCCCGAGACCAAAATCGGTCTCGATATATCGAATTCGGAGCTACCCGATTTCGCTGACCACAGTTTTCGGGAGCTTGGACTCTCTATCACGAAGCTCAAACTCAACTTCGACAACTTAAGGCAAGTCGATTTTAATGAATTAAAAGAAAGTGCATTTACTAAGTTGGATCTAGTTGACTACTTCAGCTTGGCCGATAATTCGTTGTCTGAGATGCCGCGACATGTTCTCAGACATCTTCCTCATGTCAAGACAGTTGACTTATGCAGGAACAAGATAACTAAGTTGACGGAAGAAGATTTTAAAGACATTCAAGAAGTGGAACACTTGCTTGTTGCTGATAATCAGATATCAAAGATTGAAAGACATGCAATACCTAAACGGTTAAAGCACGCTCATCTTGGTGTCAACAAACTAAACTCTTTAAACGGAGCTCTGCGTGATTTAGATAGTTTGGATTGGATATTTATAAACGCTAATCATTTAAAGTCCATTGATAACGAACTTCCTGTGAAGGCTAAAAAGATTGTTCTCATACACGCTGCACACAACGAGCTTACGAACCTACCAAAAGATCTTCGACAAATGCCGTCCCTGGAGTCTTTGTACTTTTACGATAACAATATAAAGTCACTTGATGGTGCTTTACAAAAATCGAGAAGATTAAAGACAATAAGTTTATCTTTCAATAAGATTGAATCACTAGCTGAAGATGAGTTTTCTGAAGCAGAAATGTTAGCAGATTTAGATATCGGCTATAATCAGTTAAGATCTTTAAATGGCTCCTTAAGAGCCTTAAGATCACTTCGATATCTGAACTTGACTCACAATTTTCTTACTGAATTTTCTTTACAAGAAATCAAAGGCTTAAGGGGATTAGCCGTTATTGACCTTTCTCATAATAAAATAACAAAAATTTCCGGCAGTATAGAGAATTTGGTGGACGTTGAAACACGCGTAATGGAATTGCGGTTGAATCATAATCACATACTTAATTTAGGCGGTGCTTTAATGGGCTTACGAGGGCTCTTGAGACTTAATTTAAGTCATAACCAATTACAAAAAATATCGTCTGATGATCTCATTGGTTTAGAAGACCTTCGATTACTTGATGTGTCTTACAATCATATAACGACAATGGCGGATACCTCAAAGGCATTTTTACCGTCACTTGAAGAACTCATTGCTCATCACAATAATGTAACAACTTTGGATAAGGATTTCCACGGCTTTCCATCACTCTGTATCGCAGATCTTTCTTATAATGAAATTCAGTCTGTTAACTATGAAGTTGTGTCGAAATCGAGATGTACAATTAATGGAGTTCCCAGTATTTTGAAAATTTATCTTCAAGGGAACCCTGTTCTCTGCGATGAGCGCCTGTACGAGTTAGTAGCTTTATTGGAGAATCTAAATGCACGTGTCAGCGGCGTTTCAACCTGCGTGGCAGCTCAGACTTCAGCCCCAGTTTTAATGCGTGCCCTTAATGATATAGTACCAGATGCGCCAGTAATCGTTGTCACACACATGGGGACCGGGCTTCGAGTTTTGCAAAGAGAAATGCAACAGTCACAGCTTCCGTCAGCTTATCGCCGCGTGGGGGCGCTCATTGGACACGTTTTACCGGAGAGAGAAGGTGGATTGCCTGTCCTGGTTGAGCCACCAGTCACGCTCCCACCGTCTAACACCAATGTGGTAGTTAAATGGCCGGATGTCAAGCGGGAACAGCCACACCCTCTTGCCGATCACCTTCGCCTTCGTGACCTAAACTCTTCACCGCAGTGA

Protein sequence:

>DPOGS200893-PA
MSALVIAAALLCAFAGHLGAGASLHCAMRREISPCTCRREDIGTGAILVICQRISTYEDIAKALTNKFSPETKIGLDISNSELPDFADHSFRELGLSITKLKLNFDNLRQVDFNELKESAFTKLDLVDYFSLADNSLSEMPRHVLRHLPHVKTVDLCRNKITKLTEEDFKDIQEVEHLLVADNQISKIERHAIPKRLKHAHLGVNKLNSLNGALRDLDSLDWIFINANHLKSIDNELPVKAKKIVLIHAAHNELTNLPKDLRQMPSLESLYFYDNNIKSLDGALQKSRRLKTISLSFNKIESLAEDEFSEAEMLADLDIGYNQLRSLNGSLRALRSLRYLNLTHNFLTEFSLQEIKGLRGLAVIDLSHNKITKISGSIENLVDVETRVMELRLNHNHILNLGGALMGLRGLLRLNLSHNQLQKISSDDLIGLEDLRLLDVSYNHITTMADTSKAFLPSLEELIAHHNNVTTLDKDFHGFPSLCIADLSYNEIQSVNYEVVSKSRCTINGVPSILKIYLQGNPVLCDERLYELVALLENLNARVSGVSTCVAAQTSAPVLMRALNDIVPDAPVIVVTHMGTGLRVLQREMQQSQLPSAYRRVGALIGHVLPEREGGLPVLVEPPVTLPPSNTNVVVKWPDVKREQPHPLADHLRLRDLNSSPQ-