Monarch geneset OGS2.0

DPOGS210693
TranscriptDPOGS210693-TA3177 bp
ProteinDPOGS210693-PA1058 aa
Genomic positionDPSCF300013 - 613694-620874
RNAseq coverage1299x (Rank: top 10%)
Annotation
HeliconiusHMEL0205850.072.44% 
BombyxBGIBMGA006308-TA0.066.49% 
Drosophilachp-PA0.039.70% 
EBI UniRef50UniRef50_E2A0X80.041.29%Chaoptin n=8 Tax=Formicidae RepID=E2A0X8_CAMFO
NCBI RefSeqNP_001107810.10.040.48%cell surface protein chaoptin [Tribolium castaneum]
NCBI nr blastpgi|1672343670.040.48%chaoptin [Tribolium castaneum]
NCBI nr blastxgi|3071881990.040.43%Chaoptin [Camponotus floridanus]
Group
KEGG pathwaytgu:1002271761e-26 
 K04309 (LGR4, GPR48)maps-> Neuroactive ligand-receptor interaction
Orthology groupMCL12740 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210693-TA
ATGATACTTGTTGCATACACTACCAAACAAAATATTAACAAAACCAAAACTTTGTATGTTAATCAAATCTTAGTATGTCATATCAATTTAAAAGAAGATGCGTTCCGAAATGCAAAGATTAAAACATTGTCTCTACGAGACTGCGGCGTCACGGACCTATCTCCAGCATCATTCGCCGGCCTTGAAAACACTCTTCAGTCCCTAGATTTATCAGAAAACAACCTAACAATGATATCCAAATTCATGTTAAACAAGTTAGACTCGTTGCGGTTCTTGAACCTAAGAGAAAATAAGGTGGACACGAATTTACTAGCAACGAATAATCCATCAGAGTACTCGACGCCATCGATAAATAACTTTCAGTATAAGTTGTTCTTCTTGGACATCAGTGGCTCATCGTCTCTTGAAATTAGCTTGCAAGATGTGAGAAGAATGCGTTCCCTCAGATATTTGTCTGTGAGTAAATTGATAAGACGCAGCATATCTGCAGAAGATTTCCTAGAATTCGGCGTGGAATTGGAGGATCTAAAAATAATCGGAAGCACTATCAATCGAATCGAAGCGAGCGCCTTTCAACATGTACGGACTATAAAGTCCTTAGATCTGTCTGAAAATAACATTGACTTTATAGACCCATTCGCGTTTGCGGAGTTACATAGCTTGACATCATTGAAATTAGCCAATGGATTAGCAGATTCCGTAAAAATATTGCCATTTGAACCTTTGAAAGCACTTATAGAATTGCAGGATTTGGACGTTAGTAATAACAAATTGAGGAACGTCCCAGACACATCTTTCCACTTCTTATATAAGTTAAAAACATTGAACCTTCAAGATAATCTCATTGATCACTTCTCTAAAGGAACATTACAGAGTGACATACATCGACAGCTAGAAAGCGTTTCTCTATCGCTAAACCAAATGCAACGAATTGATCAACATACATTTGTCAATTTACGAGAATTACAGGAAATTTTAATCGAAGATAATCTAATAGAAACAGTGCATAGACGTTCCTTCACAAGTTTAGATAACCTGAAGGTGATTCGATTAAGAGGAAATATTATTACTGAAATTAGTGAAGAGGCATTCCAAAATCTACCGGCCTTGAAAGAGTTAGATATATCATTTAATCAATTGGAGACGTTCAAGTTTTCGATATTTGATCAAGTTGGATCTGCGACGGCCTTGAAAGTAAACGTGTCATACAACAGAATAGTTTCATTGACTGATTCAAATGCTGTCAATTTCTTCTCTTCAAACTTTTATCCTCCACCTAAAGCGCAAAGATTAGTTTCTGAGGATCCCAGTCCTCTGCGTATAGAAAGAGGACTTGGCACGGTATCAGTGAATATAAGAGTTTTGGACTTCTCACACAATAACATTTCATACATCGCGCCATACTACTTCAGACACGCGGACCTGACGTTATCCGAGTTGCACCTCTCCCACAATATGATCCGTAATATAACACGAGAAGTGTTTGGGTCGATGCTAATGTTGCAATACTTGGATTTATCGCATAACCAAATATTCCACATGGAGTATGACTGTTTTAAGAAAGTTAAAAGATTGCAAATAATAGACTTGTCCCATAATCACCTGTTCGATACACCGGTGGAAGTGTTCCACGAGATGCAGGGACTTACTACAGTGGATCTTTCGGACAACAACATCAAAAACTTAGCAGATAATCTCATCATATCTCCAGCTTTAGAGAGGCTAGACCTATCTGACAATGATTTGTCACGAATACCAACGAATTGTTTATCTCCGGCTGCTGCTATTAATCTAGTAGAACTAGATTTGAGCGGGAACAACATACCCGCTGTAGCTATTGCTGACTTAGTCCAAAGATATAGGCACGACGACTGGCCCGAGGAACCGGACTACAGTGACGAATACATGTACCACACGGCTAGGCGCGACCACGCCAGAGTGTTCCATCAAAAAAAACAATACCCGCAGAACATATTGTTTAAGTCGCTTGCGTGGTTGGATTTGTCTGACAATCACTTGGTGAGAGTTGAAAGCGGTTCTTTTGCTGCTTTACCAAAACTCCGATGGTTGGATTTAAGTATGAATATGCCCTTTAACAACAATGACCGCGGAAGCAGTTTATTTAAAGGTTTAGAAAGAAGATTATCTCATTTGGGACTAAAGAATGTTAGTCTCACAAATATCCCATCAATGCCGTTGCCGAAGTTAAAAAGCCTAGACCTATCATACAACAACTTTCCCTCCATTCCGACCGACATGACGGCAAACTTGACTCGTCTCAGAGCTTTGGATTTGTCTTATAATGATTTGACTAATGTTCCCGTAGCGACTCACTCCCTCAGCGAACTTCGTTGGTTGTCTCTATCTGGGAATCCAATCACTGCCCTTATGAACACTAGCATGTACGGCGTGTCTCCGAGACTAGAATATTTAGACGTAACTCACCTAAAATTGAGTATACTAGAGGCCGGGGCGTTCAGCAAAATGTACGGATTACGCACTCTTAAAATATCTGTTAATGGAAATATAAGAGACTTCAATATTCCAAAGATATTGACACACAATGACGCATTGAAGAATTTGTATTTACATATAGACAATTCTCAAATCGATCTTGGCAAGGAGATGATTGGAGAACTTCCTCCCAAGCTAAATAACATTACTATTGTTGGTAAAGCTTTGAAATTTTTGTCACAGAATCTGCTAGGTGGTGTTACATCTGAAACTTTGACTCTGACCATTTATAATACCAGCCTTGAGGAAGTAGAAAGTGAAGTTTTTTGGAGACCAGGCCATGTAAAGAATCTAACCCTAGATTTGAGGCATAATAATATAGCTAGGGTTCCCAATCCAGCGAGACATGAATGGCCGGGAGTACCAAATTCTTTATTCCTTCACGACATATTTTTGTCTGGAAATCCTTTATACTGTGATTGTCGCATCGGTTGGGTTCAAGCGTGGGATCGCAAACGAAGACAATATTTGTGCGAGAGTCCCTCTAGTTGTGTTGCTGTACGAGACGATCTCAGATTTGCGAAATGTCCTTCCCATTATAACAGGACTTTCAGTGACGTCATCGCGAAAGATTTAGACTGCACTTGGAGTAAAGGATTCCTGAACTTACCAAACTTATACATAATTACGGCAATATCTATCATGACATGCCTCTACATTTGA

Protein sequence:

>DPOGS210693-PA
MILVAYTTKQNINKTKTLYVNQILVCHINLKEDAFRNAKIKTLSLRDCGVTDLSPASFAGLENTLQSLDLSENNLTMISKFMLNKLDSLRFLNLRENKVDTNLLATNNPSEYSTPSINNFQYKLFFLDISGSSSLEISLQDVRRMRSLRYLSVSKLIRRSISAEDFLEFGVELEDLKIIGSTINRIEASAFQHVRTIKSLDLSENNIDFIDPFAFAELHSLTSLKLANGLADSVKILPFEPLKALIELQDLDVSNNKLRNVPDTSFHFLYKLKTLNLQDNLIDHFSKGTLQSDIHRQLESVSLSLNQMQRIDQHTFVNLRELQEILIEDNLIETVHRRSFTSLDNLKVIRLRGNIITEISEEAFQNLPALKELDISFNQLETFKFSIFDQVGSATALKVNVSYNRIVSLTDSNAVNFFSSNFYPPPKAQRLVSEDPSPLRIERGLGTVSVNIRVLDFSHNNISYIAPYYFRHADLTLSELHLSHNMIRNITREVFGSMLMLQYLDLSHNQIFHMEYDCFKKVKRLQIIDLSHNHLFDTPVEVFHEMQGLTTVDLSDNNIKNLADNLIISPALERLDLSDNDLSRIPTNCLSPAAAINLVELDLSGNNIPAVAIADLVQRYRHDDWPEEPDYSDEYMYHTARRDHARVFHQKKQYPQNILFKSLAWLDLSDNHLVRVESGSFAALPKLRWLDLSMNMPFNNNDRGSSLFKGLERRLSHLGLKNVSLTNIPSMPLPKLKSLDLSYNNFPSIPTDMTANLTRLRALDLSYNDLTNVPVATHSLSELRWLSLSGNPITALMNTSMYGVSPRLEYLDVTHLKLSILEAGAFSKMYGLRTLKISVNGNIRDFNIPKILTHNDALKNLYLHIDNSQIDLGKEMIGELPPKLNNITIVGKALKFLSQNLLGGVTSETLTLTIYNTSLEEVESEVFWRPGHVKNLTLDLRHNNIARVPNPARHEWPGVPNSLFLHDIFLSGNPLYCDCRIGWVQAWDRKRRQYLCESPSSCVAVRDDLRFAKCPSHYNRTFSDVIAKDLDCTWSKGFLNLPNLYIITAISIMTCLYI-