Monarch geneset OGS2.0

DPOGS210669
Transcript: DPOGS210669-TA; 5247 bp
Protein: DPOGS210669-PA; 1748 aa
Genomic position: DPSCF300013 - 1625749-1644576
RNAseq coverage: 318x (Rank: top 36%)
Annotation
Heliconius: HMEL018086; 0.0; 71.32%
Bombyx: BGIBMGA006250-TA; 0.0; 68.29%
Drosophila: Cad99C-PB; 0.0; 42.23%
EBI UniRef50: UniRef50_E2C9P0; 0.0; 43.03%; Protocadherin-15 n=5 Tax=Neoptera RepID=E2C9P0_HARSA
NCBI RefSeq: XP_396248.3; 0.0; 43.13%; PREDICTED: similar to Cad99C CG31009-PA, partial [Apis mellifera]
NCBI nr blastp: gi|307191989; 0.0; 43.03%; Protocadherin-15 [Harpegnathos saltator]
NCBI nr blastx: gi|307191989; 0.0; 42.96%; Protocadherin-15 [Harpegnathos saltator]
Group
Gene Ontology:
  GO:0016020; 1.2e-34; membrane
  GO:0007156; 1.2e-34; homophilic cell adhesion
  GO:0005509; 1.2e-34; calcium ion binding
KEGG pathway:
InterPro domain:
  [597-616] IPR002126; 1.2e-34; Cadherin
  [895-995] IPR015919; 1.9e-33; Cadherin-like
Orthology group: MCL13212; Single-copy universal gene

Nucleotide sequence:

>DPOGS210669-TA
ATGGCGCGGCGGAGCGCGCTTCTCCGGCAAACCACAATCTGGCTTTTAGTTATTCTCCTCTCATCAGCATGGGCGACGCCAGGACCATGCGAAGTGGAGACAGGCCAATCAAGCATTATAGTTGATATTGAAGAAAGCAGAGGGGAACAGGTCAACCAAACTACAATACCAGCGGAGTTACCAATAGTCGGTGAGCCAGATGTAGACGTCATTTTATCAACAGTTTTTCCTAAAGGTCCAACGCTTTTTGAGCTCGACGGCAAACGTCTCCAGCTACTTCAACCCTTGGATAGGGATGCTGATAACTTATCACATATGGTATTCCAGCTGGTATGCCAAGTGAAAGCGACCAAGAAGAGGCGAACTATCCCCGTAATCGTAAGAGTGTCGGATATAAACGATAATGCACCGGTATTTCAAGGCACGCCCTATGAAACTAGTATATCAGAGTTAATACCTATCGGTTCGACGATATTCGACGGTATACGTGCATTGGATCCTGATGCTGGTGTGAATGGTTTAGCTGAATACTTTATAATACCTGGTGATAATAAAACGCTGGAGGCTGCTAACGCGGCCGATGGTTACACGAACTTCGAGATCCCAATACCTCACCAGGGTCAAGTCACTGTGAACAGGAGTTTAGATTACGAACGGACACAGAAGTATCTAGTCACTATTGTTGCTTCTATCGAAAACTATGGAAACGAGGATCGAGCTCGCGATACCAAACGCCGTCTCTCCAGTACCACCACGTTGACGGTGAATGTTCAGGACGGGGACGACCAGGATCCGTCCTTTATATACAAAGGCTGCACATTACATGACGGAGTTTGTATCAACCCTGAGTACAACACTTACGTAAACGGTGGTGTACTCGCTGGTATACTGACGATAGAACCTGAGAAGATTCAAGCCGTGGACATGGACACTCTGAACGCTCGTATAAAATATACTATAGAGAGCGGCGAACCCGATTCATGGGCGAGTTACTTTGATATAGATCAAAGTACAGGCGCCGTGAGGCAACTTGTGCCTGTCGATACAAGTATTGCTAAGAAATTCCAATTAGTTATTAAGGCGGAAGAGATATCGGAAGCGAAACGGTTCACAACAGCCAAATTGACAATAACAGTGAGACCTATAGACGCAAGTCCACCGGTAGTGACGTCATCGTCAGATGAAGGGCAGGTTGAAGAAAACTCTCCCAAAGGGACCAAAGTGCTGGATAAATCTGGAAATCCTATCAGATTGACAGTATCTGATCCTGACTTGGGTCCCGGAGATCCGATACCGGCCTACAAATTTGAGTCAACGACAAGCTTCTTTGATATCGACAAAGACGGGTACCTCGTAGTTAGCGACGATAGATTAGATCGCGACCCACCAAACAAAGATAGACTTAGATTTCAGGTGGAAGCTACAGACATAGACGAGGGTGAGAACGCTCGTATCAGTTACAGTATATACCACGTGTCTAACAACGGGGGTAACAAGTTCACTATAGATCCAAACACTGGTGTTATATCGAGCACCGGTCGTCTACAAGCTGGTGAACAGTACAGTGTCACAGTGAGGGCCGCTGACTCGAAAAGATTGAGTTCCCAAGGCATCATTGAGCTAGTGGTAGCCCCCGGACCCAATACACGCCCACCACAATTCAGTTCAAGGGATTACTTCGCACCGGTGTCTGAAGGAGCCGCTATCAACTCAACTGTTACCACCGTCACAGCCAAAGATCCTGAGAATGAACCTGTTACATACTCAATAGCGTCTGGGAACGATCTGCGTCAATTCGCCATCGGCTCCAACACTGGCATCATAACAGTTATAAGACATTTAGATAGAGAAGTACTCACGAGGTATCAGCTGGGTCCCGGAGATCCGATACCGGCCTACAAATTTGAGTCAACGACAAGCTTCTTTGATATCGACAAAGACGGGTACCTCGTAGTTAGCGACGATAGATTAGATCGCGACCCACCAAACAAAGATAGACT
TAGATTTCAGGTGGTGGCTCGCGAACCGACGGGGGCGGCGTCACCGCCTTACTCGCTCTCCGTGGAACTGCTTGATGTTAATGACAACGCTCCAGTTATACCAAAAACACAGCCAATCACCGTACCAGCCAGCCTCGAACCAGCCGCTGTGTACAGGGTGGAAGCTACAGACATAGACGAGGGTGAGAACGCTCGTATCAGTTACAGTATATACCACGTGTCTAACAACGGGGGTAACAAGTTCACTATAGATCCAAACACTGGTGTTATATCGAGCACCGGTCGTCTACAAGCTGGTGAACAGTACAGTGTCACAGTGAGGGCCGCTGACTCGAAAAGATTGAGTTCCCAAGGCATCATTGAGCTAGTGGTAGCCCCCGGACCCAATACACGCCCACCACAATTCAGTTCAAGGGATTACTTCGCACCGGTGTCTGAAGGAGCCGCTATCAACTCAACTGTTACCACCGTCACAGCCAAAGATCCTGAGAATGAACCTGTTACATACTCAATAGCGTCTGGGAACGATCTGCGTCAATTCGCCATCGGCTCCAACACTGGCATCATAACAGTTATAAGACATTTAGATAGAGAAGTACTCACGAGGTATCAGCTGATGATAAGAGCCGAAGATCCAGGACACTTGTCAACAACAGCTACGGTTAATATTAAAGTCACTGACATTAACGACAACAACCCAAAGTTCGACGAGGATTCCTATTTGTTTAAAATTAAAGAAGGAGTAGCAAACGCTGAAGTGGGCCAAGTACATGCTACTGATTTAGATGAAGGTGTGAACGCTATGATCACTTACAGCATCCCGTCACATCTGCCGTTCGCTATAGATAACTCTACTGGCGTTATAAGTACTGTAACAGAACTTGACTATGAGGATACTAAGGAGTATGCATTCGTTGTGACAGCTACTGATGGTGCGATGGACAAACGTCTGGGCACGGCCTCTGTATCAGTTTTAGTGTTGGATGAACCAGACGAGCCCCCGGTCTTCACACAAGGAGTGTATTCAGTTAGAGTGCCAGAGAACGCACCAAATTATCCAGTGGTTAAAGTACACGCCGATGATCCGGATACCCAACCGGAGATAACCTACACTATTATAATGGGAGACACCGATTTATTCTCCATCGACCGTAAGACCGGTCTTATTAGGACATTGAAGCCGCTCGACAGAGAGGAGTCGGCACGCCATGAGCTCATAGTGGGCACTGAGGAGAATAATAGTGATGGAAATGGATCTACAGCCACTGTTGAAGTTGTTGTTGATGATAAAAACGATAACGCGCCCATATTCACGTCTGTGGCTCGTCCGGTTACTATAGAGGATACGTCCTCTATAGGCAGTTTGGTTGAGACGGTGGTGGCGTTGGATTCGGATGCTACATCACCTAACAACCGTATACGGTATGCGCTCGCCGGACGAGGGAAAGCCAGTATATACTTCCACGTGGAACCGGATACGGGAGCTGTGAAAGTTAGAGACGATTTAAGAAAAGAGACTGATAGCGAATATACCGTTGATATCCAAGCGTACGACCAGGGTGATCCGGTGATGTCTTCTGTTATGAGTCTCACTGTCTATGTCAGTCACTCGGCCACCGTACCACCTGATGTTCGCCTCGGCTTCCCGGACACTGTGTACACGGAGCACTTGGCGGAGAACTCTCCCAACTCAACGATTGTAAGGACGTTGCCGATATGGAACAAGAACAAGCATTCAAGAGACACTCCCCTCAAGTGCTTACTCACAGATGCTTCACAGAAAGGAGTGTTTTACGTGAGACTTACAGTCGACAGAGACTGCGCTATATACCTCAATAGTAGTCTAGACTATGAAACACTAACAGAATACTCATTGGAAGTCCAGCTGGAGTCCATACAGGGTCTGATAAACCCAGAGAGCAGCAAAGCGGTTATTAAAATTCATGTAACCGATGTCAATGACAACGCTCCTGTTTTCGTATTTAATGAGCAGTCTACGG
TGGGCGGTGCGAAGGGTAAATATTTCGCAATAGCCACTAAGGATATGCCGTTGGGTACCAATTTACTGCAAGTTAAGGCTAAAGACAAGGACAGTGGCGACTTCGGTAAAATAGAATACCGCAAAAACTCATGGACCAAAGCAGCCGAGGAATACTTCTCGTTAGACCCTGACACCGGCGTCATCAGTAATAGGAAGACATTCGAAAATGTCCCAAAAGACGTATTACCGTTCAAATTCAACGTTATGGCGAGGGACAACCCCAAGTCTGATAACTATAAGATAGCTCGAGCTTCTGTCGTGATAAATCTGCTTCAAAAAGACAATCAGCTTATCATAGAAGTCGGTAACCTGAACGTGGATACAATGACGACTCAGCAAGCCCGTAGTCTGCTCGCATCTGTTGAGGAAAGAAGTGGTTTAGTGGCAGGACTCGAGAAATTGACTCCTAGACTATATTTGGGAGAAAATGGTACTTTAGAGAGCGACCCCAACGGAACCAGTGTCTGGTTGTACTTGTTAGATCCGAATACTGGCGAAATATTGACAAGAGAATCACAGCCGGTGAAAAAGGCGGTGGAGTCTGGTGTGTCAATCCGATACAAGAAACACAAAGAGCAAGCAATACAACAATACGGAACCCTAAGCATGAGTATGGCGCCGCCGCGGCCACCGTCAGGGTATGAGTCTAGTGAGGAAGACCCCGCTCCCAGATATGAAACACAGGTTCTCAATATGGCTGTCGATGACGCGGACCTACAATTGGACTTTAGTCCGAATAACCACGCATTCAACATTCACAATGTTCAGTATTTATCCAAAGATAACGGCGAAAGAAGTCCAACACTATCAGAAACAGCGACTACCGCGAGAGCGTCGAGCATCAACGAGAACGGCGGTACCTTGAACAAAGTTCATAACTTCGACAACAACGCCAACAACGCGACCGTCGCTCGGAACACGCAGACGTTGAATCGGCGGACCAACAACAACCATCCCCTGAATAACTCGCTGGGGACGCTACCCCGGGTCAATAATAGTGTAGCGGGGGGACTGCTGGCCGCGACGCTGGGTAGGAAAATCAACACCGGCAACCACAAGAAGAAACAGACGCAACCCATTATGGCGTACGACGAAATACCCGGGTTACAACGATCCACGGATAATGATAATGTTACGTTCGGAAAGAGAAATTTCACTGGTTTCTCATACGAACAATCCCCCGTGGAAACTACCACAGAGCTATAA

Protein sequence:

>DPOGS210669-PA
MARRSALLRQTTIWLLVILLSSAWATPGPCEVETGQSSIIVDIEESRGEQVNQTTIPAELPIVGEPDVDVILSTVFPKGPTLFELDGKRLQLLQPLDRDADNLSHMVFQLVCQVKATKKRRTIPVIVRVSDINDNAPVFQGTPYETSISELIPIGSTIFDGIRALDPDAGVNGLAEYFIIPGDNKTLEAANAADGYTNFEIPIPHQGQVTVNRSLDYERTQKYLVTIVASIENYGNEDRARDTKRRLSSTTTLTVNVQDGDDQDPSFIYKGCTLHDGVCINPEYNTYVNGGVLAGILTIEPEKIQAVDMDTLNARIKYTIESGEPDSWASYFDIDQSTGAVRQLVPVDTSIAKKFQLVIKAEEISEAKRFTTAKLTITVRPIDASPPVVTSSSDEGQVEENSPKGTKVLDKSGNPIRLTVSDPDLGPGDPIPAYKFESTTSFFDIDKDGYLVVSDDRLDRDPPNKDRLRFQVEATDIDEGENARISYSIYHVSNNGGNKFTIDPNTGVISSTGRLQAGEQYSVTVRAADSKRLSSQGIIELVVAPGPNTRPPQFSSRDYFAPVSEGAAINSTVTTVTAKDPENEPVTYSIASGNDLRQFAIGSNTGIITVIRHLDREVLTRYQLGPGDPIPAYKFESTTSFFDIDKDGYLVVSDDRLDRDPPNKDRLRFQVVAREPTGAASPPYSLSVELLDVNDNAPVIPKTQPITVPASLEPAAVYRVEATDIDEGENARISYSIYHVSNNGGNKFTIDPNTGVISSTGRLQAGEQYSVTVRAADSKRLSSQGIIELVVAPGPNTRPPQFSSRDYFAPVSEGAAINSTVTTVTAKDPENEPVTYSIASGNDLRQFAIGSNTGIITVIRHLDREVLTRYQLMIRAEDPGHLSTTATVNIKVTDINDNNPKFDEDSYLFKIKEGVANAEVGQVHATDLDEGVNAMITYSIPSHLPFAIDNSTGVISTVTELDYEDTKEYAFVVTATDGAMDKRLGTASVSVLVLDEPDEPPVFTQGVYSVRVPENAPNYPVVKVHADDPDTQPEITYTIIMGDTDLFSIDRKTGLIRTLKPLDREESARHELIVGTEENNSDGNGSTATVEVVVDDKNDNAPIFTSVARPVTIEDTSSIGSLVETVVALDSDATSPNNRIRYALAGRGKASIYFHVEPDTGAVKVRDDLRKETDSEYTVDIQAYDQGDPVMSSVMSLTVYVSHSATVPPDVRLGFPDTVYTEHLAENSPNSTIVRTLPIWNKNKHSRDTPLKCLLTDASQKGVFYVRLTVDRDCAIYLNSSLDYETLTEYSLEVQLESIQGLINPESSKAVIKIHVTDVNDNAPVFVFNEQSTVGGAKGKYFAIATKDMPLGTNLLQVKAKDKDSGDFGKIEYRKNSWTKAAEEYFSLDPDTGVISNRKTFENVPKDVLPFKFNVMARDNPKSDNYKIARASVVINLLQKDNQLIIEVGNLNVDTMTTQQARSLLASVEERSGLVAGLEKLTPRLYLGENGTLESDPNGTSVWLYLLDPNTGEILTRESQPVKKAVESGVSIRYKKHKEQAIQQYGTLSMSMAPPRPPSGYESSEEDPAPRYETQVLNMAVDDADLQLDFSPNNHAFNIHNVQYLSKDNGERSPTLSETATTARASSINENGGTLNKVHNFDNNANNATVARNTQTLNRRTNNNHPLNNSLGTLPRVNNSVAGGLLAATLGRKINTGNHKKKQTQPIMAYDEIPGLQRSTDNDNVTFGKRNFTGFSYEQSPVETTTEL-
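
The transcript and protein lengths listed above are consistent with each other if the transcript is read as a complete coding sequence: 5247 bp is 1749 codons, that is, 1748 amino acids plus one stop codon. The Python sketch below illustrates that check and translates a few codons copied from the start and end of DPOGS210669-TA. It assumes the standard genetic code and a transcript with no untranslated regions, neither of which is stated in the record. The names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tool, and the stop codon is written as "-" only to match the convention used in the protein sequence above.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0; stop codons become stop_symbol.

    Trailing bases that do not fill a whole codon are ignored. Codons with
    characters other than A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count for a complete CDS of this length, excluding the stop codon."""
    return transcript_bp // 3 - 1


# First five and last three codons of DPOGS210669-TA, copied from the record.
print(translate("ATGGCGCGGCGGAGC"))    # MARRS
print(translate("GAGCTATAA"))          # EL-
print(expected_protein_length(5247))   # 1748
```

Running `translate` on the full nucleotide sequence (with line breaks removed) and comparing the result with the DPOGS210669-PA entry would extend the same check to the whole record.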