Monarch geneset OGS2.0

DPOGS212456
TranscriptDPOGS212456-TA4146 bp
ProteinDPOGS212456-PA1381 aa
Genomic positionDPSCF300273 - 351559-365494
RNAseq coverage15x (Rank: top 82%)
Annotation
HeliconiusHMEL0119950.072.54% 
BombyxBGIBMGA002170-TA0.065.83% 
DrosophilaNlg1-PE9e-14644.61% 
EBI UniRef50UniRef50_Q17GB82e-15639.14%Neuroligin, putative n=5 Tax=Arthropoda RepID=Q17GB8_AEDAE
NCBI RefSeqXP_971146.17e-17835.43%PREDICTED: similar to neuroligin, putative [Tribolium castaneum]
NCBI nr blastpgi|910820451e-17635.43%PREDICTED: similar to neuroligin, putative [Tribolium castaneum]
NCBI nr blastxgi|910820455e-17433.98%PREDICTED: similar to neuroligin, putative [Tribolium castaneum]
Group
KEGG pathwaytca:6598523e-115 
 K07378 (NLGN)maps-> Cell adhesion molecules (CAMs)
InterPro domain[2-517] IPR0020182.2e-128Carboxylesterase, type B
Orthology groupMCL16728 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212456-TA
ATGTTCTTAGGGATTCCTTATGCAGCACCACCTATAGGCAACTTACGTTTTATGCCCCCAGTGAGTGCCCCACCTTGGTCTGGTTTAAGGATGACGACTCGCTTCGCTCCAGTGTGCCCTCAAACAATTCCTACTATAAAGAAGGGCAATCCTCCGTCTTTGGCAAGACAGCGCTACTTGAGCAGAATTAAACCATTTTTAGCTGAAGAATCCGAGGATTGTTTATATTTAAACATCTATGTTCCTTACAGAGAAAACAAACCGAAGAAATTTCCAGTACTAGTTTTTATACACGGAGATTCATTTGAGTGGAGTTCCGGTAATCCTTACGACGGCAGAATATTAGCTTCTTACGGCAATGTCATGGTAGTAACTGTCAATTTCAGGTTGGGAATATTAGGTTTCATGAAACCAAGCGTAACAGAACACGTCTATGGCAACAACGGTCTGTTGGATCAGCTAGCAGCATTACAATGGATAAAAGACAACATTGAAGACTTGAATGGAGATCCATATTCCGTCACACTAATGGGACATGGCTCCGGAGCTGCTTGTGTTAATTTTCTCATGCTTTCTCCGATCTCAAACGGATTATTTCACCGAGCGATTCTCATGTCTGGATCTGCGCTCTCTGATTCTGCGATGACAAGAGATCCCACGCAGTACACTCTTCAAGTTGCACAAAGCTTAGGATGTAATCCAAGCTCAAAGAACATGATGACCTGTCTGCAGAATAAACCATTATCGGACATAAAAAAGGTTCAAATATTGGCTCGCGAGTTTGAAACTCCACTAGGACCAGTAGTAGCAGGATCCTTTATACCAAGTGAACCAGCTAGAACCATGGAGGCGTTCCCTAATCTTCTGAGCAAATATCAATTGCTGAGTGGTGTGACCGAGTTGGAAAGGTATCATGATTTTGGAGTCATAGAACTTGACCACGGAATTTTAGAAAACCAAAGAGACGAATTTATAAAGAAATTTTCCAAAATTGTGTTCGAAGGTGCCGAAGACGAAGCTCTTAAAGAGATATTAAAACAATACTCGCCATCTAAGCTTGACCCACAGCGCTGGAACGTTGAAACAAATAGAGATGTCATTTTAAATTTGTTCAGTGATGCGCGTACTTTAGCGCCTATGGTCAGTTTTGCCAACTACCAATCCAAGTCTAATAGACAGTCTTACTTTTACGTTTTCGCACATAATTCGGTCAGCACTGATTATGCATCGCTAAATAAAAGTGTACACGGAGAAGAAATGCCATATGTTTTGGGTATCCCTTTGGGTGGAGGCAATACTCACTTTCATTCTGAATATACGCCAGAAGAGAAACTGCTTAGTGAAATGGTCATGAGATTGTGGACAAATTTCGTAAAAAATGGATCACCTAACACCCAAAGTGTGAATGAGTATTATACCATGGATAAGAAGCAATGGCTCAAATATAACGTAGAGTGGCCCGAGTATAACGTCGCTCATCAGCCATATTTACGAATAGATTTGCCTCCATCAATAAACAGTTTGTATAGATCTAATTATACAAATTTTTGGATAGAAATACTTCCTAATAAAATGAAGAGATACGTAATAGATCCTTTATTCGAGTTTAATCCATTACAAACAACACCGAAAACCAAAGCAATCCATAGAATGACAGATCCAGTCCGAAAAACATGGGGTCCTCAATTTAATATATATCCTTCACAATACAATTCTGCTACGGCTTTTGGGAAATTACGTCCATATTCACCTCCGCATTCAACACCAGACACAGATGCAATATATCGAGAAATACAAGCAATTAGAACCCCATCACGACCGATACCTTCAGGGTTGTTAGAAAAAAATAGACCGACGACGACAAGACCAACACCTAACATGCCCATCAAAACATCTAGTGCTACAATTACTCTAGTTGTTTCCCTGAGTGTTTTATTTCTATTGATAAATATAAGTATTTGCATCTTAGTATATTTAAAGAGAAAAAAGTTAAAACGGGATAGGACAGTAAATAATTTACGCAGATCGCGTGCCGAAATAGGTGAAATTGATGTTATTGGACAAAAGTATTCTAAAGATGACAAAACGGCGTTAAAAACTTTTAAGAATGGATGCAATGTTATTAAATCTTTGAGTATAAATAAAATTAAAGGAAACGGTATGAAACCAAAAAAGAAAGATAGCCATACAAGTCCTAAATCGGATGATTCAGGTGGATTCAGAGAACGTTTTCAATTAAGAAGGCATTTATCTACCAGCACTCTGGATGCCCATACAAAAGTAAAGGACTGGATTTCAAACGAAGTTATGCATAGATGCTCTCCGCGTATTTTAAGAAAATCTAATACTGATTTGAATGACCAATCTTATATGAAAACCAAACCTTTCACAAGGTCAGAAGAATTATTATCTGACTCCAAAAAGGTATCAAAAGAAATATGTAAAGTTGTTAGTCAAAAAGACCCCAGTAATTTATTTGAAAAAAAATATGCTAAAAATAAAACATCTGAAAAAACAACATCCACTTCTGCTTTAAATTCACATTCATCTTTAATAAGCAATAAACAATCTACACTTAGTAGGGAAACCAATAAGTCCAATGACTCCTTAAAAAGTAAAGGAAGCGAAAGCATTAAGTCCAATAAAGTTTCCATTGCGATTGATGCTACACCTTCAGCACGAACTAATTCAATTTTAAAACAAGAGCCTATAGAATTATCCAAGTCTCTAGACCAAATTGAACAAAAACATTCCAAACCGGACAAAATATTGGAAAAATCAAAAACAGAAAATAAAAGTAAAGACTGTAAAATTTACACAAACGAAGTGACTTTATCATTAAACGATAAAAATCCATTAAAAATTCTTCATAAACATTCTTCGTCTGATCCAGTAACCGATGTCAACTATGAACTACTTATGAAACAACTTCAACGCAATAACGAGGTTATTGACATCCTCCCTCCCGTTACATTTAGAAATGATATAAATGTAACTTCCAGAGATGAACATAGTCAATTTTCTTCTCTAACTCCTGAGGAATCTTTACAAACTATAAAAAAAAGAAACTTTCCCAAAGTTTTACCGGATTTACCAAAAGCCCAAAAAAGATTATCACTTCAACCTAACACATTGCAAACTTTTAGAAGTTATTTCGAAACCGGAAACATTGAAAGTCAAGTTGAAAAAACAAGAGTTCCACCTCAACCTCCACCGCGTACTACAACTCTTGAACGGCGTTTAGCTTATAAAAACTCGAAACCACTTTCATCATTTGACGTATCGAATATAAGAAAAATAAATGAATCTGAACCAGTAAACAATTATGAAAACATAAATTCACCGTTTTTGAATAGTAATATAAATAAAATAAGTAATCAATCTTTCGAATCAGTAATGGATAAGCCTTACTCAGCTATAATGACACACACTAGAGAATATCCTAAAGTAATAATCGCATCTAGCGATATTCCTTCTTCACCAGAACCTACGATTTTTATTAAGCCTGCCTCAAGCCAAGGTCAGATACCACTATGTGGGCCTAGAGTAAGATTACCTGATGACTTCCACTCTCAAGGCTCTGGGTCCATGACATCTTTTAGGTCATTTTGTATGGATGATATAGTTGATGACGTTGATGACGAACTCATTGACATTTTACCTGAGGATAAACTTATACAGGAACTAATTGGCGGTGTTGAATCGCCAACACATCTAGATGTCTTAGATTCTAAAGGACCAGAAACCATTTTCGAAAAAAAAGTAGAAATAGTGCCTGTAAAAATTAACATGAATCCAATGGGCCCAAAACATAAGGATGACTATGTGTGTATTTCAAATTTACTCTTACCAGAAATTAGTGGTTTGGAACAAACCGCGCAAATTAAACCAAATTTTTGTTTAAGTAAACTAAAAAGTAGAAGTGAAAGTATAAACAGGCCTCCAGAGAAAGCAGTCAAAGTAAAACCAAAGAAAACAATTAAACCTACTACATTTTTGAGTCGAGCCAGTAAAAGCGTTAGCAGTGGCAGCGGTGGCAGTTTTACAAATAAAAGTGATTCTAGTAGATTAGAACATTCAAATTCTGGTTCATCGAATGATACAGAAACTAGTACAGGAACCGTGAAAAAAATCGAAATCAATTAA

Protein sequence:

>DPOGS212456-PA
MFLGIPYAAPPIGNLRFMPPVSAPPWSGLRMTTRFAPVCPQTIPTIKKGNPPSLARQRYLSRIKPFLAEESEDCLYLNIYVPYRENKPKKFPVLVFIHGDSFEWSSGNPYDGRILASYGNVMVVTVNFRLGILGFMKPSVTEHVYGNNGLLDQLAALQWIKDNIEDLNGDPYSVTLMGHGSGAACVNFLMLSPISNGLFHRAILMSGSALSDSAMTRDPTQYTLQVAQSLGCNPSSKNMMTCLQNKPLSDIKKVQILAREFETPLGPVVAGSFIPSEPARTMEAFPNLLSKYQLLSGVTELERYHDFGVIELDHGILENQRDEFIKKFSKIVFEGAEDEALKEILKQYSPSKLDPQRWNVETNRDVILNLFSDARTLAPMVSFANYQSKSNRQSYFYVFAHNSVSTDYASLNKSVHGEEMPYVLGIPLGGGNTHFHSEYTPEEKLLSEMVMRLWTNFVKNGSPNTQSVNEYYTMDKKQWLKYNVEWPEYNVAHQPYLRIDLPPSINSLYRSNYTNFWIEILPNKMKRYVIDPLFEFNPLQTTPKTKAIHRMTDPVRKTWGPQFNIYPSQYNSATAFGKLRPYSPPHSTPDTDAIYREIQAIRTPSRPIPSGLLEKNRPTTTRPTPNMPIKTSSATITLVVSLSVLFLLINISICILVYLKRKKLKRDRTVNNLRRSRAEIGEIDVIGQKYSKDDKTALKTFKNGCNVIKSLSINKIKGNGMKPKKKDSHTSPKSDDSGGFRERFQLRRHLSTSTLDAHTKVKDWISNEVMHRCSPRILRKSNTDLNDQSYMKTKPFTRSEELLSDSKKVSKEICKVVSQKDPSNLFEKKYAKNKTSEKTTSTSALNSHSSLISNKQSTLSRETNKSNDSLKSKGSESIKSNKVSIAIDATPSARTNSILKQEPIELSKSLDQIEQKHSKPDKILEKSKTENKSKDCKIYTNEVTLSLNDKNPLKILHKHSSSDPVTDVNYELLMKQLQRNNEVIDILPPVTFRNDINVTSRDEHSQFSSLTPEESLQTIKKRNFPKVLPDLPKAQKRLSLQPNTLQTFRSYFETGNIESQVEKTRVPPQPPPRTTTLERRLAYKNSKPLSSFDVSNIRKINESEPVNNYENINSPFLNSNINKISNQSFESVMDKPYSAIMTHTREYPKVIIASSDIPSSPEPTIFIKPASSQGQIPLCGPRVRLPDDFHSQGSGSMTSFRSFCMDDIVDDVDDELIDILPEDKLIQELIGGVESPTHLDVLDSKGPETIFEKKVEIVPVKINMNPMGPKHKDDYVCISNLLLPEISGLEQTAQIKPNFCLSKLKSRSESINRPPEKAVKVKPKKTIKPTTFLSRASKSVSSGSGGSFTNKSDSSRLEHSNSGSSNDTETSTGTVKKIEIN-