Monarch geneset OGS2.0

DPOGS209332
TranscriptDPOGS209332-TA5340 bp
ProteinDPOGS209332-PA1779 aa
Genomic positionDPSCF300194 - 43721-57444
RNAseq coverage98x (Rank: top 61%)
Annotation
HeliconiusHMEL0093790.089.25% 
BombyxBGIBMGA002952-TA0.086.81% 
DrosophilaCadN-PL0.072.01% 
EBI UniRef50UniRef50_O159430.072.01%Neural-cadherin n=57 Tax=Hexapoda RepID=CADN_DROME
NCBI RefSeqXP_002066304.10.081.63%GK18219 [Drosophila willistoni]
NCBI nr blastpgi|1954367180.081.63%GK18219 [Drosophila willistoni]
NCBI nr blastxgi|1954367180.081.63%GK18219 [Drosophila willistoni]
Group
Gene OntologyGO:00160206.4e-49membrane
GO:00071566.4e-49homophilic cell adhesion
GO:00055096.4e-49calcium ion binding
GO:00055158.7e-07protein binding
KEGG pathwaydmo:Dmoj_GI222900.0 
 K10414 (DYNC2H, DNCH2)maps-> Phagosome
    Vasopressin-regulated water reabsorption
InterPro domain[943-1182] IPR0133201.1e-52Concanavalin A-like lectin/glucanase, subgroup
[1615-1773] IPR0002336.4e-49Cadherin, cytoplasmic domain
[947-1184] IPR0089855.6e-46Concanavalin A-like lectin/glucanase
[449-558] IPR0021268e-34Cadherin
[989-1158] IPR0017913.9e-33Laminin G domain
[439-563] IPR0159195.2e-32Cadherin-like
[997-1157] IPR0126802.9e-26Laminin G, subdomain 2
[1455-1491] IPR0018814.3e-11EGF-like calcium-binding
[1458-1491] IPR0062108.7e-07Epidermal growth factor-like
Orthology groupMCL10152 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209332-TA
ATGACGGGCGCTATCTATGTAGCGGGTGCACTCGACTATGAAACTAGGAAACGGTACGAGTTGAAGTTGGCTGCTTCGGACAATCTAAAAGAGAACTATACGACAGTGGTTATCCACGTAAAGGATGTGAATGACAACCCACCAGTGTTCGAGAGACCGACCTATCGTACCCAAATCACAGAAGAAGATGACCGCAATCTTCCCAAGCGTGTGCTTCAGTACGAGCTCACCTTGGTGGCGTCGGACGGCAGAAACGAGAACTCAACTCGTGTGGTGGTCCACGTACTAGATATCAACGATTTGCCACCTCGGTTCTCGCGCAGCGCATATATCACCCAGGCCTTAGAGGAAACAGGACCCTACCCCCACTTCCTTATACAGGTCACTGCGACAGACGGCGACAAGGATAGGCAACAAAATATTGTTTATTTCCTTACCGGCCAGGGTATTGACCCAGATAATCCTTCAAATAGCAAATTCGACATCAATCGAACGACAGGAGAAATCTTTGTCCTTAAGCCCTTAGATCGAGATCAACCTAATGGACGGCCTCAGTGGAGATTTACAGTATTTGCTCAAGATGAAGGTGGAGAAGGACTTGTAGGTTATGCCGATGTACAAGTTAATCTTAAGGACATTAATGACAATGCACCTATTTTCCCACAAGGTGTTTACTTTGGTAATGTAACAGAAAATGGTACAGCGGGAATGGTTGTTATGACAATGACTGCAATTGATTATGATGATCCAGCTGAGAGTAACAATGCAAAACTTTGGTATTCCATCGAGAAAAATGTTATCGAGGAAGAGACAGGATCTCCAATTTTTGAAATCGAACCAGAAACCGGGGTCATTAAAACTGCGGTGTGTTGTTTGGATCGTGAAAGAACTCCAGATTATTCTATACAAATAGTAGCTTCGGATGGAGGGGGGTTAAAAGGAACAGGTACAGCATCAATCAGAGTTAAAGATATAAATGATATGCCGCCTCAATTCACAAAAGATGAATGGTTCACAGAAGTAGATGAAACAGATGGAACGAATTTACCTGAAATGCCGATACTCACAGTAACAGTTCACGATGAAGATGAAACTAATAAATTCCAATACAAGGTTATAGAAAACAGTGGATATGGTGCTGATAAATTTACAATGGTTAGAAATAATGATGGGACTGGATCCCTTAAAATTGTACAGCCATTAGATTATGAGGATCAGTTGCAAAGTAACGGTTTTAGGTTCAGAATACAAGTAAATGATAAAGGTGAAGACAATGATAACGATAAGTATCACGTAGCTTATTCATGGGTAGTTGTGAAACTTCGAGATATAAATGACAACAAACCACAATTTGAACGAGCAAATATTGAAGTATCTGTGTATGAAAATGCAGAAGTCGGAAAAAGCCTCGAAACATTCAAAGCCACGGATCCAGACCAAGGAGGTAAAAGTAAAGTCTCGTACGCTATTGATAGATCCTCTGATAGGAAACGTCAATTTTCAATTAACCAGGAAGGTACTGTTAGCATCCAAAGATCTTTAGATAGGGAAGATACACCCAGACATCAAGTTAAAATTTTGGCTATTGATGACGGTGTTCCTCCAAGGACAGCAACAGCCACTTTAACAGTAATCGTACAGGATATAAACGATAACGCACCTACATTCCTAAAAGACTATAGACCCGTTTTAACTGAACATATAACACCTAAAAAAGTGGCTGAAATTCTAGCAACAGACGACGATGATAGATCTAAGAGCAATGGTCCACCATTCCAATTTCGACTAGATCCGGGTGCTGATGATATTATAAGAGCCTCTTTCAAGGTCGAACAAGACCAAAAAGGTGCAAACGGTGATGGTATGGCAATTGTTTCATCCTTAAGATCATTTGATAGAGAACAACAAAAGGAATATCTCATTCCCATTATTATAAAAGATCACGGTAATCCAGCTATGACTGGAACGAGCACTTTAACAGTCGTAATTGGTGATGTGAATGACAATAAAATGCAACCAGGTTCTAAGGAAATCTTAGTTTATAATTATCAAGGGCAAGCACCAGATACAGAAATTGGAAGAGTATACGTATATGATTTGGATGATTGGGATTTACCAGACAAGAAATTTTTCTGGGAGAGTTCAGAACATCCTAACTTTACATTAAATGAAGAAACTGGAATGATTCAAATGAAACACAAAACAAGAGAAGGTAGATATCACTTAAAATTCAAAGTATACGATCGAAAACATACGCAAACTGATGTACCTGCAAATGTTACCGTTTATGTCAAAGAAATTTCGTCTGAAGCAATCATGAATTCGGGTTCTATAAGAATATCAGGTATATCTGACGAAGATTTTATAAGAGTATGGAATTATAAAACTTTAAGTGTTTCTAGAAGTAAGTTAGATATATTCAAGGATAAATTAGCGGATTTGCTTAACACAGAACGTGAAAACATCGATGTATTCAGTGTACAACTGAGGAAAAAACATCCACCTGTAACTGATATTCGTTTTTCTGCCCATGGAGCTCATTACTACAAACCAATACGATTAAATGGAATAGTACTTATGCATAGAGAAGAAATAGAAAGAGCCGTAGGAATCAATATAACCATGGTAGGAATAGATGAATGTCTTTACGAAAACCAAATGTGCGAAGGTTCTTGTACTAATGTTCTTGATATTAGCAACTTACCTTATATGGTTAATTCAAATAAAACAGCACTTGTTGGCGTTCGTGTTGATGTTATTGCGGAATGTACTTGTGGTGCTAGAAATTTCACTCAAGCTGAAACTTGTCGTAACTCGCCATGCTATAACGGTGGTAGATGTATAGAAGGTAAATATGGATTGACTTGTTCATGTCCGCCCGGATATACAGGACCTAGGTGTCAACAGACATCACGGAGTTTTAGAGGTACAGGTTGGGCCTGGTATCCTTCGTTAGAAATGTGTGATAGCTCTCATTTAAGTTTTGAGTTTATTACCAGGAAGTCCGAAGGAGTTTTACTTTATAATGGACCAATTGTTCCGCCCGAACCAGAAGAAATAGTTGTATCCGACTTCATTTCAGTTGAATTAGAAAGAGGAAATCCAAGATTATTAATTGATTTTGGATCAGGTACACTAGAGTTGAGGGTAAAAACTAAAAAATCTTTAGATGATGGTGAGTGGCATAGACTAGACATATTTTGGGATACCGAAAATGTCAGAATGATCGTTGATTTCTGTAAATCGGCGGATATTCAGGAAATGGAAGACGGAACTCCACCCGAATTTGATGACTCAACTTGTCAAGCATCTGGAACGATACCACCATTTAACGAATATTTAAATGTCAATGCACCTTTACAAATTGGTGGATTATACATTGAACATTTTGATCCTACACACTACCATTGGCAGTACATGCCAATTGGAAAAGGATTTGATGGGTGTGTTAGAAATCTAATACACAATAGTAAATTATATGATTTAGCACATCCTGGTCTCTCTAGAAATTCTGTAGCTGGGTGTCCGCAAACAGAAGAAATTTGTAATCAGGCTGACACAACAACAAGATGTTGGGAACATGGCACTTGTGTTGGAAGTTTCTCGGAAGCTAGATGCCAGTGCCAGCCTGGTTGGACGGGACCATCGTGTAATCTACCAACAACACCAACAAGTTTTAGACCACAGAGCTACGTAAAATTCGCATTGAGCTTTGAGCCTGACAGGTTTAGCACACAGGTACAACTAAGGTTTAGAACTAGGGAACCTCATGGAGAACTTTTTCGAGTAAGCGATCAACACAACAGAGAATATGGCATTTTGGAGGTTAAGGATTCACGGTTACATTTCCGTTATAACTTAAATTCCTTACGGACGGAGGAACGTGATGTTTGGTTGAATTCCGTGCCAGTGGATGATGGACAGTGGCATATAGCTAGAGTGAGCCGATATGGTAGCGCTGCGACCCTCGAAATCGATGGAGGAGAAGGCAGAAGATATAACGAAACATTTACATTTGAAGGCCATCAATGGCTACTGGTAGATAAACAGGAGGGTGTATATGCTGGAGGCAAGGCCGAATACACCGGCGTTCGAACGTTTGAAGTATATGCAGATTTTCAGAAAGGTTGTCTAGATGACATAAGATTAGAAGGTAAACATTTACCGTTGCCGCCGGCGATGAACGGAACTCAATGGGGTCAAGCAACAATGGCCAGAAACTTAGACCGGAACTGCCCCTCTAACAGTCCCTGTATAAACGTTCACTGCACCGAACCCTTCGTCTGCGTCGACCTCTGGAATGAATATGAATGCACTTGCGGTGAGGGTTTGGTATTGTCTGGTGACGGAAAAGGTTGCGTAGACAAGAACGAATGTCTCTACTTCCCTTGCCGAAACGGGGGTTCGTGTGTCAATCGCGAACCAGGGTACCGCTGCCACTGTCCAGAAGGGTTCTGGGGCGAGAATTGCGAACTTGTACAGGAAGGACGAACGCTGAAACTCAGCATGGGCGCCCTGGCGGCCATCCTCGTCTGCCTCCTTATTATCATGAAGGTAAAGCGCACAATGTATAGGTGTCCGGGTGGATCGACTCGGTCTGGCGGTACGTGTGTGAACGTGAACGAGTGCCTGAACAATCCGTGCTTGCACGGCGGCAAGTGTGTGGATCGCGATCCAGCACGCCGCTACGACTGCATATGCACGTTCGGATACGCTGGACATGACTGTGAACTAGAACTTCTTGCCTCCGGAATCATCATGCCCTCCAGGGATTTCATTATTGCTATCATTGTCTGTTTGTTTTTGCTTTTAGTCCTAGTTCTGGTATTTGTGGTGTACAATCGCCGTCGAGAAGCCCATATAAAGTACCCGGGACCGGACGACGACGTTCGTGAAAATATTATTAATTACGATGACGAGGGCGGCGGTGAAGATGACATGACTGCATTTGACATCACTCCTCTACAGATACCCATCGGTGGACCGTTACCGGATCACGTACCTACAAAATTACCATATCCACTGATGGGTGTAGGTTTGGGAGTGGGTCCAATGGGGGTATCGGTCGCCCCCGCGGTGGTACCACTTCCAGGGGAGACGAACGTTGGCATGTTCATCGAAGACCACAAACGACGTGCTGACAGTGACCCTAACGCACCACCCTTTGACGATCTCAGGAATTACGCGTATGAAGGTGGTGGCAGTACTGCGGGCTCCCTCTCGTCCCTTGCTTCTGGTACCGACGATGAGGTACACGACTACGACTATTTGGGTGCCTGGGGTCCTCGTTTCGACAAACTGGCTGATCTCTACGGGCCCGAACTCGACGAGCAACTGTAA

Protein sequence:

>DPOGS209332-PA
MTGAIYVAGALDYETRKRYELKLAASDNLKENYTTVVIHVKDVNDNPPVFERPTYRTQITEEDDRNLPKRVLQYELTLVASDGRNENSTRVVVHVLDINDLPPRFSRSAYITQALEETGPYPHFLIQVTATDGDKDRQQNIVYFLTGQGIDPDNPSNSKFDINRTTGEIFVLKPLDRDQPNGRPQWRFTVFAQDEGGEGLVGYADVQVNLKDINDNAPIFPQGVYFGNVTENGTAGMVVMTMTAIDYDDPAESNNAKLWYSIEKNVIEEETGSPIFEIEPETGVIKTAVCCLDRERTPDYSIQIVASDGGGLKGTGTASIRVKDINDMPPQFTKDEWFTEVDETDGTNLPEMPILTVTVHDEDETNKFQYKVIENSGYGADKFTMVRNNDGTGSLKIVQPLDYEDQLQSNGFRFRIQVNDKGEDNDNDKYHVAYSWVVVKLRDINDNKPQFERANIEVSVYENAEVGKSLETFKATDPDQGGKSKVSYAIDRSSDRKRQFSINQEGTVSIQRSLDREDTPRHQVKILAIDDGVPPRTATATLTVIVQDINDNAPTFLKDYRPVLTEHITPKKVAEILATDDDDRSKSNGPPFQFRLDPGADDIIRASFKVEQDQKGANGDGMAIVSSLRSFDREQQKEYLIPIIIKDHGNPAMTGTSTLTVVIGDVNDNKMQPGSKEILVYNYQGQAPDTEIGRVYVYDLDDWDLPDKKFFWESSEHPNFTLNEETGMIQMKHKTREGRYHLKFKVYDRKHTQTDVPANVTVYVKEISSEAIMNSGSIRISGISDEDFIRVWNYKTLSVSRSKLDIFKDKLADLLNTERENIDVFSVQLRKKHPPVTDIRFSAHGAHYYKPIRLNGIVLMHREEIERAVGINITMVGIDECLYENQMCEGSCTNVLDISNLPYMVNSNKTALVGVRVDVIAECTCGARNFTQAETCRNSPCYNGGRCIEGKYGLTCSCPPGYTGPRCQQTSRSFRGTGWAWYPSLEMCDSSHLSFEFITRKSEGVLLYNGPIVPPEPEEIVVSDFISVELERGNPRLLIDFGSGTLELRVKTKKSLDDGEWHRLDIFWDTENVRMIVDFCKSADIQEMEDGTPPEFDDSTCQASGTIPPFNEYLNVNAPLQIGGLYIEHFDPTHYHWQYMPIGKGFDGCVRNLIHNSKLYDLAHPGLSRNSVAGCPQTEEICNQADTTTRCWEHGTCVGSFSEARCQCQPGWTGPSCNLPTTPTSFRPQSYVKFALSFEPDRFSTQVQLRFRTREPHGELFRVSDQHNREYGILEVKDSRLHFRYNLNSLRTEERDVWLNSVPVDDGQWHIARVSRYGSAATLEIDGGEGRRYNETFTFEGHQWLLVDKQEGVYAGGKAEYTGVRTFEVYADFQKGCLDDIRLEGKHLPLPPAMNGTQWGQATMARNLDRNCPSNSPCINVHCTEPFVCVDLWNEYECTCGEGLVLSGDGKGCVDKNECLYFPCRNGGSCVNREPGYRCHCPEGFWGENCELVQEGRTLKLSMGALAAILVCLLIIMKVKRTMYRCPGGSTRSGGTCVNVNECLNNPCLHGGKCVDRDPARRYDCICTFGYAGHDCELELLASGIIMPSRDFIIAIIVCLFLLLVLVLVFVVYNRRREAHIKYPGPDDDVRENIINYDDEGGGEDDMTAFDITPLQIPIGGPLPDHVPTKLPYPLMGVGLGVGPMGVSVAPAVVPLPGETNVGMFIEDHKRRADSDPNAPPFDDLRNYAYEGGGSTAGSLSSLASGTDDEVHDYDYLGAWGPRFDKLADLYGPELDEQL-