Monarch geneset OGS2.0

DPOGS204134
Transcript: DPOGS204134-TA (4995 bp)
Protein: DPOGS204134-PA (1664 aa)
Genomic position: DPSCF300184 + 238062-250876
RNAseq coverage: 92x (Rank: top 62%)
Annotation
Heliconius:      HMEL005109        1e-49  23.91%
Bombyx:          BGIBMGA013616-TA  0.0    54.47%
Drosophila:      Cad88C-PA         3e-43  25.74%
EBI UniRef50:    UniRef50_Q86DU3   0.0    53.48%  Cadherin-like protein n=87 Tax=Ditrysia RepID=Q86DU3_PECGO
NCBI RefSeq:     NP_001037682.1    0.0    58.24%  cadherin-like membrane protein [Bombyx mori]
NCBI nr blastp:  gi|112982685      0.0    58.24%  cadherin-like membrane protein precursor [Bombyx mori]
NCBI nr blastx:  gi|16588792       0.0    55.36%  cadherin-related midgut membrane protein [Lymantria dispar]
Group
Gene Ontology:
  GO:0016020  8.5e-19  membrane
  GO:0005509  8.5e-19  calcium ion binding
  GO:0007156  5.5e-15  homophilic cell adhesion
KEGG pathway 
InterPro domain:
  [480-584]  IPR015919  8.5e-19  Cadherin-like
  [485-587]  IPR002126  5.5e-15  Cadherin
Orthology group: MCL19118 (Insect specific)

Nucleotide sequence:

>DPOGS204134-TA
ATGACCAACATTCCAAGGGAATCCAGGCCTGATGACTTACCGGAACTTGTAGCGGAAGGAGATACATGGAGCGACAGACCTTTCCTGCCCGGTACTGAGGGTGAAGAAGTTTGTTTTGAACAATATAGAGCAAATGTCAACATTCAAGTAATATTTATGGATGAAGAGAACGTGGGAGAAGTACCCATAGCGAAACTGAATTATCAAGGAGATAAGACACCCACTATTGTTCTTCCCCTCACTGCTGGATCTTTCAATTTGTTGCGACCAGAATTGAAAAAAGAAAATAATTCTTGGTTTCTTTACGTGACTCAAAGACAGGACTATGAAACGTTAGAAAGTGCTTTATATATCCTGAGAATACAGATAGAAAATGAAGTGTTGGAAGGACAAATATCTCTATGGATTGTAAACATTGACGACAATCCTCCTATCATACACTTGCTGGATGCCTGCGTCGTGCCGGAACTAGGAGATCCTCGTTTGACAAATTGTACATACGAGGTAACAGACATAGATGGTAGAATCAGTACCAGCAACATGACCTTCAAGATTGGTAGCGATCGTGATGATGAAAATTTTTTCTATTTTATCGGAGAACCTGATACAAGCAACTGGAACAGAATGACCACGACTCTTGGAATAAATAGAGCACTAGATTTTGAAGCCAGCGCCCTTCATATATTTACAGTTACAGCACTCGATTCTTTAAATAACAACCACACCGTCACGATGATGGTCCAAGTGCAGAACGTTGAACATCGTATGCCTCGTTGGGTAGTTATATTCGCGGTGCAACAAATCGACGAGCTGAAGGCTGGGAATTTTAATGTTAGGGCAATAGATGGCGACACTGGTATCAACAAGCGGATTTTATACAAAATAGAAACGGAAAGAGGTGAAGAAAACCTATTTGACATTACAAATATAGAAGGTGGCTATAGCGGAGGAATTTTTCAAATTAATCCAATCGACAGAGACGCATTAGAAAAAGAACTATTTCAAGTTACTCTGACAGCGTACAAATATGATAACGAGACGTTCTCTACGTCAACCAACGTAGTTATAAGGGTCAATGACATAAATAACAAGAAACCAGAACCCCTACACTCTGAATACAACATCGCCATAGCCGAGGAAACACCTCAAACATTATATTTCGACAAGGAATTTGGATTTCACGATAGAGATTTGGGTCCGAATGCTCGATATACAGTCCACTTAGAGAGCGTCTACCCTGACGGCGTCGCTGATGCTTTCTTTATACAGCCGGAGACAGGGTATCAGAGACAAATCTTCATTATGGGCACCTCAAATCACAGTATGCTGGACTTTGAAGTGGAAGAATTTCAACATATACAAATTAAGGTAGTAGCCACCGACATAGATAAACCCGATTTTATAGGTGTAGCGATTGTAAACATCAATTTAACTAACTGGAACGACGAGTTACCTATCTTTAGTGAAGGCAATCAGATCGTGTCGTTCAATGAGACTGAGGGCGAAGGGTTCAGGGTGGCTCAAGTTCTGGCTCGGGACAGAGATATCGGCGACAGGGTTATCCACAGTGTCTTGGGTAATGCACAAAACTTTCTAAGAATAGATAATGAAACAGGAGAAATCTTCGTCTCTAAGAATAATAGTTTTGATTATCAGAGACAAAGCGAAATATTTGTTCAGGTGCTAGCCGAGGACACATTGGGCGAGCCGTACAACACTGCCACATCACAGCTCGTTATACGACTGAAGGACATAAATAACACACCACCGAGCTTGAGACTACCTCGGGGGAGCACACAAGTACAGGAGAACGTGCCAGCGGGTTACGTTATAAACGATCAGCCTGAGCAGTTCATAACAGCGACTGACCCCGACACCACCGCCAACCTGACCTTCCTGATCATATGGGAGACATCCTACGCTATGAAACGAGGCACGGAGACTGACAAGGACGAGATCGAAGGGTGCTTGGATATAGTCACGTCATATGTGAACGATAA
TAAAGGCCAGGCAGTCGGTCGCCTGGTGGTGAAGGAGATAAGGCCCAATGTGACCATAGACTACGAGAAGTTCGAGGTGTTGTACCTGACGGTGAGGGTTGTGGATACCGAGACGGAAATTGGAGAACCTTATGACGAATTAACTTTCACGATCACGATTCTGGACATGAACGACAACCCACCCGTGTGGGTGAACGGTACCTTGGAACAGACCCTCAGGGTACGGGAAAAGTCAGGAGCCGGCACCATTATAGGTACCGTTACAGCCACTGATGCAGACGGCCCACAGTATAACCAAGTTAGATATACCATTGTTCCCAGAGAAGATACGCCAGAAGACTTGGTGAAGATCGACTTCGACTCTGGTCAGCTGGCTGTGGACAGATCTGGTGCCATCGACGCTGACGAGCCGAGAGACAAGCTGTACTACACCATCCTGGCCAGCGATAGATGCTCCCAACCAGATAACACCACCTGCGGACCTGACTCCACTTACTTTATTACGGAGGGCAAGGTAACGATACAAATTATTGACACCAACGATCAAATTCCAAGGGCCAGAACCGACAGATACAATACCACGGTTTATATACACGAGAACGCTGAACCTGGGAAGGAAGTGGTCACTCTAGTAGCTGAAGACGGCGACAGAGATGTTATGTACAACACCATACGCTATCAAATCAATTACGGTGTGAATATGCGTCTCAATGACTTCTTTGCCATCGAACCTGAGTCCGGTCGTGTCTACGTTCACTACACCACCCACGAAGTCCTCGACAGAGACGGCAACGAACCAACACACAGGATATTCTTTACATTGATTGATAACTTCAATTTTCAAGGAGACGGAAATCGCAACCAAAATACGACAGACATTGAGGTGATATTGTTGGATGTCAATGATAACGCACCGCAGCTGCCATTGCCTAATTTTATTTGGTCTATATCAGAGAATGCGCAGCAGGGTATAAGACTTAACCCTGGCATATTTGCCGAGGACCGAGACGAACCCAACACGGACAACTCCCGAGTTGGTTACGAGATCACCAACTTGACTCTCACAAACAGAGACATCACTCCACCAGAACTCTTCACCGTCATGCACGTCTTTATATCTGATAATATTTACAACGTTTCCGCTGAATTAGAAGTAGTAAGACATTTGAAAGGATTTTGGGGAAATTATGACATTGGTATCCGTGCCTTCGATCATGGCGTCCCTCGAATGGAATCCAGCGAAATTTACCAAATCACTATAATACCATACAATTTCGAATCACCCAAGTTTAAATTCCCATTAGACGCCGCTACTCTAAGATTTTCAAAGGAACGTGCGCTAGTGAACAGTGTTTTAGTACTGGCCACCGGTGAGCTATTGGAACGAGTCAGTGCCTCTGATTCTGACGGATTGCAGGCTGGAGAAGTCACCTTCGAGGTGGTGGGAGATGAGCTAGCCAGTGAGCACTTCCGGATATTGAACAACGGGGATAACATAGGAACGCTTTTATTGACGCAACCGCTACCAACAGACCAGCAGATTATACAGTTTGAGCTTAAATTGAGAGCTACAGACGGAGGATTGGACCCCGGACCCCTCTCCAGCGAAGTAACCGTCAAAGCGGTTTTCGTGCCAACTCAAGGAGAACCAATATTCCCATCGTCCACGGCAAGCGTCGCTTTTGTTGAGAAGGAAGTGGGGCTAACCGAAAGACAACAACTGCCACTGGCCGAGGATCCGAAGAACCATCTATGTGACAAAGATTGCAGGGATATTTACTACGCTATCATCAGTGGCAACAACGACGGTATCTTTGCACTGGATCCTGTAACCAACGTGCTCACTCTGAACAAAGCGCTCAATCGTTCGGAGAGCGAATCACACGTATTACGAGTGGCCACCAGTAACGAAAAAAATATTTCACCATCTTTGGCTACATCCGTGATTATTGTAACAGTTAATGCGACGCATTCAGACGGGGCGCAAATCACTTACTCAA
TAGATTTTTCCTCCATGCAAGTCGATCCAAGCCTCAACAATGTACGGAACACCGCATTCATCCTTAACTCTAACTCTGGGTTAATGACCCTCAATATACAGCCAACAGCCAGCATGCAGGGAATGTTTGAGTTTAACGTACTCGCGACTGATCCACAGGAAGCGTTCGACACATCGGCTGTAAAGATCTATGTCGTGTCATCGCGGAACAGAGTCTCATTTCTATTCCTCAACATGCTCCAACAGATAGAGCAAATGACAGACTTTATAGCGTCAACCTTCACTTCTGGCTTTAATATGACATGTAACATTGACCAAGTGTTGCCGGCGACGGATGAGGCAGGATCTACCAGGGACAACGTCACTGAAGTACGCGCACACTTCATAAAAGATGACGTACCTGCTGAGGCTAGCGTCATTGAGGCGCTCCGCGGGGACATATCGCGTCTGCGTTTGATCCAAGGAACGTTGATCTCCAAAGAGTTGGTGCTGCAAGACCTGGTGACAGACATCAGCCCCTCAGAGCTGGCGGATGGACGTGCTCTGCTTTACACCTTAGCAGCAGTAGCCGGTGTGATGACGATTCTTCTACTGGTCACTATTGTGGTCTACTTCTGGAGGACCAGAGTACTCAACCGTCGCCTGCAAGCGCTGTCGATGACCAAGTACGGCTCCCAGGACTCTGGACTGAATCGTTTGGGTCTAGCGGCGCCGGGAACTAATAAACATGCGGTGGAAGGATCCAACCCCATATGGAATGAATCTATCAAGGCGCCCGACTTCGATGCCCTCAGTGAGACGTCTGATGATTCCGATCTAATCGGTATCGAGAATCTGGAACAGTTCCGAGACGACTACTTCCCCCCGGTTGGTTCGGACTCGGCGGAGGGAGTTACAAACACAGAGCCGGTGTCGACGCATCTTAATAATTTTGGATTCAACCCTACCCCCTTCACCCCCGAGTTCGCTAACAAACCGCGACAATATAAGATTTGA

Protein sequence:

>DPOGS204134-PA
MTNIPRESRPDDLPELVAEGDTWSDRPFLPGTEGEEVCFEQYRANVNIQVIFMDEENVGEVPIAKLNYQGDKTPTIVLPLTAGSFNLLRPELKKENNSWFLYVTQRQDYETLESALYILRIQIENEVLEGQISLWIVNIDDNPPIIHLLDACVVPELGDPRLTNCTYEVTDIDGRISTSNMTFKIGSDRDDENFFYFIGEPDTSNWNRMTTTLGINRALDFEASALHIFTVTALDSLNNNHTVTMMVQVQNVEHRMPRWVVIFAVQQIDELKAGNFNVRAIDGDTGINKRILYKIETERGEENLFDITNIEGGYSGGIFQINPIDRDALEKELFQVTLTAYKYDNETFSTSTNVVIRVNDINNKKPEPLHSEYNIAIAEETPQTLYFDKEFGFHDRDLGPNARYTVHLESVYPDGVADAFFIQPETGYQRQIFIMGTSNHSMLDFEVEEFQHIQIKVVATDIDKPDFIGVAIVNINLTNWNDELPIFSEGNQIVSFNETEGEGFRVAQVLARDRDIGDRVIHSVLGNAQNFLRIDNETGEIFVSKNNSFDYQRQSEIFVQVLAEDTLGEPYNTATSQLVIRLKDINNTPPSLRLPRGSTQVQENVPAGYVINDQPEQFITATDPDTTANLTFLIIWETSYAMKRGTETDKDEIEGCLDIVTSYVNDNKGQAVGRLVVKEIRPNVTIDYEKFEVLYLTVRVVDTETEIGEPYDELTFTITILDMNDNPPVWVNGTLEQTLRVREKSGAGTIIGTVTATDADGPQYNQVRYTIVPREDTPEDLVKIDFDSGQLAVDRSGAIDADEPRDKLYYTILASDRCSQPDNTTCGPDSTYFITEGKVTIQIIDTNDQIPRARTDRYNTTVYIHENAEPGKEVVTLVAEDGDRDVMYNTIRYQINYGVNMRLNDFFAIEPESGRVYVHYTTHEVLDRDGNEPTHRIFFTLIDNFNFQGDGNRNQNTTDIEVILLDVNDNAPQLPLPNFIWSISENAQQGIRLNPGIFAEDRDEPNTDNSRVGYEITNLTLTNRDITPPELFTVMHVFISDNIYNVSAELEVVRHLKGFWGNYDIGIRAFDHGVPRMESSEIYQITIIPYNFESPKFKFPLDAATLRFSKERALVNSVLVLATGELLERVSASDSDGLQAGEVTFEVVGDELASEHFRILNNGDNIGTLLLTQPLPTDQQIIQFELKLRATDGGLDPGPLSSEVTVKAVFVPTQGEPIFPSSTASVAFVEKEVGLTERQQLPLAEDPKNHLCDKDCRDIYYAIISGNNDGIFALDPVTNVLTLNKALNRSESESHVLRVATSNEKNISPSLATSVIIVTVNATHSDGAQITYSIDFSSMQVDPSLNNVRNTAFILNSNSGLMTLNIQPTASMQGMFEFNVLATDPQEAFDTSAVKIYVVSSRNRVSFLFLNMLQQIEQMTDFIASTFTSGFNMTCNIDQVLPATDEAGSTRDNVTEVRAHFIKDDVPAEASVIEALRGDISRLRLIQGTLISKELVLQDLVTDISPSELADGRALLYTLAAVAGVMTILLLVTIVVYFWRTRVLNRRLQALSMTKYGSQDSGLNRLGLAAPGTNKHAVEGSNPIWNESIKAPDFDALSETSDDSDLIGIENLEQFRDDYFPPVGSDSAEGVTNTEPVSTHLNNFGFNPTPFTPEFANKPRQYKI-
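
The transcript and protein lengths listed in this record (4995 bp and 1664 aa) can be cross-checked with simple codon arithmetic. The sketch below assumes the transcript consists of coding sequence only (it begins with ATG and ends with TGA) and that the trailing "-" in the protein sequence marks the stop codon. The function name `expected_cds_length` is hypothetical and introduced only for illustration.

```python
def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Return the coding-sequence length in bp implied by a protein length.

    Assumption: every residue is encoded by one 3-nt codon, and a single
    stop codon follows the last residue when has_stop is True.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return 3 * codons


# Values copied from the record above.
transcript_bp = 4995
protein_aa = 1664

# True when the transcript length equals 3 * (residues + stop codon).
lengths_agree = expected_cds_length(protein_aa) == transcript_bp
print(lengths_agree)
```

If the transcript included untranslated regions, the two lengths would not be expected to agree this way, so the check applies only under the coding-sequence-only assumption stated above.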