Monarch geneset OGS2.0

DPOGS205838
Transcript        DPOGS205838-TA  5631 bp
Protein           DPOGS205838-PA  1771 aa
Genomic position  DPSCF300081 - 42960-52762
RNAseq coverage   213x (Rank: top 46%)
Annotation
Heliconius      HMEL017331        0.0  54.29%
Bombyx          BGIBMGA010874-TA  0.0  63.55%
Drosophila      Cad74A-PA         0.0  45.60%
EBI UniRef50    UniRef50_F4WV67   0.0  47.43%  Cadherin-23 n=5 Tax=Formicidae RepID=F4WV67_ACREC
NCBI RefSeq     XP_316322.4       0.0  48.43%  AGAP006256-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|158295627      0.0  48.43%  AGAP006256-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|157123758      0.0  48.62%  cadherin [Aedes aegypti]
Group
Gene Ontology    GO:0016020  1e-31  membrane
                 GO:0007156  1e-31  homophilic cell adhesion
                 GO:0005509  1e-31  calcium ion binding
KEGG pathway
InterPro domain  [54-73] IPR002126  1e-31  Cadherin
                 [1271-1380] IPR015919  3.1e-31  Cadherin-like
Orthology group  MCL16548  Insect specific

Nucleotide sequence:

>DPOGS205838-TA
ATGACTGCCCAAGTCATCAACAGGGTGCCACATTTTATTCCCCAAACCGGGGACATGTCACAATTCAGTTTATCAGAAGACACGCCAGTTGGAACACCAGTGTATCATTTAAAAGGCATAGATCCAGAAAATGGAACTTTAAGATATTCTATATCTGGCCAATACTTCACCGTGGATTCCGTAACTGGCGTGGTCACGCTGGCAAGGTCATTGGACAGAGAAGAACAAGCGAAGTTAGAAGTCATCATAAGTATAACTGATGAGGGATTAGATGAGACGGAACCGAACACGGTGTCGCTTAGGAGGGTGATACCTGTGAGAGATGTAAATGATAACGCTCCCGTATTCCACAACCGACCATATATTGTAAATATAAGCGAAGCGACCTCTGTTGGCAGCGAAATCAAAGTAACGCCAGACATAATAATAACCGACCGTGATGAGAACGACAACGCAAAGATACAAGTAAAATGTTCAGATAAAGAAAGAGGCAGCGATGTTGAAGCCTGTTCCACCTTCAAAGTTACAACGGAAGAAATTACCCCAAACAAATATCAAGTCCGTATATTCCTTTCTAAGCCTTTGGACTATGAAAGCCGTTCGGCGTATGTGCTTACTCTTGATGCATCAGATGTTTCTCAAAAACCTTTAAACGCCATAGGAAGTGTGTCCATTGCTATACAGGACGTTCAGGATCAACCTCCTGTATTTGTTAACGCCCCTTATTCAGCTACCGTCCAAGAAAACACTCCTCCCAATACAAAAATACTGGAGATAGTAGCCAAAGATGGTGACACAGCTAATCCCCGACCAGTGTTATTAACCCTCGAAGACGATACCCAGCAGTACTTTCGTCTACAGTCTGAAAAGTCTATAGGCCGTGCGACACTCATAACATCTAATATTTCTGTGGATAGAGAAAGTGATGTAGTAGTACAGAATGGAGGCGTTTATACGTTTTTTGTAAAGGCAACGGAATTAATAAACAACGAAGTGCCATCAGACTATACAGTAATTCCTATAACAATTATCGTAACTGACGCCGATGATCATAAGCCAATTTTTAATGGGGATGTTTTTGATATTTCCATACCAGAAAACATTGAAAATGGAAGTCCCATTCCAGAATTGTCAATATACGTAGAAGATTACGATCTCGGTCAGAACAGTAAATACGACTTGCATATCCGTAACGTTCGTAATTCTGAAAATGTTTTCTCAATAACGCCAGATCATGGTGAAGGTAGAACGCCAATAAGCATTAAGGTTAAAGACTCATCCAAACTTGACTATGACGTTGACGATGAGGACAAGAGACTATTTAGTTTTGATATTGTAGCTACGGTTAACGGATTGGAATTGAGTCATGCTAAAGTGAATATAAAGTTACTAGATATGAATGATAATGCTCCAGTCTTTGACCAACCATTTTATAAATTTAATGTTCTGGAAAACGCATCTATAGGCTTTAAAATTGGCGAACTTCAAGCAAGCGATAAGGATTACGGGATTTTCGGGGAAATAGAATATTTTTTGTCCGGATTTGGATCAAACATATTTAAAACGGATAAAGTCCTGGGAGGAATATTTGTTGCAAATATGCTGGATTATGAAACCCAAAAAAGTTACAGTTTAACACTTCTCGCAAAAGATGGAGGCGGAAAATCAACGTCAGTTAGTATTTTCATCGACGTTTTGGATACTAACGATAATACACCGATATTTGAAGCTTCCGAGTACAGCCGTACTATACGAGATGGTGCGACAAGCTTTGAGCCTCAACTTGTAATACGGGCAACAGATGCTGACGGTCCTACGCAAGGAGGTGGTCGCATTAATTACTCCCTTGTATCAGACAACAGCATAGCCCATAAAGGTGAGGTTTTTGCTATAGATGAAGAAACCGGTGAGATCACTATAAATAATAAAGTTGAAACCATGGACACCCCCAGAGGTCAATATGAATTATTAGTACGAGCTACGGATTACGGAAATCCACC
ACTTCATAATGAAACACGAGTACTCATACGTGTAGGTGTTCCTGGTAACCAAAGACCCACTTTCAAAGGCAATTACCATCACTACAAATATATTGTCAATCAAAAAGATCTCAATGATGATTATACGTTCGATCTTAATCCTATGAATTATAAAGCTAGTATACGAGAAAATGCCAGTCCAGGACAAAACGTTACCAGAGTTGTAGCTCATGATCCAGATGGGATGGATGAATTACTGACTTACCATATTATCTCTGGATCGAAGGATAATTTCGCTATAAATGAAAAGACTGGTTTAATTACTGTATCAAACGATGCAAATCTTGATCGCGATATAAATCCCGAACGCTATGAGATAATTGTTTCTGCAGTTGATAGCGGAACTCCTATACCAGAAACGGCTACCACCACTGTGTTTGTAACAATACAGGACGTGAATGACAAACCACCGATGTTCAATGTAACAGAATCTACAACCTACATTTCAGAAAGGGCTCATGTTAATGATCTAGTTACGAAATTAGTAGCCCACGACTCAGATCTAAATGCTAAACTTAAATATAGTATAATAGAACCGATCAAGGCATTTTCAAAAGCTGGTGTACAGATCAAATCTAATTCGGCTTACGATTACAAAAACTTGTTCAAAATCAATGAAGACACTGGAGAAATTTTTGTGAACGGAACGTTAGATTACAGTCAAACATCCATAGTCATACTTACAATAAAAGTCACTGACGTAAACGCTGAAATAAACGAAGATAAGCAATTCGCACTGTTAGAACACACTATTTATATTCAGCCATTTGCTGACAAAAATCCCCAATTTACTAACGCCGGTTGGACAAACTCCAATCCTACAATCCATCATAAAATAAAAGAAGAGCAACCGATCGGTAGTACTGTTCTCGTATTGATGGCTGAGGATCCGACGTCTGGCCACATCGTTTCTAATTTCAAGGTTATCAATTCACAAACAAGTCTCTTACAAGTTAACCCTTTAAGTGGACAGGTTGTGTTAACAAATCATTTGGATTACGAGAAGTTGAGCAACCCCAATTTGACACTAACAGTACAAGCTACCAGTACTGATGGAAGCAAACACAGCATTGCCAATGTTATTGTTGAAGTTGTAAATGTAAATGATAATGAACCCGTATTTGAAAAAGAGATTTATAAAGTGAGTATTTTAGAATCCATAAAGTATCCAGAGCAAATATTAACTGTTAACGCCAAGGACACTGATGCAGTTCTTACCGACGAAGATAAGAAGAATGGCTTCTCTGATGTGAGATATTCAATAAAGGGAGAAAATTCTGAACTCTTATCAATAGATAGTGTCACAGGAGTTATTCAGGTGGGTGAAAACAAAACGTTGGACCGTGAACGTCAATCCGTTCTTAGGATTGAAGTTGAAGCTTACGATATGCCCGCTGGCGGAGCTGACAGGCTCAAATCAGTTGCGACTGTGCTGATTGACGTGCTTGACGTAGATGACAATACTCCTGTATTTGATAAGAGCGTTTACACTGCTGTTGTGCCCGAAAATGTACCGATTGGAATAAACGTCGTAAAACTTACTGCGGTTGATCCCGATGAAGGCCTCGGAGGAGAAATCAAATATGAATTTTTGGATGAAGGAGAAGCACACGGTCTATTTACAATTGATTCTACGACTGGAGAGGTCACAACCCGAAGCGCTCTCACTGGTCGGGGTCGCACGGATCCATACCGTCTGCTGGTAGGGGCAGTCGACGGAGGTGGGCATTCAGGAGATGCATCCCTTTCACTGTATATAGGTGACGTAAGCGCTAACGATGGCGTGCCTAGATTTATTAGACCAGCAGACGGCGAGGTCCTTAATGTTAGTGAGAATGCCACGATTGGAACTTCGGTGTTCCAAGTGGTGGCCAGTGACCCCGACGACCCCACCCAGCCATCTGGACAATTGTCCTACTCAATACAACAAGACGATGCAGATTCTAAAATATTTAGTATAG
ATCCAGACAGTGGTCTACTTACAACAAGACAGATGCTTGATCGTGAGAGTAAAGCCACGTACACATTAGTGTTAGTCGTCTCTGACCATGGATCACCACCTCAGCAGAGCAGCCGTATTGTTACAGTACATGTCGGTGACATAGATGATCATAAGCCGCACTTCTCACGAGCACTGGATGAACCACCGCTGCTGCTCTCTACAAAGGAGGAAGTACCAGTAGGAACGGTCATAGGCACTCTAGAGGCTATTGACGAAGACATTGGTGACAATGCGGCCATAGATTATGTTATAACGGCGGGCAACGAGTTGGAAATAGTAAGTTTACAACGAACAAAGGACAACAAAGCTGACATTGTGGCAGCGGGAAGACTCGACCGTGAGACTGTGTCTAGACTACTTCTAACAGTCAAATGTTTTAAATACGATACAAAACCACGCATAAATAAGGACTACAACCGTTTGGACGCGTCGGAAATACAAGTGGTTATAAAAATTCTGGATATAGACGATCATCTTCCCGAATTCGAGAGTTCAAACATGACAATCGGTGTTCGCCTAAACGTCGCCGTCGACACCGTCATAGCAACAGTGAAAGCTAAGGACAAAGATCCAGAAGCCTTACCAATCAATTACGCCATAGTTAATATGAATTTCGTTTCACCGATTAAATCCAAATCATCGAACAACACATCCGATGTCATAGTCATAAACAACGTGACAGGGGAATTAAAAATTATGAGAAACTTAATACATTTTGCTGATGGAATATTTAGACTTGTCGTTCGAGCCAACAACTCCAACGAAACGGATCGTTTCAGTGATTTACAAGTAGAGGTGGTAGTTGTACGGGAGCGTGATCTCCTTCGTCTGGTACTGGGCGGAGACAGTCGGGGTGCGCGTGCGCAGGTCGCGGGACTAAAGGAACGAATGTCGGCTGCTATAGCACCACAGCGGCTAAAATTGCAGTTACATGAAGCACCCCGACATGACGTGTATGATAATCTTGGACCATGTTTCCAATTCCGTAAAATGGAAACTGGAGAGGCGCTCACTCCTAACGCGATGAAAGCCACCATTCGTACCCTGGGTACAGAATTCCAGGATATTCTTCAAACTTACAAAGTGCACAACATCACCTGGTGTGGGGCCAAGCGCGCTGCTCCAGCACCCGCCCAAACTGCACTCCTCGCGGTCGCAGCGATGTTACCAATCGCTGCTTTCGTGGCCACACTTGTGTTGTGTTGTATGCATTCTCATGGTATGTATATAGTTAAAGTACATGAATATTATATATCATCATCACCATCAGCCTAAAGGAGCCCACTGCTGAGCAAAGGCCTCTTCTCACATGGATTAGGTTAGAGTATTAATCACCACGCTTGCTCAAGACGGGTTGGCGATTTCGATCCTATAATTTGTAATTATAAGACTAGGTTTCCTTACGATTGTTTCCTTCACCGTCCGTCAGTGGTGTCTTAATACTCTTAGAAAGTACATATATTTCGGAAAAGATCACATTGGTACTTGCTAGGTTTCGAACCCGCGCCCTCGAATATGAGAGGCGGGCCTTTTATCTCCAGGCCACCACGACTTATGTTATATATATCTTATCTTTTTTGTTCAATATAA

Protein sequence:

>DPOGS205838-PA
MTAQVINRVPHFIPQTGDMSQFSLSEDTPVGTPVYHLKGIDPENGTLRYSISGQYFTVDSVTGVVTLARSLDREEQAKLEVIISITDEGLDETEPNTVSLRRVIPVRDVNDNAPVFHNRPYIVNISEATSVGSEIKVTPDIIITDRDENDNAKIQVKCSDKERGSDVEACSTFKVTTEEITPNKYQVRIFLSKPLDYESRSAYVLTLDASDVSQKPLNAIGSVSIAIQDVQDQPPVFVNAPYSATVQENTPPNTKILEIVAKDGDTANPRPVLLTLEDDTQQYFRLQSEKSIGRATLITSNISVDRESDVVVQNGGVYTFFVKATELINNEVPSDYTVIPITIIVTDADDHKPIFNGDVFDISIPENIENGSPIPELSIYVEDYDLGQNSKYDLHIRNVRNSENVFSITPDHGEGRTPISIKVKDSSKLDYDVDDEDKRLFSFDIVATVNGLELSHAKVNIKLLDMNDNAPVFDQPFYKFNVLENASIGFKIGELQASDKDYGIFGEIEYFLSGFGSNIFKTDKVLGGIFVANMLDYETQKSYSLTLLAKDGGGKSTSVSIFIDVLDTNDNTPIFEASEYSRTIRDGATSFEPQLVIRATDADGPTQGGGRINYSLVSDNSIAHKGEVFAIDEETGEITINNKVETMDTPRGQYELLVRATDYGNPPLHNETRVLIRVGVPGNQRPTFKGNYHHYKYIVNQKDLNDDYTFDLNPMNYKASIRENASPGQNVTRVVAHDPDGMDELLTYHIISGSKDNFAINEKTGLITVSNDANLDRDINPERYEIIVSAVDSGTPIPETATTTVFVTIQDVNDKPPMFNVTESTTYISERAHVNDLVTKLVAHDSDLNAKLKYSIIEPIKAFSKAGVQIKSNSAYDYKNLFKINEDTGEIFVNGTLDYSQTSIVILTIKVTDVNAEINEDKQFALLEHTIYIQPFADKNPQFTNAGWTNSNPTIHHKIKEEQPIGSTVLVLMAEDPTSGHIVSNFKVINSQTSLLQVNPLSGQVVLTNHLDYEKLSNPNLTLTVQATSTDGSKHSIANVIVEVVNVNDNEPVFEKEIYKVSILESIKYPEQILTVNAKDTDAVLTDEDKKNGFSDVRYSIKGENSELLSIDSVTGVIQVGENKTLDRERQSVLRIEVEAYDMPAGGADRLKSVATVLIDVLDVDDNTPVFDKSVYTAVVPENVPIGINVVKLTAVDPDEGLGGEIKYEFLDEGEAHGLFTIDSTTGEVTTRSALTGRGRTDPYRLLVGAVDGGGHSGDASLSLYIGDVSANDGVPRFIRPADGEVLNVSENATIGTSVFQVVASDPDDPTQPSGQLSYSIQQDDADSKIFSIDPDSGLLTTRQMLDRESKATYTLVLVVSDHGSPPQQSSRIVTVHVGDIDDHKPHFSRALDEPPLLLSTKEEVPVGTVIGTLEAIDEDIGDNAAIDYVITAGNELEIVSLQRTKDNKADIVAAGRLDRETVSRLLLTVKCFKYDTKPRINKDYNRLDASEIQVVIKILDIDDHLPEFESSNMTIGVRLNVAVDTVIATVKAKDKDPEALPINYAIVNMNFVSPIKSKSSNNTSDVIVINNVTGELKIMRNLIHFADGIFRLVVRANNSNETDRFSDLQVEVVVVRERDLLRLVLGGDSRGARAQVAGLKERMSAAIAPQRLKLQLHEAPRHDVYDNLGPCFQFRKMETGEALTPNAMKATIRTLGTEFQDILQTYKVHNITWCGAKRAAPAPAQTALLAVAAMLPIAAFVATLVLCCMHSHGMYIVKVHEYYISSSPSA-