Monarch geneset OGS2.0

DPOGS201962
TranscriptDPOGS201962-TA5328 bp
ProteinDPOGS201962-PA1775 aa
Genomic positionDPSCF300060 - 642962-654917
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0050870.062.71% 
BombyxBGIBMGA010542-TA0.088.25% 
DrosophilaCad86C-PF0.042.55% 
EBI UniRef50UniRef50_Q9VGW10.042.71%Cadherin-86C n=20 Tax=Drosophila RepID=CAD86_DROME
NCBI RefSeqNP_788635.30.042.66%Cad86C [Drosophila melanogaster]
NCBI nr blastpgi|3535263430.042.71%cadherin [Drosophila melanogaster]
NCBI nr blastxgi|3535263430.042.53%cadherin [Drosophila melanogaster]
Group
Gene OntologyGO:00160207.6e-26membrane
GO:00055097.6e-26calcium ion binding
GO:00071564.2e-23homophilic cell adhesion
KEGG pathway 
InterPro domain[474-581] IPR0159197.6e-26Cadherin-like
[478-587] IPR0021264.2e-23Cadherin
Orthology groupMCL15154 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201962-TA
ATGACCTTAACGGCGGTGCTGGTGTTATTGACGCTGGCTGCGGCAGCTCACAGCGGCGATCCCGTGTTCGACCCCAGCACACTCATGAGGCTCGTGCTGGTGCCGGCAGACGCGGCCACCGGCTCCGTCATCTACCGCGTTCGAGCTTCCGATCCGGACTTCGATTACCCCCTGCACTTTGAACTTATCGGGCAAATGGGCAGACTCGACATAGGCATTGAGACATTACCTTGTACTCGGTACAATTCAGTCTGTCAAGCCAATGTGATATTGCTAAGGAGGCTGGAGCCAGGCCGCTACGTGGACTTTAGGCTGTCGGCTCGGAATACCAGAGGGCGAAGTGCTCGAATTGCCTGCTCGATCACAGGCACTAATGCTACTACCCCTCGAGATACTATATTTCCTCATCAACCCAGTATCATTTTGGTGCCAGAGGATGCGAAGAGAGGAACCGACCTTGAAATTGTAATTGCAAGGAAGAATCCAGTATCCCCAAAACCTCTGGAGTTGGAGCTTTGGGGTTCACCGCTCTTCGCAATCCGTCAACGCCGAGTTTCTAGCGAGAATACCGAGGGCACAATATTTTTAGTTGGTCCATTGGATTTTGAGGCTCAATCCATGTATCACCTAACATTGCTCGCTGTTGATCCCTACGTCGAAATAGGCAAGGACACGCGAAATATAGCTGGTTTAGAGGTTGTGGTGGTTGTGCAAGATGTTCAGGATATGCCTCCTGTTTTTACTTCAGCTCCACCCATCACCCATTTACCTCGACAAGTAATTCCTGGTGATATGATTGTAAGAGTAAGAGCTGAAGATGGAGACAAGGGTGCTCCTCGGCAAATAAGATATGGACTAGTTTCTGAAGGGAATCCGTTTACACCGTTCTTTAACATCAATGAAACAACAGGTGAGGTAACTTTGGAACGGCCGATTGAAGAAATTGCAGCAATATCACACGCTGGAGCACCAATACTTCTTACTGTTGTTGCTGAAGAAGTACGACTTTCCCGGGAGGAACCGGAGGCTATGTCATCGACGGTTCAACTGGCGTTCATTTTACCTGAAAGGGATAATTCTCCTCCGTATTTTGAAAATGAATCTTACATTACACATTTGGATGAAAATGCTCCGCAAGGTACAGCTTTGGTATTCAACGATCCGTATATACCGCAAGTGAATGACAACGATGCGGGAAAAAACGGAGTGTTTTCTCTTTCGTTAGTTGGAAATAATGGGACTTTTGAAATATCACCAACAGTGGCGGAGAGACACGCGCAATTCATTATCAAAGTGCGGGACAATACTATGCTGGATTATGAAGCCAGAAAATCAGTCGTGTTTCAGATTTTGGCACAAGAACTTGGTCCAGCAACAAACTTGTCAGCAACTACGGATGTGACCGTTTACTTGAATGATGTTAATGATAATCCTCCTATTTTTTTGGCCTTATCTTATGATGTGGAATTGCCAGAAAACGTTACAGCTGGAACTAGAGTTGTCCAAGTTGCAGCTGATGATGTAGATACAGGGGCATATGGAAAAATTCAATTTACTGCCATACTAGGATATTTAAATACTTCTTTAAATTTAGATCCTATAACAGGAGTTATAACTGTAGCAACAAATAATCACGGTTTTGACCGTGAAGCAATGCCAGACCTGCATTTTTTAGTAGAAGCTAGAGATAATGACGGTATAGGATTAAGAGTAACAGTGCCATTAATTATTAAGTTATTAGATGTAAATGATAACCCACCAGAGTTTGAAAGGGCCTTATATGAATTTGTATTGTCTCCGAGTTTAAATAATTTTACTTCATCGGCATTTGTGAAGGCTGTTGATAAAGATGCCGAACCACCAAATAATGTAGTCAAATACGAAATAATTGAAGGAAACCGTGAGGGTAAATTTGCTATTAATGAAGATACAGGGGAAATTTATTTATTAGAACCATTGAAAAGAATCACAAAGAAAAATGCAAATAGGCGTAAAAGGCAATTTGATAATCAAGAAGAGAGTGAAGTTTATATGTTAACTATTCGAGCCTATGATATGGGAGTTCCAAGGTTGTTTTCAACAACAACAGTGAAGATTTATCCTCCAGAAAGTAAAACTAGAACTATGTCTTTCATTGTTCCCGGTGCTAATCCAGATAGAAAAAAATTAGAAGAAGTTTTGAGCACATTATCTGGAGGAAAAGTAACTATTGTAGATATAAAACCTTACAAAACTAATGATGATAAAGGTTCAATAGATCAGAGTGGCCAGGCGTCTAGTCAAGAAAAAAGTGTGGTTACTGCAGTAGTTCGTATGGCTGGTAATGCTGCTATTAATGTTGCAAAACTTCAAGAACAATTATCAAAGAATATCACATTATATTCCACCACTGTCGTGCAGAAGGATCAGACATCAATAATTGATAACGATTCTGGCGTGTATAAAGCGGAGAGTCGATTATTATTCTGGCTTTTAATCTTATTAGCAATATTAATAGCTCTCGTTTTATTACTTCTTATATGTTGCTGTATTTGCGAAGGTTGTCCATTGTATATGCCACCAAGGAAAAGAGTGATACGTGTCAACTCCACAGAGGACGACGTGCATTTAGTGGTCCGCGACAAGGGAGTTGGGAGGGAAAACAAGTCGACCCAAAACTTAGAAAGTAAATCAATACAAGCACCTGAATGGAGGAGAAGAGAAGCTTGGAGTGCCGAACAAACAGATATTAGAACAAAACCAACACAATGGAAATTCAATAAAAGAAATTATAAATCAAAGGAACTGAGTAAACCCGCTTCTACGCCTGGTGACATTAGACAAGAATTTGTGCAAGCTGCCACTGATAATGATTATAAATACGATGATTCTAGACAGTCTTTTCGACGAAGGGATGGCCCCAACATAATTTATACAAAAGAAATCCAACTTGAAGATAACTTTGCTAGTAAGCATAAAGAGTACATAGAAGATCTTGAAAATGGTTATGATAGAATAGCAACTCTTCACCATCAACGAAAAGATCAGGACAATGATTCTATAAGAAGACATGAAATCGATCGAGGGTCTGAGGTTGAAGGCGATCATAAGACTGATGATAAAAATTTCCATGGTGAACATCGATTAAAAATTGAGTACGCTGATAAGCGCGACCCGTCTTCCATGGGCAGAGATCAATTTTTCATTAAAGAGGGAAACACTGAAATATTAAGGCTCGTTACACGGGGAAAAACTGAGGACGAACGATACGTTAACCTTCCTATTCAACAACAACAACGACCAGTTACATTAATTCCTCACACTCAATATGTAGTTGTGGATAATGGTAAGGAATTGTTAATGGAACGATTTATTAGAGAACAAGAAGAGGAAGCTAACAACATAAGGGAAAGAATGAGTAAAGTTGTAACTGATTTAGACAACGTCCAATCGCCTCAAGATGGTAAAAAGTCTCAACTAATTATAGACAAACATAGTTTTCCGCCTGAATACACTAATATGACACCAGAAGTCCCAGGGGCAGTTCCTCATAAATCGGATTATTTACAATCTGCACTTATAGAAATGCACAATAAGTCATCTATTCACCAAGAACTATTGGAATCATCATTAAGAAAGCAAAATGAACTTTTACATCAAATATTAATTGAACGTGAAAGAATATTACACAATCAGGAAACAGCATCTCAAGTGGAAAATAAGTTAGAGACCCAAAGTTTACCAGGACATGCTGTGATGGCTACTCAAACTGAGTGTCATATTGGAACTCAAACAGATTCTCATTTACTAAATGAAGTGAAACGGAAATCTCGTAGTGATAATGAATCATACAGTGAAGACGAATCACAAAAAAATTCAGATAAAACCCATAAAGTAACATGGGTTAAAAAGAAAAAACCAAGAAAAAAAATTAAATACAAAGACCCGAGGCGTAGTATACGTGTTTATGAATTGAAAAGAAAAATTAAAACGCCTATAATTGAGGAAAGCGACATATCACCATCTGCAGAGAATGAAAAGCAGATTAAAATAAGCAAAACAAATGAAAGAGAGGAAAGAATCAAACATTACGGTGACATCACCAAAAGTACGGTTACTACTTCTAAGAATGAATATGTCAGTTCCACACAATATAAAAATTCAGAACCCGACAGAAAACTTAAATCTCGTAGAGAAGTATTGATGGAAATATCAGATTCTTTGGATGAAAAATCAACACTTGATAAAAAACAACAACAAGAACCATCTTCTTCTTCATCTTCGAGAACTGAGAATTTCAAAAGAACTATTTCCTATGACCATGAAAGTAATAAGGGCTCTCACAGTGGCACAGATTCACCAGGTGAAGATAAACGGAACTCAATATTCTCGCGACAAGCATCTTCAGAAGAGGCTAAAGAAAATATTTGTAAACAGTTGGAATGTCATAGTAAAAGTAATAAAATTGAAAACCATCAAGAGCTTAGTGATAAAGACCTTTCAACTAACGACTCCATAAAAATTAAGGAAAAAGAGAATAAGGAATCTATGAAAACTAATGAAAAGCCTGTTAAAAGTTTACCACGATATATGCAATGGTATGGCAAAAAATCAAAACCTTCAACATCTGAAAAAATCGTACCAGACAAACCAAAAAGACCATCTAAAATAAAAATAGAACAAGAAAGAGACACAAAAGATAGAACAAGCCGATACGGTAAAATTATTAGCAAAGATAGTCAAGTTGGCGAAGTAAAAAAAAACTTCAAAGAAAAAGAGAGTGATTTTATCCATCCTCGCTTATTAAAAGAAGGAAAGGTAACACCTGTTCCAGAAGGACCGTTACCTGATGTACATCCCTTGTTACAACACTCAGAGCACAGATACGAGCACCAGTACCAAAATCAAAATCCTCTATGTTACGTTCAACAAACACATATACCAAAATACTTAGGTAGCCAAACTGAAAAACCAGTGTTGCCACAAAGAACTTCTTTAGAGCAACAGCCAATTTATGTCAATCAAGACGGTGTAACAGAAAACAGCACAAAACCGGATATTGCAGAAAGTGCCTTAACTCATAGCATTGCTATATCAAGTGCGTATGGAAAAGAACAAAAAACTGCGTCGGAAGTTCACGTCTCCAAAATTAAAATATCGGGTGAAACGATTTCTGAGAATCAAAGAAAATTGGACGACAATGATTCAGGCATAGCTATGAGCACACTTGTACACCAAACAGGTATTAAGAGATTACCTATAACGGAGAAGAAAAGTGTGTTTACTATTGCATACGACGACGTACAAACGAAACAACTACGGCCTGACAGCAGCTCCACCTCTTATTAA

Protein sequence:

>DPOGS201962-PA
MTLTAVLVLLTLAAAAHSGDPVFDPSTLMRLVLVPADAATGSVIYRVRASDPDFDYPLHFELIGQMGRLDIGIETLPCTRYNSVCQANVILLRRLEPGRYVDFRLSARNTRGRSARIACSITGTNATTPRDTIFPHQPSIILVPEDAKRGTDLEIVIARKNPVSPKPLELELWGSPLFAIRQRRVSSENTEGTIFLVGPLDFEAQSMYHLTLLAVDPYVEIGKDTRNIAGLEVVVVVQDVQDMPPVFTSAPPITHLPRQVIPGDMIVRVRAEDGDKGAPRQIRYGLVSEGNPFTPFFNINETTGEVTLERPIEEIAAISHAGAPILLTVVAEEVRLSREEPEAMSSTVQLAFILPERDNSPPYFENESYITHLDENAPQGTALVFNDPYIPQVNDNDAGKNGVFSLSLVGNNGTFEISPTVAERHAQFIIKVRDNTMLDYEARKSVVFQILAQELGPATNLSATTDVTVYLNDVNDNPPIFLALSYDVELPENVTAGTRVVQVAADDVDTGAYGKIQFTAILGYLNTSLNLDPITGVITVATNNHGFDREAMPDLHFLVEARDNDGIGLRVTVPLIIKLLDVNDNPPEFERALYEFVLSPSLNNFTSSAFVKAVDKDAEPPNNVVKYEIIEGNREGKFAINEDTGEIYLLEPLKRITKKNANRRKRQFDNQEESEVYMLTIRAYDMGVPRLFSTTTVKIYPPESKTRTMSFIVPGANPDRKKLEEVLSTLSGGKVTIVDIKPYKTNDDKGSIDQSGQASSQEKSVVTAVVRMAGNAAINVAKLQEQLSKNITLYSTTVVQKDQTSIIDNDSGVYKAESRLLFWLLILLAILIALVLLLLICCCICEGCPLYMPPRKRVIRVNSTEDDVHLVVRDKGVGRENKSTQNLESKSIQAPEWRRREAWSAEQTDIRTKPTQWKFNKRNYKSKELSKPASTPGDIRQEFVQAATDNDYKYDDSRQSFRRRDGPNIIYTKEIQLEDNFASKHKEYIEDLENGYDRIATLHHQRKDQDNDSIRRHEIDRGSEVEGDHKTDDKNFHGEHRLKIEYADKRDPSSMGRDQFFIKEGNTEILRLVTRGKTEDERYVNLPIQQQQRPVTLIPHTQYVVVDNGKELLMERFIREQEEEANNIRERMSKVVTDLDNVQSPQDGKKSQLIIDKHSFPPEYTNMTPEVPGAVPHKSDYLQSALIEMHNKSSIHQELLESSLRKQNELLHQILIERERILHNQETASQVENKLETQSLPGHAVMATQTECHIGTQTDSHLLNEVKRKSRSDNESYSEDESQKNSDKTHKVTWVKKKKPRKKIKYKDPRRSIRVYELKRKIKTPIIEESDISPSAENEKQIKISKTNEREERIKHYGDITKSTVTTSKNEYVSSTQYKNSEPDRKLKSRREVLMEISDSLDEKSTLDKKQQQEPSSSSSSRTENFKRTISYDHESNKGSHSGTDSPGEDKRNSIFSRQASSEEAKENICKQLECHSKSNKIENHQELSDKDLSTNDSIKIKEKENKESMKTNEKPVKSLPRYMQWYGKKSKPSTSEKIVPDKPKRPSKIKIEQERDTKDRTSRYGKIISKDSQVGEVKKNFKEKESDFIHPRLLKEGKVTPVPEGPLPDVHPLLQHSEHRYEHQYQNQNPLCYVQQTHIPKYLGSQTEKPVLPQRTSLEQQPIYVNQDGVTENSTKPDIAESALTHSIAISSAYGKEQKTASEVHVSKIKISGETISENQRKLDDNDSGIAMSTLVHQTGIKRLPITEKKSVFTIAYDDVQTKQLRPDSSSTSY-