Monarch geneset OGS2.0

DPOGS204438
Transcript: DPOGS204438-TA | 5274 bp
Protein: DPOGS204438-PA | 1757 aa
Genomic position: DPSCF300002 - 127653-145016
RNAseq coverage: 614x (Rank: top 21%)
Annotation
Heliconius: HMEL017401 | 0.0 | 67.35%
Bombyx: BGIBMGA013573-TA | 0.0 | 62.14%
Drosophila: Cad87A-PA | 0.0 | 38.04%
EBI UniRef50: UniRef50_D6WC50 | 0.0 | 45.55% | Cadherin 23 n=2 Tax=Neoptera RepID=D6WC50_TRICA
NCBI RefSeq: XP_392736.3 | 0.0 | 47.24% | PREDICTED: similar to Cad87A CG6977-PA [Apis mellifera]
NCBI nr blastp: gi|340729964 | 0.0 | 47.18% | PREDICTED: cadherin-87A-like [Bombus terrestris]
NCBI nr blastx: gi|328782343 | 0.0 | 47.23% | PREDICTED: cadherin-87A-like [Apis mellifera]
Group
Gene Ontology: GO:0016020 | 2.2e-36 | membrane
  GO:0005509 | 2.2e-36 | calcium ion binding
  GO:0007156 | 3.6e-35 | homophilic cell adhesion
KEGG pathway 
InterPro domain: [631-734] IPR015919 | 2.2e-36 | Cadherin-like
  [635-739] IPR002126 | 3.6e-35 | Cadherin
Orthology group: MCL16017 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204438-TA
ATGGCTGCCGGTGTTCTGGACAGCTCTGATCTGCTTACGATAAAATATGGTTTAGTTGGCACGGACAAGTTTGCTGTGAATTCAGATACGGGGAATGTGACTCTTATACAGCCTTTGGACAGAGAGAAAGAAGACACAATAAAATTCCTCGTGTCCCTTGAGAACAGCGACCCTGGGTCAGGTCAGACCCTGGTCCAGACTCAGTCCGTCACCGTCATCATAGTTGACGAAAATGATAACCCACCACTCTTCAAGAATAGTCCCTACGAAGTTAATGTGGCGGAGGACGAAGTTGTTGGGACGACGGTATTGAGTAACATAACGGTCGAGGATACTGACTCCGTGGGCGAGGCGCTGGAAGTGGGCTGCGTGGCCAACGAACAGTGGCCAGAAGCCTGCGATCTGTTCGAGGTGGTGAGTCTCCATTCGACTGCGAACGAGTTCACAGGAGCTCTGGTACTGAAAAGGTCTCTGGACTACAACGAGAAGCAGTTCTACCAGTTCCAGTTACACGCCACGGACGGGACCCTCAACTCGTCATCTCCGGTGGAGGTGAAGGTGGTGGACGTCCAGAACACCCCGCCGGTGTTCAGCGGGGCTCTGAGCGCGGCCTTACCTGAAGACGCGCCCGTCGGCAGCCTCGCCCTCGTCATCAGGGCCAAAGATGGAGACCGAGCACAGCCTAGAGACGTGCAGCTCGAACTGCTTACAAACCCAATGGACTTCTTCGTACTGGACAGTAAGACCGGTGAACTGAGAACAGCTAAGCCTCTGGACAGAGAAGCCCTGGCCGATCCGAACTCACCGCTCAATATTACGGTTAAAGCCACTGAGCTGGTGGACGGCATCCCCGTGGAATCCCCGTTGAGTGTGACCCGGGCGGCGGTGGCTATCACCATCCGCGATGTCAACGACGAGCCGCCACGGTTCAACCGCCGCGAGTACTCCGTGGAACTGCTGGAGAGCCTCGCCCGGGGCACGCCGCTGCCACACCTGGACATGGTCGTCACTGACACAGACCTGGGCATGAATTCGGTGTTTTCATTGAGACTAGCCGATAACCTGGGTGCGTTCGTGGTGGAACCCTCGGTGGCCACCGGCAGCGCCACCGTCACCTTAAGACTCAACTCCACCCTGGACTACGAGGATCCCAACCAGAGAAAGTTCATTCTGGAGGTGATCGCGGAAGAAGTCCACACGTCTCCAGCGCTCTCGTCCAAGGCCAGTGTGACAGTGTCGGTGAAGGACGTCAATGATAACTCCCCGGAGTTCGCGGACGACTCCTACTCCGCCGGGGTCAGGGAGGACGCCGCTCCCGGGACCAGACTGGCCGCTATCAAAGCCACGGACAGAGATACGGGGAGATTCGGTACCGAGGGTATCGTGTACCAGCTGTCGGGTAACGGGGCGGAACTGTTCTCAGTGGAGCCGAGGAGTGGAGTCATCAGTGTGGCAGAGTGCGCCAGCCCGGGGGTTGGACCCTGTCTGGACTACGAGACACGGAAGGAATACTTCTTACAGTACAAGGCTACAGATGACGACGGTGCAGGTCAGAGCAGCGTGGTGTCTGTCCAGGTGTCGGTGAGGGACTCCAACGACAACCCTCCCGTGTTCCTGACGCCGGTGTACCGAGCCTCCATAGACGAGGACGCCACTAAATTCGAACCGGAACTACAGGTCCAGGCGCGCGACCTCGACCCGACGTCCGACGTGCGCTACTCCATCGTCTCTCCTCACAACCCTCCCTTCTGGGTGGAGCCTTCTTCCGGTCGCATCTCCGTGCTGCCCGGCAAGCTGCTAGACGCCGCGGACAACAAATACATACTCACTGTGATGGCCTCTGACGGCCGCTTCAACGCGACGTGTCGCGTGGAGGTGACCGTCCGTGACGTCAACGACCACCCGCCCGTGTTCGCCACCACCACCTACGACGCCAGCGTGTCCGAGGACGCCGCCATCGGGGCGGAAGTGGCGGCGGTCCAAGCGACAGATCTGGACAG
TGGGATGAACGCTCAGATCAGATACAACATACAGAAGGGAGCTCTGGACGCCTTCAGGATACACCAGGACACGGGGGTCGTAAGCGTGGCCGGCGGACTCGACTACGACCGCAAGAACACCTACCACGTCCTCATCACCGCCACCGACATGGGTGTCCCCAGTCTGACTGGCACCACGGAGTTGACGGTTCACGTGCTGAACGTGAACGACAAGAAGCCGGTGTTCAAACCCGCGGTCCAAAGAGCCGGGGTGTCCGCAGACGCTGAGACCGGCACCCTCGTGTACCAGCTCGTGGCGGAGGACCCTGACGCCGACGAAGGCAGCTTGAGGTTCGGGCGGGGAGATAGACCCGTCAGGGCGGTGGACGTCAACGGACAAGAGGTGAAGGATGACGGCACGATAGCCTTGTGGTTCCTGGTGTCCCCGTCTGGCGAGGTGTTCGTGTCTCAAACCCCGGACCGCTCCCGGGCTGCGGTGGTCACGTTGCCCGTCTCCGTCACCGACCAGTCCGCGCCCGTGCCGCAGACCACTGAGGGCGAGCTCATAATAACCATAGTGGACGTGAACCGTCAGCCGCCCGTGTTCTCTCAGCCGGCGTACGTGGAGCGCCTGGTGGAGGAGCAGAGCCCGGGGACGGTGCTGGCCTCCTACAGCGCCACGGACAAGGAGACGCCCATCGCCTCCATCGTCATACACCCGCCCAGCCCGTACTTCGATATCGACAATGTCACCGGTGAAGTGAGGGTAGCCAGCCGTATTGACTACGAAGCCACTCCGTCCTTGAACTTCACGCTGGTGGCGTACGACTCTGGCGTCCCTCAGCTGTCGTCGTCGGCGGCTGTGTTCGTCGAGGTGGTCAACATTAACGACGAGAGGCCCGAGTTCAGCGCCGACCTGTATAACGCTACGGTGGAGGAACACTCCACACCTGGGACCAGTGTGCTGAACGTGTCTGCTGTAGACCTCGATGCAGGCGACTTCGGCAAGATAACCTACAGTCTGTCCGGGGAATCAGCCAGCCTGTTCACCATCAGTCAGGAGGGAGAGATCACAGTGGCGGAGGGAGCGGATATCGACAGGGAAACCACCTCAGAGCTGTGGTTCCGTGTGGTCGCCACAGATAACGCGCCGCCACAGACTAAAAAATCTTCCAGTGTACCGGTCCACGTCATCATCACGGATATCAACGACAACCCGCCGGTGTTCAGTCAGCGTGTGTATAAGGGCAGCGTGTCGGAGAACGTGCCTCTGAGTCCCCCGCCCGCCCTGCTGCAGTTGAGGGCGGAGGATCTCGACCAGGGCGAGTACGGAACCGTCACATACAGTATCGAGGACCAGAGTCAACCAGGAGTGTTCCGTGTGGATCCCCAGACGGGCATCCTGTCTCCAGTGCGGCCGGTGGTAGGCGGCGCCCTCTACCAGCTGGTGGTGGCGGCCAGTGACGGTAGCCACCGCGACACCGCTCGAGTCGACATCACAGTACTGAGCGTCAACAAACACAGCCCCGTGTTCGTGCAGCCGCCGCCAGATGTCAGGCGAATCGAAATACCCGCTCACGCGGCTCAAGCTGGCTACCTCATAACGACCATCCAAGCCACGGACGAGGACCCGGGGGAGAACGGCCGAGTGACGTACCATCTCAAAGTGGACAACCAGAACGTTGGACACACCGAGGAGTTCAGTCTGGATCCTGTGACAGGACAACTCACCACCAGGGTACCTTTGGACATCGACAGCCAGGCGGAGTACCAGCTTATCGTGTCTGCGTCTGACGGCGGTACTCCCAGCCAGTTCGAGGCCCTCCGCGTGCTGCAGGTGGTGGTGGTGGACGATCAGGACCACGCGCCGGCCTTCCTTCACAGGCACTACCACTTCAGCGTCACGGAAAATTTACCCTCCGGCGTCATAGTGGGCACGGTGAAGGCCATAGACAAAGATAGAGGAGACAATGGGAAGGTGTACTATCATATACTGGAGGGGAACCAGGACGGCGCCTTCA
CACTCGACAGGACCCAGGGCATCATAAGGGCCAGCATGAGCTTTGACAGGGAAAAACAGGACGAATACTCCATGACAGTATACGCCAGCAACAATCCCATACTAGAGCATGCGGCGGCCATCTTGAATTCCATAGACAACAGCACGGAGAGCCAAGACGTCAGCGTCACCACCATCAAGATCAGAGTACTCGACGAAAATGATAACGAGCCTAAATTTGAACAGAAGGTGTACTACGCGGGTGTATCTCCAGGTGCTCGTGTTGGCGAGGTGATAACGTCACTGGTGGCTCGTGACCCTGACCTGGGCGTGAACGGTTCCCTGCTCCTGGCGGTGGCAGCCTCTCACCTGTATAAGTTCGGCTCCACCACCTCTAGTGGCAGCGTTGTGCCTTCACCTTTCAATATTACACAGGACGGTGTATTGACGACGGCCACCCTGATGTCCGACTACCGCCAGGACAGGTTCGTGCTGGACGTCATCGCCCAGGAGTTGGCGCCCCCTCACCGACAGGACACCGCGCAGGTCTATATCTGGATCATCGATCGTTCGTCGGTCCTCCGGCTGGTGGTGTCTCGTCCGTGTTCCTCAGCGGGCGCGCCTCACGCTCGCTTGTCGTCGGCGGCGCGGGCCCTGCTGGTCCCGGGGCGGAGGGCCGCGGTACACACCGCCGGGGACGGCAGATGGTGTGAGCTCCAACTGCACGCCGTGGACCCCGTCACCTTCCAGGTGTTGGATGTCCGGCGTGTTCTGGAGGACATCGACGCCCAGTACGACGCGCTCCGCCAGCTCTACCAGGAGTATGGAGTAGAGACGCTCACAGCCGCCGCATCAGGAGATAAAGCTCCTGAGAGCTTCGATCCAGCTCTTGCAGCACTTATAGCGCTTCTGATAGTGCTGTTCACTGGCATCGTCACATTCATCGTGGTCTGCGCCTGTCTTAAACACTGGGTGGTGCCGCCGCCGTCACTACAGTCCAGTAAGGGAGATAGTCTGGCGCGGAGGAGGATCCTGGAGGAACTCAGCACCACCGAGAACCCGCTGTGGCTGGAAACCAAGCTCAGGCCGTACGAGGAACAGGAGCTGACCATGAACGTGGACAACACGTACGCCACCATCGAGGGAGGCCGCTCCAGGTGTGAAGACTACGCCACGCTGACACCAGACCTGCATCAGCTGGAACACGCCACCACAGCGCTGGAAGCAGCTCTAGGATTCCAGGGCAGTACGTTCAAGCCTGAGAGAGAGACTCCCGAGCCGCCTCCTCGACCCTGA

Protein sequence:

>DPOGS204438-PA
MAAGVLDSSDLLTIKYGLVGTDKFAVNSDTGNVTLIQPLDREKEDTIKFLVSLENSDPGSGQTLVQTQSVTVIIVDENDNPPLFKNSPYEVNVAEDEVVGTTVLSNITVEDTDSVGEALEVGCVANEQWPEACDLFEVVSLHSTANEFTGALVLKRSLDYNEKQFYQFQLHATDGTLNSSSPVEVKVVDVQNTPPVFSGALSAALPEDAPVGSLALVIRAKDGDRAQPRDVQLELLTNPMDFFVLDSKTGELRTAKPLDREALADPNSPLNITVKATELVDGIPVESPLSVTRAAVAITIRDVNDEPPRFNRREYSVELLESLARGTPLPHLDMVVTDTDLGMNSVFSLRLADNLGAFVVEPSVATGSATVTLRLNSTLDYEDPNQRKFILEVIAEEVHTSPALSSKASVTVSVKDVNDNSPEFADDSYSAGVREDAAPGTRLAAIKATDRDTGRFGTEGIVYQLSGNGAELFSVEPRSGVISVAECASPGVGPCLDYETRKEYFLQYKATDDDGAGQSSVVSVQVSVRDSNDNPPVFLTPVYRASIDEDATKFEPELQVQARDLDPTSDVRYSIVSPHNPPFWVEPSSGRISVLPGKLLDAADNKYILTVMASDGRFNATCRVEVTVRDVNDHPPVFATTTYDASVSEDAAIGAEVAAVQATDLDSGMNAQIRYNIQKGALDAFRIHQDTGVVSVAGGLDYDRKNTYHVLITATDMGVPSLTGTTELTVHVLNVNDKKPVFKPAVQRAGVSADAETGTLVYQLVAEDPDADEGSLRFGRGDRPVRAVDVNGQEVKDDGTIALWFLVSPSGEVFVSQTPDRSRAAVVTLPVSVTDQSAPVPQTTEGELIITIVDVNRQPPVFSQPAYVERLVEEQSPGTVLASYSATDKETPIASIVIHPPSPYFDIDNVTGEVRVASRIDYEATPSLNFTLVAYDSGVPQLSSSAAVFVEVVNINDERPEFSADLYNATVEEHSTPGTSVLNVSAVDLDAGDFGKITYSLSGESASLFTISQEGEITVAEGADIDRETTSELWFRVVATDNAPPQTKKSSSVPVHVIITDINDNPPVFSQRVYKGSVSENVPLSPPPALLQLRAEDLDQGEYGTVTYSIEDQSQPGVFRVDPQTGILSPVRPVVGGALYQLVVAASDGSHRDTARVDITVLSVNKHSPVFVQPPPDVRRIEIPAHAAQAGYLITTIQATDEDPGENGRVTYHLKVDNQNVGHTEEFSLDPVTGQLTTRVPLDIDSQAEYQLIVSASDGGTPSQFEALRVLQVVVVDDQDHAPAFLHRHYHFSVTENLPSGVIVGTVKAIDKDRGDNGKVYYHILEGNQDGAFTLDRTQGIIRASMSFDREKQDEYSMTVYASNNPILEHAAAILNSIDNSTESQDVSVTTIKIRVLDENDNEPKFEQKVYYAGVSPGARVGEVITSLVARDPDLGVNGSLLLAVAASHLYKFGSTTSSGSVVPSPFNITQDGVLTTATLMSDYRQDRFVLDVIAQELAPPHRQDTAQVYIWIIDRSSVLRLVVSRPCSSAGAPHARLSSAARALLVPGRRAAVHTAGDGRWCELQLHAVDPVTFQVLDVRRVLEDIDAQYDALRQLYQEYGVETLTAAASGDKAPESFDPALAALIALLIVLFTGIVTFIVVCACLKHWVVPPPSLQSSKGDSLARRRILEELSTTENPLWLETKLRPYEEQELTMNVDNTYATIEGGRSRCEDYATLTPDLHQLEHATTALEAALGFQGSTFKPERETPEPPPRP-