Monarch geneset OGS2.0

DPOGS216145
TranscriptDPOGS216145-TA4383 bp
ProteinDPOGS216145-PA1460 aa
Genomic positionDPSCF300214 - 153482-214596
RNAseq coverage571x (Rank: top 22%)
Annotation
HeliconiusHMEL0084740.089.37% 
BombyxBGIBMGA010266-TA0.091.61% 
Drosophilashg-PA0.053.30% 
EBI UniRef50UniRef50_F4WLD00.056.99%DE-cadherin n=14 Tax=Neoptera RepID=F4WLD0_ACREC
NCBI RefSeqXP_394063.30.058.94%PREDICTED: similar to DE-cadherin precursor (Protein shotgun) [Apis mellifera]
NCBI nr blastpgi|3504040650.058.50%PREDICTED: DE-cadherin-like [Bombus impatiens]
NCBI nr blastxgi|3800161550.058.97%PREDICTED: LOW QUALITY PROTEIN: DE-cadherin-like [Apis florea]
Group
Gene OntologyGO:00160203.7e-41membrane
GO:00071563.7e-41homophilic cell adhesion
GO:00055093.7e-41calcium ion binding
KEGG pathwaydmo:Dmoj_GI222904e-134 
 K10414 (DYNC2H, DNCH2)maps-> Phagosome
    Vasopressin-regulated water reabsorption
InterPro domain[1319-1448] IPR0002333.7e-41Cadherin, cytoplasmic domain
[1073-1277] IPR0089857.8e-39Concanavalin A-like lectin/glucanase
[1072-1275] IPR0133205.5e-35Concanavalin A-like lectin/glucanase, subgroup
[149-254] IPR0159193.7e-29Cadherin-like
[153-263] IPR0021266.7e-28Cadherin
[1106-1257] IPR0017912e-25Laminin G domain
[1117-1256] IPR0126803.4e-22Laminin G, subdomain 2
Orthology groupMCL10636 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216145-TA
ATGACTCAAGGGTTGCGCCTGCGCCGATTGAGAGATAAGGTCTACACGAGTCGCTCCGTAGAAGTACGAAGACCGGCCGAGTTTACCCCTGCGTCTCCGAAGCTGGCTGTGAAGGACAATCACAAGCCTCTCTTCACTGAGTGCCTAAACTATAGGCCCACGCTGAAAGAAGAGCAGCCTGTAGGAACATATGTATTCACGGTTCATGCCGAAGACCGGGATCCACCCGAAATGGGTGGTACGGTTACTTACAAATTCGTCTCAACTCCTGGTGAGAAGGAGAGATTTCATGTTGACCCTGAGACCGGAAGAATTACTACAGCTGATATATTTGACCACGATGAACCGTCACGTGAGAAAGAAGCCTATATAACAGTGCGGGCAAGTGATAATGGCCAGCCCCAATTAGACGACGCGTGTACTATTAAAATACTTATAGAAGACATCAATGACAACCAGCCGGTCTTCGATAAAGTGTCGTATTCGGAGTCAGTGCCCCAAGATCTGCCGAGCGGTCGTGAGGTGATGCGGATATCTGCCACGGATATAGATGATGGCAATAACTCGATCGTACATTACAGTCTCGATTCAAATTCGCCGGATCAGGCCTATTTCTACATCGATCCTGATAACGGGGTCATATTTCTGAATAAGACAATTGATAAAGTGCCAGGCTACAAATTCAAGTTGAGCGCGATAGTAAAAGACATGGGTGATCCGCCGCAACAAAGTTCGATTACTTTAGACATCCAAGTCGTTGAATCTAACAAGAAAAGCCCGTCTTTCATCGAAGTGCCCGATGAACCGATAAGATTAAAAGAAAATTATGCTGACTTCAACACCCCGATCGCTACTGTGAGAGCAGTATCAAACATTCCGGAAGAAAAGAAATTACAGTTTGAATTGGTTATGGGTCAGACGGAACAAACGAATAAATGGCACACGTTTGTCTTGGAACCAGAGGCGGATTCCGCATACATAAAACTCGGTAACCATTTGGATTACGAGAAAATAACAGATTACACATTGACTGTTAGAATTCAAAATAACTATAAATTAGCAGCTGAAACTATAATTCAAATAGAAATTGAGGACGTCAATGATAATATTCCTATATTCAGTGAGATAAGATCTGGCAGCGTGTTAGAAAACGAACCGCCCGGTACACCTGTGATGCAGGTTAGGGCTTTCGATGCTGATGGGACTTCAGCCAATAATCAAGTCACTTATGAATTAGGAGACCCATCAGACCCATTCGCTATAGACAGCATCACGGGAAACATAACAACATTAAAAATGTTCGACAGAGAAGAGAGGAGTTTCTACAACATAAAAATTATAGCCACTGACAATTCCGAATCAGCTCTCATCCCTGGCAAGCATAACGCCGGCCAACAGGTTTTTCTTATAGAGATAGCTGACAAAAACGACAATCCGCCACATTTCACACAGGATACGTATGTAGCCGAGTCGATAGCTGAGAACGCTAACATTAACGAGCTCGTCACACAGGTTACCGCGCTGGATATAGATACAGCGTCCGTGGTAACATACAGCATAGTGGCCGGTAACACATACGACGCGTTTGTGATACGTAATTTCACTGGTGAAATTCGGGTCAACAATGAACTCGACTACGAAAATATAACCAGCTACAGCTTAGACGTGAGAGCTTTCGACGGCTTGTATGAAGACTACGCGAAGGTCCTCATAAAGATTGAAAACCTCAACGACAATCCTCCGGTCTTCTTGGCCAATTACACGAAGACCATCGAGGAAGAAAAGTTATATGAAGGGTGCATTGTAAAGGTTGAAGCCTACGATCCAGATTTAGACCGAAACGAACCTCAGAACATAGTCTATTCACTGGTGAAGCACGAACAAAAGGAGTTCCTCCAAATAGACAATGACGGTTGTCTGCGACTCACGAAACCCTTGGATAGAGATCAGCCTTCAGGTTTCACGAGATGGCAGATCCTTATCATGGCCGCCGACCACGGGGGTAGACTCGGTTCTGATAGTCTGCGATCGACCACAGAGGTCATATTGGAACTGACTGACATCAATGATAACGCACCTTTCTTGACTAATACTCAACCAATCATATGGTACGAGAACGAACCGCCTGGCCAAGTTGTTATTCTAACAGCTAAAGATTACGACTCAGCTGAAAATGGACCACCTTTTACATTCGCTCTTAACGAGTCCGCCAGCTTCGATATACGATCCAAATTCCACATACAAGGAAACATACTGTCTACGTTGGTTTCCTTCGACCGTGAACAGCGCAAGGAGTACAAAATCCCAGTAGCTATCACAGATTCTGGAACACCAAGATTAACGGGCGTTTCCATTCTTCATATTGTCATCGGAGATAGAAACGATAATTCCATGGAACCTGGCCACAGTGACATCTTTGTCTACAACTACAAGGGGGAAGCACCAGATACGGAAATCGGCCGCGTGTTTGTAAATGACCCTGACGATTGGGACCTACCCGACAAACGGTTCATGTGGCTGCCCTCGTACGAGCAAAGGAGTCCTTACTTCGACGTCCATAGTAATACTGGCATGATCACTATGAAGGAGGGAACCCCTAACGGGACTTACATGCTCCGATTCAATGTGACGGAAGTTAACGAGCCGTTAGTGCCCTTCCACTGGGTGGAGGCTACAGTGAATGTGACCATAAAAGAAATACCAGAAGAGGCTGTAGACAAATCCGGCTCAATCAGGTTCCTGAACATCACAGCTGAGGAGTTTATCATACCGGAGGCTGATGGCACTAGTAAGAAAGACAAACTTCATCGTCGTTTAGCAGAACTATACAACAAATCTATGGACAATGTTGATGTGTTCACCGTTTTTACCAAACTGACCGTTAAAGACGCCTTCCTGGATGTAAGGTTCTCAGCCCACGGGTCCCCGTACTTCGAAGCCGAGAAATTGGACAGTATGGTTGTGGGGATACAAGAAAAGCTGGAAGATGAGCTCGGAGTCAGGATATATATGGTGAAAATTGACGAATGCCTGATAGAGAAGGAACAGTGCGAAGATTCCTGTCGCAATGTCCTCATCAAAAATAACGTACCGCTCTCCGTTTATACGAACACCACCAGCTTCGTCGGGGTATCAGCGAGAGTGGAATCTGAATGCACTTGTGACGTGGAAGAACCGCTCATTTGTTTGAACGGAGGGACTCCATTCGCTGATAAATGCGAATGTCCTGAAGGCTGGAATGGTCCGCATTGCGAGCAGACCAGTATCGGTTTCCATGGCGACGGCTGGGCCATGTACCCCTCCCCACCAGCCTGCCATGAGGGTCACGTGACCCTCACAGTGACGTCACACACTAGCAACGCTCTGGTCTTCTATCTTGGACCGTTGAAATTCAACCCGCTTTTAGATGTCCAAGATTTCATGTCCTTAGAACTGGTAGATGGTTATCCGATACTATTAGTGAACTACGGTTCTGGTACGACACGTCTTAATAACAGTGTTGTACATGTAGCTGATGGCAAACCACATTTGATAGAGATAGTACTCATGAGGAGTTCCATTGAAATGTTTGTGGATAGATGCAAGCTATCTACGTGCATGAGCCTGGCAGCCCCTACAGGACCGAGAGAAATACTGAATGTCAACGGCCCCCTCCAGTTGGGTGGGGCGAGCGTAGATTTACAAAGTCTAGCTAGATCGTTCGGATGGCGTTACGTGCCCACAAATCAGCACTTTATGGGCTGCATCAGCAATTTTACGTACAACGACTACATGTACAATTTGGGAAAGCCATCCGTTGAACGGAACGCAGATCCCGGATGTCAGAAGAGTGTGTTCACAGCCGTTACATTTGGCATCGACACCAACTTCCTGGTCGCTATACTCGTGTGCATCGCGATATTAATAATTCTCCTCTTGGCTGTGGTCGTTCATCGGAAGCGGGCTGATGCATGGGCGGAGAAAGAGTTAGATGATATAAGAGAGAACATCATCGCTTATGAAGATGAAGGTGGTGGTGAAGGAGACGCTGGCTACGACTTACACGTTCTAAGGCAGATGTACGATGGCCCTCCACCAGATAATGAACCAGCGTTCATGCATGTACCAGTGGTTGGCGCAGCTCCTGATATATCAGGATTCTTGGACGACAAAAAATCCGTCCTGGATCGGGATCCAGATATCAATCCATACGACGACGTGCGTCACTACGCTTATGAAGGAGACGGGAACACTAGTGGATCGCTATCCTCGTTGGCAAGCTGTACCGACGACGAAGATCTCAAATTCAACTACCTATCTACATTCGGGCCTCGTTTCCGTAAGTTGGCGGACATGTATGGAGACGATGGTGAAGACCGCCAGCACGAGGAGTCCTGGTGCTAG

Protein sequence:

>DPOGS216145-PA
MTQGLRLRRLRDKVYTSRSVEVRRPAEFTPASPKLAVKDNHKPLFTECLNYRPTLKEEQPVGTYVFTVHAEDRDPPEMGGTVTYKFVSTPGEKERFHVDPETGRITTADIFDHDEPSREKEAYITVRASDNGQPQLDDACTIKILIEDINDNQPVFDKVSYSESVPQDLPSGREVMRISATDIDDGNNSIVHYSLDSNSPDQAYFYIDPDNGVIFLNKTIDKVPGYKFKLSAIVKDMGDPPQQSSITLDIQVVESNKKSPSFIEVPDEPIRLKENYADFNTPIATVRAVSNIPEEKKLQFELVMGQTEQTNKWHTFVLEPEADSAYIKLGNHLDYEKITDYTLTVRIQNNYKLAAETIIQIEIEDVNDNIPIFSEIRSGSVLENEPPGTPVMQVRAFDADGTSANNQVTYELGDPSDPFAIDSITGNITTLKMFDREERSFYNIKIIATDNSESALIPGKHNAGQQVFLIEIADKNDNPPHFTQDTYVAESIAENANINELVTQVTALDIDTASVVTYSIVAGNTYDAFVIRNFTGEIRVNNELDYENITSYSLDVRAFDGLYEDYAKVLIKIENLNDNPPVFLANYTKTIEEEKLYEGCIVKVEAYDPDLDRNEPQNIVYSLVKHEQKEFLQIDNDGCLRLTKPLDRDQPSGFTRWQILIMAADHGGRLGSDSLRSTTEVILELTDINDNAPFLTNTQPIIWYENEPPGQVVILTAKDYDSAENGPPFTFALNESASFDIRSKFHIQGNILSTLVSFDREQRKEYKIPVAITDSGTPRLTGVSILHIVIGDRNDNSMEPGHSDIFVYNYKGEAPDTEIGRVFVNDPDDWDLPDKRFMWLPSYEQRSPYFDVHSNTGMITMKEGTPNGTYMLRFNVTEVNEPLVPFHWVEATVNVTIKEIPEEAVDKSGSIRFLNITAEEFIIPEADGTSKKDKLHRRLAELYNKSMDNVDVFTVFTKLTVKDAFLDVRFSAHGSPYFEAEKLDSMVVGIQEKLEDELGVRIYMVKIDECLIEKEQCEDSCRNVLIKNNVPLSVYTNTTSFVGVSARVESECTCDVEEPLICLNGGTPFADKCECPEGWNGPHCEQTSIGFHGDGWAMYPSPPACHEGHVTLTVTSHTSNALVFYLGPLKFNPLLDVQDFMSLELVDGYPILLVNYGSGTTRLNNSVVHVADGKPHLIEIVLMRSSIEMFVDRCKLSTCMSLAAPTGPREILNVNGPLQLGGASVDLQSLARSFGWRYVPTNQHFMGCISNFTYNDYMYNLGKPSVERNADPGCQKSVFTAVTFGIDTNFLVAILVCIAILIILLLAVVVHRKRADAWAEKELDDIRENIIAYEDEGGGEGDAGYDLHVLRQMYDGPPPDNEPAFMHVPVVGAAPDISGFLDDKKSVLDRDPDINPYDDVRHYAYEGDGNTSGSLSSLASCTDDEDLKFNYLSTFGPRFRKLADMYGDDGEDRQHEESWC-