Monarch geneset OGS2.0

DPOGS202805
TranscriptDPOGS202805-TA1740 bp
ProteinDPOGS202805-PA579 aa
Genomic positionDPSCF300018 - 403737-405815
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0059950.091.90% 
BombyxBGIBMGA010451-TA0.088.56% 
Drosophilads-PA0.069.16% 
EBI UniRef50UniRef50_D2A0X70.073.93%Dachsous n=2 Tax=Coelomata RepID=D2A0X7_TRICA
NCBI RefSeqXP_001650242.10.070.92%protocadherin [Aedes aegypti]
NCBI nr blastpgi|2700063620.073.93%dachsous [Tribolium castaneum]
NCBI nr blastxgi|2700063620.073.50%dachsous [Tribolium castaneum]
Group
Gene OntologyGO:00160202.6e-40membrane
GO:00055092.6e-40calcium ion binding
GO:00071565e-40homophilic cell adhesion
KEGG pathway 
InterPro domain[435-539] IPR0159192.6e-40Cadherin-like
[439-544] IPR0021265e-40Cadherin
Orthology groupMCL10034 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202805-TA
ATGTGGCGCGCTATAATATGTATTCTGATCGCTGCGGCGGCGGCAGAGCACGTCCGAGAGTTTAGTGTGAGGGAAAACGCTCGCGCCGGGACCTTCGTCGGTCACCTCGGAGACGAGCTTCCGGACGCCGCCCCACCGTATACGATAGTCCCTGTACTAGGCAGCGCCGTCAATGATGACCTCAAGATTGATGCCCACACGGGAGAAATAAGGACTAGAGTGCCCTTAGACAGAGAGAACAGAGACCATTACGCTCTCGTCGCGATACCGGCTAGCGGTGATAACATCCGTGTTGTTGTACGAGTGTCCGATGAAAATGATAACTCACCCACGTTCCCCACATCGGTTATGAACGTGGAATTTTCGGAGAATACTCCTAGGGATGTAAAGAGGAAGTTAAATCCAGCCAAAGACCAGGATTTAGGAGTGTTCAATACGCAGAGATACAATATTGTATCCGGAAATACCGATAATGCGTTCAAACTGTCTTCACACAGAGAAAGGGATGGCGTCTTATATTTGGATCTCCAAATTAATGGTTTTCTTGACAGAGAAACGACAGACCACTATGAGTTAGTGATAGAAGCTCTAGATGGAGGCACTCCACCTCTTAGAGGGACGATGACTGTAAATATAACAATTTTAGATGTAAATGATAATCCTCCTGTGTTCGCTGAAAGTGCATATTCAGCGATGATACCAGAAAACGCAACAGTCGGTACCACAGTACTGAAAGTGTTTGCGATAGACTCAGACGAAGGTGAAAACGGTGTGATAGAATACTCAATAAATAGGAGACAAAGTGATAGAGACAATATGTTCAAAATAAATCCAGATACAGGTGAAATTATTGTAAATAAACTCTTAGATTTCGAGACCAAAGAACTTCACGAGTTAGTCGTCGTCGCTAGAGATAAGGGTGCTCAGCCTCTAGAGACCACTGCGTTCGTGTCAATACGTGTCACCGACGTTAATGATAATCAGCCCACAATAGATGTTATCTTCTTAAGTGACGACGCCACACCTAAGATATCAGAGTCCGCTCAGCTGGACGAGTTCGTAGCAAGAATATCTGTTCACGATCCGGATTCTAAAACCGAATACGCCAATGTCAATGTTACTTTGAACGGTGGCGACGGACACTTCGATTTACGGACGCATGATAATATAATTTATTTAGTTGTGGTAGCTTTGCCTCTCGATAGAGAATCTCAGTCCGCTTACACTTTAAACGTAGTAGCCACTGACAAAGGTTCGCCTCCGCTACACGCGTCTCGTATCATAACTCTCTTGGTGACGGATATAAATGATAATCCCCCAACTTTCCTTGAGAGCGAGTATAAAGCAAATGTCCCAGAAGCAGCAGCACCCGGTACTCCTGTGCTGCAGGTGTCCGCTTTCGATGCTGATGAAGGAGAGAATTCCGAAATAAGGTATTCCATACTCCCATCACCGCAATCCGATTGGTTTTCTATTGATGAGCGTTCTGGTCTGGTTACGACACGTGTTCGCGTTGATTGTGAAACAAATCCGATGCCTAAGTTAACGGTAGTGGCGAGTGATCGCGGCAATCCTCCTTTGTCGTCCACCGCGACCTTGTTAGTGACAGTGCTAGATGTAAACGACAATGAGCCGATCTTCGATCAGTCCTTCTACAACGTGACCGTTCCAGAGAATGAAGCCGTCGGCAGCTGTATTCTAAAGAACGTAGAACCTTATGGAGTGACTACTGACTACTGA

Protein sequence:

>DPOGS202805-PA
MWRAIICILIAAAAAEHVREFSVRENARAGTFVGHLGDELPDAAPPYTIVPVLGSAVNDDLKIDAHTGEIRTRVPLDRENRDHYALVAIPASGDNIRVVVRVSDENDNSPTFPTSVMNVEFSENTPRDVKRKLNPAKDQDLGVFNTQRYNIVSGNTDNAFKLSSHRERDGVLYLDLQINGFLDRETTDHYELVIEALDGGTPPLRGTMTVNITILDVNDNPPVFAESAYSAMIPENATVGTTVLKVFAIDSDEGENGVIEYSINRRQSDRDNMFKINPDTGEIIVNKLLDFETKELHELVVVARDKGAQPLETTAFVSIRVTDVNDNQPTIDVIFLSDDATPKISESAQLDEFVARISVHDPDSKTEYANVNVTLNGGDGHFDLRTHDNIIYLVVVALPLDRESQSAYTLNVVATDKGSPPLHASRIITLLVTDINDNPPTFLESEYKANVPEAAAPGTPVLQVSAFDADEGENSEIRYSILPSPQSDWFSIDERSGLVTTRVRVDCETNPMPKLTVVASDRGNPPLSSTATLLVTVLDVNDNEPIFDQSFYNVTVPENEAVGSCILKNVEPYGVTTDY-