Monarch geneset OGS2.0

DPOGS211456
TranscriptDPOGS211456-TA1809 bp
ProteinDPOGS211456-PA602 aa
Genomic positionDPSCF300223 + 135541-139800
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0138234e-6554.33% 
BombyxBGIBMGA002162-TA1e-16557.95% 
DrosophilaCad96Cb-PB7e-12745.66% 
EBI UniRef50UniRef50_Q9VBV61e-12445.66%Cad96Cb, isoform B n=17 Tax=Drosophila RepID=Q9VBV6_DROME
NCBI RefSeqNP_001163727.12e-12645.64%Cad96Cb, isoform C [Drosophila melanogaster]
NCBI nr blastpgi|2813625504e-12545.64%Cad96Cb, isoform C [Drosophila melanogaster]
NCBI nr blastxgi|2813625501e-12345.32%Cad96Cb, isoform C [Drosophila melanogaster]
Group
Gene OntologyGO:00160201.8e-28membrane
GO:00055091.8e-28calcium ion binding
GO:00071563.2e-25homophilic cell adhesion
KEGG pathway 
InterPro domain[124-230] IPR0159191.8e-28Cadherin-like
[128-235] IPR0021263.2e-25Cadherin
Orthology groupMCL16152 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211456-TA
ATGCGTCCTTCCCCGCAGCCTCTGAATGTTCCCATCATCATTATAACTATCCAAGATAGTCGATGTTATTTGATGAACGGTGGCGCCGTGGAGAGTTTCTTCATCAGCGAGGACACGCCGGTCGGCAGTATTATTGGAACACTATCAGTGAACGGTGACCCCGGTGATGAGGGCGACATCAGTCTTCGGGTCCAGGAGCGCAAGCCGGCCGTGGCGCTAGTCCCGGGGTCCAAGAACGTCACCCTGACCCGCGCCCTCGACCGAGAGGAGAAGACCGGACCCTCCAGCGTCTACGTCAACGTGCGATGTGACAGACGACACACCACAGACCCGAGCTTCGTGATCCCGGTGTCGGTCCGTGTGTGGGACGTCAACGACAACGCTCCGTCGTGGTCGGGCGCGCCGTACAGGGCGCGCGTGTCGGAGCTGGCCGCCGTGGGCACACGCCTGCTGACGGCGCGGGCGCACGACCCCGACCAGCCCGGGCCGCACGCCACCGTCAGATACTCCGTGCTGCCCGGACCCGCCGCGGAGTACGTGGGGTTCCCGAGCGAGCTGGACGGCGCGCTGGTGGTGAGGAAGCCGCTGGACTACGAGACCGCCACCAACCTCACGGTGACTCTGAGGGCGCAGGACGGCGGCTCCCCGCCCCGCCACAACGACACCACCCTCACCATCGTCGTCATGGACGCCGACGACCAGAACCCGACCTTCACACACGATCACTACAGCGCGGTCATACCCGAGGACGCGCGGGAGGGTACAATCCTCGAGACGTTCCCGGGGCCCGTGGCGGCGCACGACCAGGATCGGGGGATCAACGCTCCCGTCACGTACAGCGTGCGAGCCTCCCCCTCCCCCGCCGACAACAACACCGCCCTGGTCCGACTGCACAAGGACAGCGGCGAGCTGAGCGTCACCGGCGACCTGCTGCGGGCCAGCCTCCCCACCACCATCGTCATACAGGCCACTCAGGTAGACAACCCGGACCGCTACGCCCTGGCCACCCTGTCCGTGTCCCGTGCCGGCTCCGGGTCCGTTTCGTTCCCGCGGCGCCTCTACTCCGTGTCGGTGCGCGAGGACTCCGCCCCCGGGAGCGTGCTGCTGTCGCTGGAGGCCCGGGGCCAGGGGCCGCTGCAGTACTTCGTATCGGACCGCAGCTTCCTGCAGCAGTTCGCCATCAGCGAGGCAGGCGAGCTGCTGCTGCGACGAGCGCTGGACCGGCTCGTCAGACACTACGACTACCAGGTCATGGTCACCGACGGACGGACGAACGACACGGCTCACATCAACATATCGGTGGAGGCGGTGAACGAATGGGAGCCGCGGTTCAAGCACGCCCAGTACTCGTTCGTGGTGGAGCGGCCGACGGGCGAGGGCCGCGTGCGGGTGGGGCGCCTGCACGTGCACGACGGCGACCCCGAGGACCGCGTGTCGGTGAAGGTGGCGGGCCCGGACGCCGCCGCCGTCACCGTGGACGACGCCGGGGACGTGTTCGTGTCCGCGCCCGCCCTCAGGAACATGCGCTCCGACACGCTGCACCTCGTCGCCACCGCCGTCGACTCCGGCACTCCGCCCAGACAGGGTGCGCTTCTCGTGGGTTGTAACGCGTGTCTCTCCGCCAGTCGTCCGTGCCGCTGTCGGTCCGCGTGTCTCCCCCCGCCCCCACGTCTCCCCCCGGCGCCGCCTCCTCCTGTCCAGTGTGGGCACGTCCGTGTGTGTGTCTGTGTCGTCGCTGGTACTGCTGGCCGTGCTGCTGCTCTTCCTTCACAGGCTCAGACTCTCTTTGGACAAAAACCTCCAACGTGA

Protein sequence:

>DPOGS211456-PA
MRPSPQPLNVPIIIITIQDSRCYLMNGGAVESFFISEDTPVGSIIGTLSVNGDPGDEGDISLRVQERKPAVALVPGSKNVTLTRALDREEKTGPSSVYVNVRCDRRHTTDPSFVIPVSVRVWDVNDNAPSWSGAPYRARVSELAAVGTRLLTARAHDPDQPGPHATVRYSVLPGPAAEYVGFPSELDGALVVRKPLDYETATNLTVTLRAQDGGSPPRHNDTTLTIVVMDADDQNPTFTHDHYSAVIPEDAREGTILETFPGPVAAHDQDRGINAPVTYSVRASPSPADNNTALVRLHKDSGELSVTGDLLRASLPTTIVIQATQVDNPDRYALATLSVSRAGSGSVSFPRRLYSVSVREDSAPGSVLLSLEARGQGPLQYFVSDRSFLQQFAISEAGELLLRRALDRLVRHYDYQVMVTDGRTNDTAHINISVEAVNEWEPRFKHAQYSFVVERPTGEGRVRVGRLHVHDGDPEDRVSVKVAGPDAAAVTVDDAGDVFVSAPALRNMRSDTLHLVATAVDSGTPPRQGALLVGCNACLSASRPCRCRSACLPPPPRLPPAPPPPVQCGHVRVCVCVVAGTAGRAAALPSQAQTLFGQKPPT-