Monarch geneset OGS2.0

DPOGS200480
Transcript: DPOGS200480-TA | 5520 bp
Protein: DPOGS200480-PA | 1839 aa
Genomic position: DPSCF300158 | 257151-270200
RNAseq coverage: 271x (Rank: top 40%)

Annotation
Heliconius: HMEL005109 | 0.0 | 78.87%
Bombyx: BGIBMGA010415-TA | 0.0 | 54.60%
Drosophila: Cad88C-PA | 0.0 | 51.30%
EBI UniRef50: UniRef50_E2A653 | 0.0 | 57.69% | Cadherin-23 n=9 Tax=cellular organisms RepID=E2A653_CAMFO
NCBI RefSeq: XP_971786.1 | 0.0 | 54.82% | PREDICTED: similar to Cad88C CG3389-PA [Tribolium castaneum]
NCBI nr blastp: gi|307184798 | 0.0 | 57.69% | Cadherin-23 [Camponotus floridanus]
NCBI nr blastx: gi|383864739 | 0.0 | 58.25% | PREDICTED: uncharacterized protein LOC100879829 [Megachile rotundata]
Group:
Gene Ontology:
    GO:0016020 | 4e-34 | membrane
    GO:0007156 | 4e-34 | homophilic cell adhesion
    GO:0005509 | 4e-34 | calcium ion binding
KEGG pathway:
InterPro domain:
    [269-288] IPR002126 | 4e-34 | Cadherin
    [1232-1341] IPR015919 | 3.7e-31 | Cadherin-like
Orthology group: MCL15148 | Insect specific
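
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. The function names are hypothetical. It assumes the transcript is coding sequence only, ending in one stop codon, and that the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS of cds_bp bases that ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the final codon is the stop and encodes no residue


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
transcript_bp = 5520
protein_aa = 1839
locus_start, locus_end = 257151, 270200

print(expected_protein_length(transcript_bp) == protein_aa)
print(genomic_span(locus_start, locus_end))
```

Under these assumptions the 5520 bp transcript accounts for exactly 1839 residues plus a stop codon. The difference between the genomic span and the transcript length would then be sequence not represented in the transcript, such as introns.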
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200480-TA
ATGATCTGTGTAGAAGTCATATCAGCAAACAGACCACCACGGTTTCTTATAGACGGACGTTCAGAAATCGTGATCAGATTAAAAGAAGGTCCAGATACACCAGTTGGAAGTTTAATTTACCGATTAAGAGGCGTTGACTCCGATGGAGATACTTTGCGATTTGGTATAAGAGAGCAAGTCGGAAGTGATATATTAAGAATAGAAGCCATTTCGTCCAACGAAGCTAATATATATTTAGTTAAGGAATTAGATAGAGAGATTCGCGATGAATACTCATTTGTATTAACCCTGACTGATGGTCATTTGGGGGAAGTAGAGGACGTCAATGACAACGAACCTATATTTAAACCGTATCCTCCGGCTATCACTGTGAAGGAAGATGCATTACCTGGAGTATTGCTTACTGTTGAAGCAACTGATCTTGATGAGGGTGCATATGGCCAAGTTCTATACAATTTACAAGAACTAGATGGCGATGTAGACAACTTTGCCATTCAAACTGTAAACGGGAAGGGAGTTATACGACTAACAAATCGGCTAGACTACGAGCGGAAATCCCTTTACCAATTGCGGGTTCTGGCTATAGATAGAGCAAACCAGGGCCGTGTTAATACTGGAACTGCTGCTATCTTAGTAAAAGTGCAGGATGTTGAAGACCAACCTCCAGAGTTTGTGGTCGCGAGCCCAGTTACCAGAATCAGTGAAGATGCGCCGGTTGGAACGTCTGTTTTACAAGTCAGGGCAATTGATGGTGATAGGGGCATCAACAACAGAATCTCCTATAGCATTATATCTGGTGGAGAGGAACACTTTGATATTGACAGCAGTTCTGGGGTAGTGTACACCATCAGTCCTATAGACAGAGAAGACCCTAATAATAGCAATGGAGCCTACATACTAGAGATATTGGCCACAGAGGAATCTCATATGGTATCACCACTACCGAGTGCTACAACTGAGGTGACTGTCATTATAACCGATGTGAATGATGAAAAGCCCAAGTTTAAAAGCAATAGATATGTAGGAGAAATCATAGAAAATGCCCAACAAAACACACCAATAACCTTCCTACAGGATGGTGTGCCGGAAGTATTCGACTATGATCAGGGTAAAAATGGGACTTTTGAATTATACTTAGTAGGTGACAATGGTGTTTTTGATGTCACACCTTTTAAGGGTATAAATGAGGCATCATTTTTAATAAGGGTCAATGATCCATCTTTCTTAGATTACGAAAAGGTTACTGTAATGAATTTCAGCTTGGTTGCCAAAGAGATTGTTACCAAAGAACCAAAAATGAGTATAGTGCCGATAACAGTTCATATAAAGGATGAAAATGACAACTTCCCGGAGTTCACGGAGACCGTTTACACTGTATCAATACTTGAGAACTGTGCTGTTGGTACGACAGTGGCTTGGATTCAAGCAACCGATTCCGATTCAGATAGCTATGGAACTAGAGGCATCAGATACACCGGCCTGACTGGCAGTGTTGCACATTTGTTACATTTGAATCCAATATCTGGTGTGATAACAGTGAAGCAGGCCGGCGATGACAGTTTTGATAGGGAACTAGTGTCAAGACATTATGTGACTGTTGAGGCAAGGGATGATCAGGGCAAAGGAAACAGAAACACAGCACAATTAATAATAAATATAGAAGATGTGAATGACAACGCTCCAATGTTCCTCGCTAATAAATATGAGGCAAGGTTGCTGGAGAACTCCTTGGACTTTGAAAATCCGCTTGTATTGGAAGCAAGAGATTTAGATTTGAACGGAACAAAAAATAGTCACATAGAATATTCTATAGTCGGTGGTGATTACAAAAACAATTTCTCAATAGATCCAAATCTAGGTATAATAATACCAATCGGCGGGATTGACTTCGAACAAATAGCGGGTGACAACACTAATATAAGGCCCATACATTTGACGGTCCAAGCTCGTGACTTCGGCTCCCCTCCATTATCGTCCACAGTTCCCGTGACGGTGTATGT
GGCGGACGTGAACGACCACGCCCCCTCGTTCACACAGACGGTGTACAAGCGGGCCATCCCGGAGGATATGCCGGGGGGAACTAGTGTTATAGAGGTCAAAGCCCGCGATTCAGACGGCTCGTCTCCCAACAATCGTGTGGTTTATCGTATACAACGCGGCGCCAGCGACAAGTTTGTTATAGACTCCTTCTCGGGACTGATCTCCGTGGCTGCCGGAGCCAATCTGGACCCGGACCGGACCGAGCCCACCACTAACAGATATGTACTCACTGTGGTAGCTTTAGATGGTGGTATAGGAGACCAACAGCTGAGCGCGTCTGTCATTGTAAATATAACGATAGTGGATGTTAACAACAAACCACCAGTATTGGTTGAACCCGGACTCGTACACGTCATGGAAAACACACAGGTGGGTACAGTAATATATCGGGCCCACGCATATGATCTCGACGAACAACCAGTGTTGAGATTTTCCATAGACAAGGAATTAAGTTCCGGACGAAATGAGGATGGCGTCCCGGTGACTATCAACGACTACGACTATATAGGGATATGGGATCTGAACACTATCGACGGCACCCTGAGGATTGTTAGATCTTTGGACAGAGAGAAAGTGGAAATTATAAAACTGGTGATCACAGTCGAGGATATGGCGGCCATGAGTAACGGACCAGTGCAGAGAGCTTCCGCTATACTGACGGTTATAGTACAAGACGAGAATGACAACAACCCCAAATACAGGAAACCATTCTACAAGACCTCCATCACAGAGAACTCTAAGAACGGCGTCCATATTGAAACTGTCATAGCGGACGACGCCGACAGGAACAGAACTATGACGTATATGTTGGAAGGTCCGGAGGAGATATTAGGGCTGGTACACATGGACAGTTCTACGGGGGAAGTGGTGGTAGCCAACAGAATAGACCACGAGCTACAGCCTTGGATCAATGTGACCGTCAAGGCTACGGACAGCGGAACGCCGCCCAGATCCGCGGCGGTGGAGCTGGTGATACAGGTATTGGATGAAAACGACAACAATCCTATATTCGAGCCTTCGTCTTTCGAATACAGAGTAAGAGAGGACATAGAGCCCGGCAGTACTGTAGCCGATATTGTAGCGAGAGATGCTGACTCGGGCGAATATGGAAAGATTACGTACTTACTGGACAGAGTCTCTACACAGGGCAAGTTCCTAATAAACCCGGAAACGGGGGCGCTGAAGGTGTCCGATTATCTGGATCGAGAGACGCAAGCGAGTTACAACCTGGTGGTAGAGGCCTGGGATAACTATCAGTTCGGATACCTCAGCGGGGAGAGTCGGAACGCGTTCAAACAGATCGTTATCCACGTAGAGGACGTGAACGACAACCCGCCCGTCCTCACCCTGCCTACAGGCTGCACGACTATATCCGAGTTCCACAACCACCGCGAGCCGATACTGTCGGTGACGGCGAGCGACGCAGACGACACCGCCACGCCCAACGGACGAGTACAATTCCATCTGGTGGGGGGAAAGGGACACGATCTGTTCCGTTTCGAGCAAGTCGGCGGCGACGCTAACACCGGCCGTCTGTACGCGAAGCAACCGCTGAAAGATCGCTTCGGCAACTACACCTTCATCATAGAGGCCAGGGATCTCGGCCTACCCTCCAACGTCGTCAGAGACGAATTGAATCTATGTGTCACAGATTACAACGATCACGCGCCAGTGTTCGTACATCCGCCGCAGAATGTCACCATTAAAGTGCCGGAAAATGCAACAATAGGCACAACAGTGGTCGAAGTGAAGGCTATAGACGCTGACATCGGGCCTAATGGCGCAGTCCGCTATAGACTGCGACACGACGCTCGCGGCTCATACAGGACCTTCACTATACATCCCACCAGCGGAGCCCTACGGACCACCGGAGCGCTAGACAGAGACAAACAAACAACCTACCAGCTGAGAATAGAAGCCTACGACCTCGGCCTGCCAACTCCTCTCAGCTCCGACTTGG
ATCTTACAATTTACGTCCAAAACGTGGACAACTATAAGCCGAGGTTCCCCGACAGACGGCTGCATTTAAATATAACGGAAAATGAAGAATTCTCTACGACGCTACCGAGAGTGCTGGAGAGAGACGAGATCGACCGAGGCGACGAGCCGATGCTGCCCGTGTGTTACTACATCGTGGACGGAAATGATGAAGGCTTGTTTGAACTCGACAGGGCTACGTACAGATTCAAATCAAAACCGTTAGACAGAGAACAAAAAGATCAGTACTTGATAACGGTTCTAGCAACAGAGGACTGTGTACGGCCAGACTATGAAGCTAGTGAGGACTCAAGTACACTACAGATATACATTAATATACAGGATGTGAACGATAACGCGCCTCAGTTCATCAGCAAAATGTTCACTGGTGGGATCACAACAGAAGCGGACTTTGGAATTGAGTTTATGCACGTCAAGGCCATAGACCAAGACGATGGTGTGAACGCTAAGATAAACTATTATCTTCTTGGTGAGGTCAAGGAGACCTTAACTGAGGGTCTGGAAAATTTGGCGGTGTCACCTTTCCTGGTAGACGTCGACACGGGAGCTGTGACCTTGAACTTTGACCCTCAGAAGGGAATGAAGGGCTACTTCGACTTCAAGGTATTAGCGAACGACACAGACGGTTTGCAGGACGAGGCTCACGTCTTCATATATCTGCTCCGCGAGGATCAAAGGGTGCGTTTCGTTTTACGTTCCCATCCTTCGGAGATACGTGATAAAATAAACATATTCAGAGAGAGGTTGGCGCGTGTAACGGAATCCGTGGTCAATATAGACGACTTGAGGGTGCATGAAAATAAAGACGGGTCCGTAGACAACACCAAGACGGATCTGTACCTGCACCTGGTGAACAGCGGCGACCACTCGGTGCTGGAAGTGGAGCAAGTCCTCAAGATAGTGGACAAGAACATCGAACACCTGGACGACCTGTTCAAAGAGTTCAACGTCCTGGACACCCAGCCGGCGGAGTACCAGCCCTTGACGGCTGACACGCTGTCGTCCCAGCAGGCCGTGTTCTGGCTGGTGTGGACCGCCGGCATACTGAGCGCGTTGCTGGCGGTCACCGTCATCATGTGTCTCTCGCAGAGAGCTGACTTCACCAGGAGACTGAGGGCCGCCACCACCGCCTACTCTTCCCAAACGTCGGGCGAAGCTGACATGACGATACGAAGTTCGGGAGGTAGAGTCCCGAATACAAACAAACACAGCACCAAGGGCTCCAACCCTATATGGCTACACGCGTACGAAAACGACTGGTACAAGACAGACGATCAAATGAGTCACTCGGAACGAGACTCGCTGGACGAGAACGCCGTCGACCAGGACCTCAGCAACGACAAGCCGTACTTCATAACGACCTACCCGAGTGGGGACCAGGACCTGGACAACCACAACGACCTGGACCACCGAGGATACGACTTCTACCAGCAGCTGGAACAGGTGAAGAACGCCAAGAACATGGAGACCACGGAACTCTAA

Protein sequence:

>DPOGS200480-PA
MICVEVISANRPPRFLIDGRSEIVIRLKEGPDTPVGSLIYRLRGVDSDGDTLRFGIREQVGSDILRIEAISSNEANIYLVKELDREIRDEYSFVLTLTDGHLGEVEDVNDNEPIFKPYPPAITVKEDALPGVLLTVEATDLDEGAYGQVLYNLQELDGDVDNFAIQTVNGKGVIRLTNRLDYERKSLYQLRVLAIDRANQGRVNTGTAAILVKVQDVEDQPPEFVVASPVTRISEDAPVGTSVLQVRAIDGDRGINNRISYSIISGGEEHFDIDSSSGVVYTISPIDREDPNNSNGAYILEILATEESHMVSPLPSATTEVTVIITDVNDEKPKFKSNRYVGEIIENAQQNTPITFLQDGVPEVFDYDQGKNGTFELYLVGDNGVFDVTPFKGINEASFLIRVNDPSFLDYEKVTVMNFSLVAKEIVTKEPKMSIVPITVHIKDENDNFPEFTETVYTVSILENCAVGTTVAWIQATDSDSDSYGTRGIRYTGLTGSVAHLLHLNPISGVITVKQAGDDSFDRELVSRHYVTVEARDDQGKGNRNTAQLIINIEDVNDNAPMFLANKYEARLLENSLDFENPLVLEARDLDLNGTKNSHIEYSIVGGDYKNNFSIDPNLGIIIPIGGIDFEQIAGDNTNIRPIHLTVQARDFGSPPLSSTVPVTVYVADVNDHAPSFTQTVYKRAIPEDMPGGTSVIEVKARDSDGSSPNNRVVYRIQRGASDKFVIDSFSGLISVAAGANLDPDRTEPTTNRYVLTVVALDGGIGDQQLSASVIVNITIVDVNNKPPVLVEPGLVHVMENTQVGTVIYRAHAYDLDEQPVLRFSIDKELSSGRNEDGVPVTINDYDYIGIWDLNTIDGTLRIVRSLDREKVEIIKLVITVEDMAAMSNGPVQRASAILTVIVQDENDNNPKYRKPFYKTSITENSKNGVHIETVIADDADRNRTMTYMLEGPEEILGLVHMDSSTGEVVVANRIDHELQPWINVTVKATDSGTPPRSAAVELVIQVLDENDNNPIFEPSSFEYRVREDIEPGSTVADIVARDADSGEYGKITYLLDRVSTQGKFLINPETGALKVSDYLDRETQASYNLVVEAWDNYQFGYLSGESRNAFKQIVIHVEDVNDNPPVLTLPTGCTTISEFHNHREPILSVTASDADDTATPNGRVQFHLVGGKGHDLFRFEQVGGDANTGRLYAKQPLKDRFGNYTFIIEARDLGLPSNVVRDELNLCVTDYNDHAPVFVHPPQNVTIKVPENATIGTTVVEVKAIDADIGPNGAVRYRLRHDARGSYRTFTIHPTSGALRTTGALDRDKQTTYQLRIEAYDLGLPTPLSSDLDLTIYVQNVDNYKPRFPDRRLHLNITENEEFSTTLPRVLERDEIDRGDEPMLPVCYYIVDGNDEGLFELDRATYRFKSKPLDREQKDQYLITVLATEDCVRPDYEASEDSSTLQIYINIQDVNDNAPQFISKMFTGGITTEADFGIEFMHVKAIDQDDGVNAKINYYLLGEVKETLTEGLENLAVSPFLVDVDTGAVTLNFDPQKGMKGYFDFKVLANDTDGLQDEAHVFIYLLREDQRVRFVLRSHPSEIRDKINIFRERLARVTESVVNIDDLRVHENKDGSVDNTKTDLYLHLVNSGDHSVLEVEQVLKIVDKNIEHLDDLFKEFNVLDTQPAEYQPLTADTLSSQQAVFWLVWTAGILSALLAVTVIMCLSQRADFTRRLRAATTAYSSQTSGEADMTIRSSGGRVPNTNKHSTKGSNPIWLHAYENDWYKTDDQMSHSERDSLDENAVDQDLSNDKPYFITTYPSGDQDLDNHNDLDHRGYDFYQQLEQVKNAKNMETTEL-
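
The correspondence between the two sequences above can be spot-checked by translating short stretches of the transcript. The sketch below is a minimal illustration, not the pipeline used to build this geneset. It assumes the standard genetic code, and it assumes the trailing `-` in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str, stop_symbol: str = "-") -> str:
    """Translate a nucleotide string in frame 0, writing stops as stop_symbol."""
    nt = nt.upper().replace("U", "T")
    residues = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")  # X for ambiguous codons
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 bases and last 15 bases of DPOGS200480-TA as listed above.
print(translate("ATGATCTGTGTAGAAGTCATATCAGCAAAC"))
print(translate("ACCACGGAACTCTAA"))
```

The first call reproduces the opening residues of DPOGS200480-PA and the second reproduces its final residues together with the stop marker. Translating the full transcript the same way requires first joining the nucleotide lines into a single string.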