Monarch geneset OGS2.0

DPOGS208459
Transcript        DPOGS208459-TA   4689 bp
Protein           DPOGS208459-PA   1562 aa
Genomic position  DPSCF300064 - 1772792-1778026
RNAseq coverage   42x (Rank: top 72%)
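
The lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes (this is not stated in the record) that the transcript is a complete coding sequence ending in a stop codon and that the genomic coordinates are 1-based and inclusive. The variable names are illustrative only.

```python
# Values copied from the record above.
transcript_bp = 4689
protein_aa = 1562
genomic_start, genomic_end = 1772792, 1778026

# A coding sequence with a stop codon has one more codon than residues.
codons, remainder = divmod(transcript_bp, 3)
cds_matches_protein = (remainder == 0 and codons == protein_aa + 1)

# Assumption: coordinates are 1-based and inclusive.
genomic_span_bp = genomic_end - genomic_start + 1

# Genomic bases not present in the transcript (for example introns).
non_transcript_bp = genomic_span_bp - transcript_bp

print(codons, remainder, cds_matches_protein)
print(genomic_span_bp, non_transcript_bp)
```

Under these assumptions the transcript holds 1563 codons (1562 residues plus a stop), and the genomic span of 5235 bp exceeds the transcript by 546 bp.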
Annotation
Source          Hit                     E-value  Identity  Description
Heliconius      HMEL004932              0.0      60.99%
Bombyx          BGIBMGA010597-TA        0.0      60.77%
Drosophila      ft-PA                   0.0      44.04%
EBI UniRef50    UniRef50_UPI0002247D15  0.0      44.60%    UPI0002247D15 related cluster n=1 Tax=unknown RepID=UPI0002247D15
NCBI RefSeq     XP_001357109.2          0.0      53.85%    GA17399 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp  gi|198475669            0.0      53.85%    GA17399 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastx  gi|270006422            0.0      47.91%    fat protein [Tribolium castaneum]
Group
Gene Ontology    GO:0016020   9.9e-38  membrane
                 GO:0005509   9.9e-38  calcium ion binding
                 GO:0007156   1.1e-35  homophilic cell adhesion
KEGG pathway
InterPro domain  [1103-1207]  IPR015919  9.9e-38  Cadherin-like
                 [1108-1212]  IPR002126  1.1e-35  Cadherin
Orthology group  MCL10034     Multiple-copy universal gene
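
The bracketed InterPro ranges can be used to pull the domain residues out of the protein sequence. The sketch below assumes the ranges are 1-based, inclusive residue positions on DPOGS208459-PA, which the record does not state explicitly. The helper name `domain_slice` is hypothetical.

```python
def domain_slice(protein, start, end):
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("domain range falls outside the protein")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]

# Ranges copied from the InterPro rows above.
domains = {
    "IPR015919": (1103, 1207),  # Cadherin-like
    "IPR002126": (1108, 1212),  # Cadherin
}

# Length of each annotated domain under the inclusive-range assumption.
domain_lengths = {acc: end - start + 1 for acc, (start, end) in domains.items()}
print(domain_lengths)

# Toy usage on a short string; pass the full protein sequence in practice.
print(domain_slice("ABCDEFGHIJ", 3, 5))
```

With inclusive coordinates both annotated ranges span 105 residues, and they overlap over residues 1108 to 1207.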

Nucleotide sequence:

>DPOGS208459-TA
ATGCAGTCCCGCGCTGTGGATACCGGTGTCGATCTCCGTGTACCCGAGTCTCAACCTATTGGAACCAATGTCGGAAGAATACCTATTAAACCTGGTTTTACTTATCGTTTTAATGAGCCGCCTAAGGAATTCGTTCTAGATCCTGTATCTGGTGAAATCAGAACTAATGTAATTCTTGATCGGGAAAGTGTTGATCGTTATGCTTTTGTCGTTTTATCAAGTCAGCCAACGTATCCTATCGAAGTGCGTCTTAGAGTTACTGATGTCAATGATAACTCCCCAGAGTTCCCCGAACCGGTGATAGCTGTGGCATTTTCAGAGAGTGCAGCTCCAGGGACTAAGTTGTTATTAGACGCGGCTACGGATAAGGATTTAGGTGAAAATGGAATTGCCAACGATTACAGAATAGTCGACGGCGATAATGAAGGAAAGTTTAGATTGAACGTTACTGTCAATCCTAGTGGACAAACTTCTTATTTACATTTAGAAACTACAGGAAAATTAGACAGAGAAACAAATGATTTTTACATTTTAAATATCTCTGCACGTGACGGTGGTAGTCCTCCAAAGTACGGGTATCTTCAAGTGAATGTTTCAATTTTAGATGTAAACGATAACCCTCCTATATTTGATCAAAGCGATTTTTCAGTATCACTTAATGAGAGTGTTCCTCCCGGAACTACTGTCCTTAAAGTGACCGCAACGGATAGTGATTTGGGTGATAATAGCAAAATTACTTATGAAGTGACTGATACAGAAAAACAGTTCGCCGTTGACCCAGAGTCCGGTGTTATAACGACATTAAAAAAATTGAGTTGTCCAAAATATTGCAGTAACACAACTTGTAATATGACGTGTGTTCTTACAGTTATAGCAAAGGACCACGGAGTACCACGTCAAGATGCCAGAACATATGTAACTGTCAATTTGATCGATGCTAATGATCACGATCCAGTTATCACTTTTACTTATGTACCATCGACAGCTAACTTTGCAACAGTGGATGAAAATGCTAAAAATGGATCACTTGTTGCTGCTATTACTGTAACAGATTTAGATGCTGGTCTTAATGGAATAACGAGCATTAAAATAGTGGCAGGAAATGAATTAAATCATTTTAGACTGGAAAACTCAAGCAGTGTGTATATAGTTCATGTTAATGGTATTTTAGATAGAGAGGAAATAAGTAAATATAATTTAACTGTAGTAGCTACTGATAAAGGTACACCCCCACGTACTGCTACTTCGTTTCTAGTAATTCACGTAAATGATGTCAATGATCATGAACCTGTATTTGAAAAATCTGAATACTCTACAGTTCTTAGTGAATTGGCTCCCATTGGAACATATGTCGCTGGAATAACAGCAACAGATGAAGATACGGGAGTCAATGCTGAAATTTTTTATGATTTTTATGACGGTAACCAACAACAATGGTTTTTTATTGATCACTATACGGGCCTAGTAACAACTAGATCAGTTTTAGACAGAGAGATCCAAGGCACTGTCGAATTAAATGTGTCAGCCCGTGATGGTGGGCCAAACCCTAAATGGGCATATACAAGGCTAAAAATTACTATATTAGATGAAAATGATGAGGCACCTAGTTTTCCTCAACTTCAAATTAATACATCACTACCAGAAAACGTGAAACCGTTAAAAGAAATTCTTATTTTAACTGCTTCCGACTACGATCAAGGAACAAATGGATCTGTGTCATATTACTTGTCATCAAATATTGAAAGGAAATATCCTAACACTTTTATGTTGGATCCAATAACTGGTCAATTAAGCGCTGTTACAGAATTAGATAGAGAAAGAATTCCATTGTATGAGATACAGGTTATAGCAAAAGATCAAGGTTACCGTCCACAATCATCTACAGCCACAGTTTTCTTAAGAGTAATTGATGTTAATGATAATGATCCAATATTCTATCCTCAGAGATATTTCGAAAGTATAAGAGAAGATTTGGCACCAGGGTCTAGGGTTCTACAAGT
AAAAGCATTTGATCTTGACGAGGGGGACAATTCTAAAGTAGTATACAAGTTAGAAAGTGGTGGTGAAGGCTATTTTGATGTTGAACCTGAAAGTGGCAATGTAATATTGCAAAAAGATTTCCGTAAGGCACCCAAGTCACTTTATTCGTTAAGAGTATCTTGCAAAGATAAAGGAAATAGAAAGGCTGTTGAAGACGCTATAGTTGATATTATTAAGGTTTCTTATAGAAAAAACTTAGAGTTTGATGGATATAATGGTTACAATTTTAAAATAACTGAAGATGATGGCTTATTGAAATCGAGCCCTGGCCGGATTGTAGGAAAAGTTGGTACAAGAACTATAAGTGATAATATATCATATTATATTATAGAAGGCGATCCAAAGAAAATTTTTAAAATTGACGAAAAATCTGGAATAGTCACTACAGAGTCAAATCTTGACCGTGAAGATAAAACTACATATCACCTAAAAATAATGGCAAGAAGTGGACAAGCTTTTGGTTTTACTACTATGAATATTTCAGTATTAGATGGGTATATTTATGTTAAGAGTCCTCTCGACAGAGAAGAAAAGGATTATTATTCATTAACTATCGTCGCATCAGACCACGGGAAACCGTCAAGATCATCTCAAGTCCCAGTAGTTATTCACGTTCTTGATGAAAATGACAACAGTCCACAGTTCACGAATACTACATTTATTTTTAAAATAAAAGAAAATGAACCGCCTGACACTTTCGTGGGTAAATTAACTGCTACTGATAAAGATATCGGGCGTAATGCAGAATTAACTTTCAGCTTACCAATTGCTCAAAATGATTTCAGAATTGATTCGAGAAATGGATTTATTAAAACATTAAAGTCATTTGACAGAGAAAATTTATCCCAGAATAGCGGTCAAAATTATATAACGTTAACCGTTACTGTAAGTGATAATGGAAAAGTAAAACTTTCAGATTCTGTTAGAGTTACCATATATGTCACAGATGTAAATGATAATTCACCAATCTTTACACGTACTCCTTATGCAGTCGAAGTATCTGAAGGAGCAGTTGTTGGTGCTTCAATCATGAGGGTTTATTCTTCGGATGCCGATGAAGGTCTAAATGGTGACGTTTATTATAAGCTGATAGGAGGTGATGATTTAGGAAAGTTTGTGCTTGATGAAGCTACAGGACAACTTTTTATAAATAAGCCATTAGATAGAGAAAGTATAGACCATTACGTACTCACAGTTATAGCACATGATTCGGGACAAACAACTCGGCTTTCTTCAACAACAATAATCACGGTAGATGTTCTGGATGAAAATGATAATGCACCCGTATTTGAACAGTCTCAAATGAATGTGTCTGTGTTGGAAACAGAGCCTGTTAATAAAAAAATTATTCAATTTCATGCCAATGATGCCGATTTGGGCATAAATAATGAATTGCAATATTCAATTACATCTGGAAATAGAAAAGAGACATTTTTTATTGACAGTTACTCAGGGGAGCTATTTTTACATAAGCATTTGGATTATGAAGATTTGACATCGTACGTCCTAAACATAACGGCGACTGATAACGGAAACCCAAGTCTATCATCTAGTATCACATTTACTGTCACTGTTATAGATGCAAATGATAATGCTCCAATATTTACGAATACAGCTATAGTACGTCAAATTAGGGAAGGAATTCCTATGCATACTCCAATTGTCACTGTTACTGCAGAAGATCCCGATTCAGGATGGAATGGAAAAGTTTACTATTCCATTACTCATCAAGATCCCAGTAACGGAAAAAGACATTTTGCAATTAATAATGTCACAGGCGTTATATACACTTTATTGCCTATTGACAGAGAAATCATTGACACCTTTAGAATAACTGTAGTGGCTTGCGATAAAGCTGAACCTGCGTCCAGTAGATTGTGTTCTGAAAAATTGGTAACTGTAATTGTTGAGGATATCAATGATAACGCCCCAGTATTTGTGTCCATGAATGCCGCTGTTA
TTAACTCTGAAAGATTGGGTAGAAGTATGAGTCGAGGCAAATTCATTATGAATGTTTTGGCTAGAGATTTGGATTCCGGGACTAATGGTCTAGTAACGTATAAGTTAATTCATGGAGGGAATGACATGTTTGATCTTCATAGGAGTAATGGTGCGCTAAGTCTACGTTATCCTCCATCAATGCCTGATGTAAGATGGAATTTAGTTATAAAAGCCACAGATGAGGCAGTTTTAAGTGAACAGAGGAGTACTGAAACTTACTTAACTGTGATCATGGGAGGAACTGAACTGGAAGGTATTATGTGGACGAATGTAGGATCTATTTCCGTAGCTGAAAATGAACCAGCCGGAACTGCTGTGCTAAACATGACTAACAATTACAAAGGTGGTTTAGAATACTATATAGTGAATGTGACTGGTGATAATAAACAAGTGGATAGGTTATTTGATATAGATTCGTCCTTGGGCATTTTATCCACTGCTGTTTCATTGGACAGAGAGGCTGGTGTCGATCGTTACGAAGTGGAAATTTGTGCTGTATCATCAGGAAGTCCTTTGCAATCCACAACCACAAAGGCATATTGGTTTGATTATGGATTGGATTCATTTACTTATTTTAAAATGGTCAATCCAATTAACTTCGCACAATCGTCGGTAGAAGACAAAACATCCGCACCATACTGTTGCTAA

Protein sequence:

>DPOGS208459-PA
MQSRAVDTGVDLRVPESQPIGTNVGRIPIKPGFTYRFNEPPKEFVLDPVSGEIRTNVILDRESVDRYAFVVLSSQPTYPIEVRLRVTDVNDNSPEFPEPVIAVAFSESAAPGTKLLLDAATDKDLGENGIANDYRIVDGDNEGKFRLNVTVNPSGQTSYLHLETTGKLDRETNDFYILNISARDGGSPPKYGYLQVNVSILDVNDNPPIFDQSDFSVSLNESVPPGTTVLKVTATDSDLGDNSKITYEVTDTEKQFAVDPESGVITTLKKLSCPKYCSNTTCNMTCVLTVIAKDHGVPRQDARTYVTVNLIDANDHDPVITFTYVPSTANFATVDENAKNGSLVAAITVTDLDAGLNGITSIKIVAGNELNHFRLENSSSVYIVHVNGILDREEISKYNLTVVATDKGTPPRTATSFLVIHVNDVNDHEPVFEKSEYSTVLSELAPIGTYVAGITATDEDTGVNAEIFYDFYDGNQQQWFFIDHYTGLVTTRSVLDREIQGTVELNVSARDGGPNPKWAYTRLKITILDENDEAPSFPQLQINTSLPENVKPLKEILILTASDYDQGTNGSVSYYLSSNIERKYPNTFMLDPITGQLSAVTELDRERIPLYEIQVIAKDQGYRPQSSTATVFLRVIDVNDNDPIFYPQRYFESIREDLAPGSRVLQVKAFDLDEGDNSKVVYKLESGGEGYFDVEPESGNVILQKDFRKAPKSLYSLRVSCKDKGNRKAVEDAIVDIIKVSYRKNLEFDGYNGYNFKITEDDGLLKSSPGRIVGKVGTRTISDNISYYIIEGDPKKIFKIDEKSGIVTTESNLDREDKTTYHLKIMARSGQAFGFTTMNISVLDGYIYVKSPLDREEKDYYSLTIVASDHGKPSRSSQVPVVIHVLDENDNSPQFTNTTFIFKIKENEPPDTFVGKLTATDKDIGRNAELTFSLPIAQNDFRIDSRNGFIKTLKSFDRENLSQNSGQNYITLTVTVSDNGKVKLSDSVRVTIYVTDVNDNSPIFTRTPYAVEVSEGAVVGASIMRVYSSDADEGLNGDVYYKLIGGDDLGKFVLDEATGQLFINKPLDRESIDHYVLTVIAHDSGQTTRLSSTTIITVDVLDENDNAPVFEQSQMNVSVLETEPVNKKIIQFHANDADLGINNELQYSITSGNRKETFFIDSYSGELFLHKHLDYEDLTSYVLNITATDNGNPSLSSSITFTVTVIDANDNAPIFTNTAIVRQIREGIPMHTPIVTVTAEDPDSGWNGKVYYSITHQDPSNGKRHFAINNVTGVIYTLLPIDREIIDTFRITVVACDKAEPASSRLCSEKLVTVIVEDINDNAPVFVSMNAAVINSERLGRSMSRGKFIMNVLARDLDSGTNGLVTYKLIHGGNDMFDLHRSNGALSLRYPPSMPDVRWNLVIKATDEAVLSEQRSTETYLTVIMGGTELEGIMWTNVGSISVAENEPAGTAVLNMTNNYKGGLEYYIVNVTGDNKQVDRLFDIDSSLGILSTAVSLDREAGVDRYEVEICAVSSGSPLQSTTTKAYWFDYGLDSFTYFKMVNPINFAQSSVEDKTSAPYCC-
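
The protein record appears to be the conceptual translation of the transcript, with the trailing "-" standing for the stop codon. The sketch below shows one way to check this. It assumes the standard genetic code and that any line breaks inside the nucleotide record have been removed before translation. The names `CODON_TABLE` and `translate` are illustrative, not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)

# First six and last three codons of DPOGS208459-TA.
print(translate("ATGCAGTCCCGCGCTGTG"))
print(translate("TGTTGCTAA"))
```

The two fragments translate to MQSRAV and CC-, matching the start and end of the DPOGS208459-PA record.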