Monarch geneset OGS2.0

DPOGS207947
Transcript        DPOGS207947-TA  5583 bp
Protein           DPOGS207947-PA  1860 aa
Genomic position  DPSCF300090 - 169232-188223
RNAseq coverage   110x (Rank: top 59%)
Annotation
Heliconius      HMEL014211        2e-167  62.57%
Bombyx          BGIBMGA000387-TA  0.0     38.33%
Drosophila      (none listed)
EBI UniRef50    UniRef50_Q16U55   3e-41   23.35%  Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16U55_AEDAE
NCBI RefSeq     XP_001660565.1    6e-42   23.35%  hypothetical protein AaeL_AAEL010014 [Aedes aegypti]
NCBI nr blastp  gi|157124873      1e-40   23.35%  hypothetical protein AaeL_AAEL010014 [Aedes aegypti]
NCBI nr blastx  gi|157124873      4e-58   23.17%  hypothetical protein AaeL_AAEL010014 [Aedes aegypti]
Group
Gene Ontology    GO:0016020   1.4e-12  membrane
                 GO:0005509   1.4e-12  calcium ion binding
                 GO:0007156   3.5e-07  homophilic cell adhesion
KEGG pathway     (none listed)
InterPro domain  [1455-1580]  IPR015919  1.4e-12  Cadherin-like
                 [1458-1568]  IPR002126  3.5e-07  Cadherin
Orthology group  MCL30147  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207947-TA
ATGTGGAAACAAGCTTATACAGTTGTAGTGATACTTGTAGCATCTGGATTTTTTCACTACAGTCATGCTTGCACAGTAGAAGACGTAGACCAGAGCGTTCCTGTTACAAGAGAAATAAAAGACACATTTAGAGGCATCTTTTTTTCAAGTAATACTCAAAATATTCAAGAACCCGTTTCTTTAATACAAAACGAAGGTCTAAAGGAAGGACCGTATCTTGATATTTTCCTAAATAACTCACTACTTACCATCGGTACCAACGACAACTTTGCCAATTATGAAGAAGTTGAAACCGAAACGACTATGAGATACACAGTAAATTTAGGATGTACAAGTGGTTCGCGACTGAGTTTTGTTTTGATTATAAACATAAGAGACACAAACAACAATAACCCTATATTTCTGCAACCGGAATACGAGTTTAACATCGTCCTACCAGTTCCGCCTGGATTCATGGTCACCAATTGTGAAAATGACATTGTTGTCAGAGATATAGATCTCACCACTCGAAGGATTGATTATAAACTGAATGGAAGCGATCTGTTTGAAATATCGTACGACCCATCTTCTAAAGTTCCAAAAGAATTTAAATCGATACTCAGAACGACTACTTTGATAAGATATATACCCGAGCAAATAACACTGACGTTGACTGCCACGGATGTCGACGAAACTAGCGATCCGGCAAGATCAAACACTACGACGGTTATCATCAAAGCAGACAATCAATTTCAATTTCCTGATGAGCCAATATTTTCACAACCTTTTTATTTGGCATCATATGAAAGGGAAAGCGATTTTGTATTACAAGATACCATTTATTTGGAACAGGGATATGACGATCAAGTAAAATTCAGTTTTGAAAGCGATTATTCTGAATACTTTAATATGGTCGTAGACGGTAACAAAGTTCAGTTTAATATGAATAAATCGATACCAGTCCAATTGTATGAAAAAAAACAGATATATTTAGTTGTAAAAGCAGAAAGAGAGTATACTAGTGGTGCCACTGCTGCGGTTATTTTGAAATTACCAATAGATACTAAGTTAGAATTTGAGAGAGCTATTTACAAAGGAAAAATAACTGATAACGTTTTAAATCTAACAGATTTAATATTAAAACAAGGCTATGAACATTCCAATATAACAGTAGAAATAAGCAGTGAATACTCTTTCAAAGCGACTGTTCTATCAAATATCATAAAATTATCAATGGATCCTTTGACGGAGGATGTTATTAGAAATAATAATTTTATTGGTTTGGAAGTACTAGCCTCAAATAACCGAACTACGGCAATAACTGCTGTAGTATTAGAAATCATCAAAGAAGATACCATAACTCCAGTATTTGAAAAATATATATATAATGAATACATCACATTCTTCGAAATATCGCAGGATGGAGCAAGAATAAGACTTAAACCTTTGGCTACACCTGAAGAATTGCTGAAAACAAATATCATTTTAAGCATACTTGCTGATAAACCAAGAACAGTCGGAGCACATGCTACAATCAATATAGCGGTGCCATCAGCTAGATCTTTGGTTTTTGATAAGGATTTGTATATTGGAACAATAGAAAAGAATAATTTAACACTCAACAACATTGTTCTAACCGAAGGATATGCCTCAGACATTAATTTCACCTTAACTGGCGATCTTTCAAATCACTTTTCCGTAAGCAACAATCAAAACATCCTAACTGTCTCCGTAACAACGGAATTGCCGGAGACAGTTGTATTGGAGAACGACTTTATTGTATTGACTCTCGTTGCATTTGGAATGAAAGCTATTACAACTTCAACAAATATCGTCATTCGAATAATAAAAGAGGACTATCTAACTCCCATTTTTAACGAGCGGATCTATTCTGTAAAGTACGAGAATGATCAACTAAATGCCGTCAATATGACATTGATCCAAGGTTTTGACGAAACTGTCACATTTGAATTGATTGGCGTCCACAGAGAATTCTTTAGTTTAGAAACTAGTCTCAACAC
AATAAAACTAGTTGTGAACTCCACTATACCAGAAGATATTATATTTAATGAAAAGGTTATCATATTAAATGTTGTGGCTCGAAAGCCCCTAACAGTTGGAGCAAATGCAGCAGTGTATATTACATTTCCACCAGAGATGACAGAGCTAGGCGTATTGAATTTCACCCAAAATGCCTATAGTGGTTCTTTAAAAGAGGGTGTCCTTATAGTAGAAGATATAATACTTGAATATGGTTACACGCCACAAACTACATTTATTCTAAGCGGAGACTATTCAGATAAATTTTCCTTAAATTATTCAAGCAATGTTATCACAGTGATACTAAAAAGTAATGTATCGTTGGAGGAAATCGAAAGACAGAATTTTATTACTTTAGAAGTTAAAGCAACAAGAAGAAGGACTATTCCAGCAACAACTGTAGTAGTGGTGTATATCAATAGAACTAGCATAGTTAGTCCCATATTTGAACAAGCATTTTATAACGGCAGTTACACTAACGATGGTGGATTAGTATTTGAGCAAGTAATATCATTACAGGAAGGTTTTGATAGTACTGTAGAATTTAGTTTAGAAAAAGAATATTCACAGTGGTTTGTGTTGGAACAAAATGGCAATTCTGTAATATTAAAATTAAATACTTCAAATCCTATACCAGAGGCCATCACAGAAAAAAACAAGCAATTTTTAATCACAATTTTTGCCAGGAAACCAGAAACGGTTGATGGAAGAGCAGTCATATATATAGATCTACCGAAAGAAAATAGCAATGTCAGAATTCTTCAATTTGAACATGTCAGCTATTTGGGAAGTATAGAAAGCGGAGTAATACAACTAGAAGAGATTCGTCTCAGGACCGAAATTACGTCAGCGATGGATTTCAATATAACCGGAGAATACGCATCGTATTTTACGATCTCTAAACAGACTGAATCTATACAGATTGACATCATGGACGCGCCGCCAGAAGTATTTGAAAACAATGACTTTTTAGTTCTCAATATAAATGTGTTTGAAGTCGGATCTGTAAGTGGACATACCACTGCCGTATTGAATATAATAAAAGATAGACAGTATAAAAATATTACCCCCGTATTTAGCGAAGCCTATTACACTGGACAGTATTCTAAAGATACTGGCCTTTTATTCTCATCTATAATAACCCTCATCCAAGGTTATGATGAAACTGTGACATTTTTACTTGACAGCGAAGACTCGAAAGGTTTTGAATTAGTAGAGACAGACGTAAATAATTTTACACTGACATTCAATGGAAGTTCAAGTGAAGGACATAAGAAGAATTACTTATTGTTCCCCGTTATAGCACTTAAACCAAACTCCAGACAAGGAAGCGCTGCAATATTTATTTCTATGACAGGATCTCCGGAGACGAATGTATTCTTTGATAAAATTCTTTACAATGGGAAACTAGAGGATGACATTCTCTCACACGATACCATAACATTGACAGGTTTTAATGGAACAAATATTTTAATAACGGGTGAAAACTCGAGACTTTTTGAAGCGGAATTCATTAACGGCTTTGTAAAAGTCACAACAACGTCTTCTTCAGAATTTCCGAGAGAGCTTACCCACATAGCACTTGAATTACAAGCAGGGAGTGCAAAGTCAGTACTTTTAATAGATGTCAGTTTTTCAGATAATCCGGATCTGCCAAAGATATCATTTAAATCAGAATCTTACTTCTTTTGGGCCGATGTCAAACAGACGGGAGAAATTGGAAAAGTTGAAGCAACGGTTGATAATGATGAAGCTGTCACGTACTCCCTGCGTGTCACTAATGATCACATAGCATCTCGGCTTAACATAGATGAAAGTAATGGAGTATTACAGTTAACAAACGTCGCCGAGAAGGGAATATATAATTTCAATGTTAAAGCGACTTCAGTTCAAAGCAAAGTTGAAGCTACGGCATCAGTGCTATTGCGTGTTGACGCTCTGCCGGATTGCGGCGGCGAGGTTGGCTTATCACCTTTGATAG
TTATTGAAAGAGTTGAAGAAGAAGCTCATTATAATTTGGTTGTCTTAAATGAGACTGAACATGAAGGTTGCAAGTATACATTAACAAACGTCTTCCCCGAAGATCAATCCTGGCTATACGTCGAAAATAACGGATTGCATACAAAGCCTATTGACCGAGAGGACAAATCAATTGCTTTCATGACTCTGTCTCAGATTCAAGTGGAATTAACACTTAAGTGCGACAGTGATGGAGTACCGGCCTTCGCTAAGCGTTCATTAGACGCAGACGATAGCTCGTACCTCTATTCTTACGACTACGGTCCTAATAAATGGGTCCTGACTGACACCATTTTATATAATGCCAGACGAAGCTTCGTGAACCTTATCGTCAAAGATATTAATGATAATTCACCAAAATTCAACGGAAAGGAAAATGATACAATTTACGTCGGATATCCGATGTCCGAAATAGAGGGGCTCGTTCTTCCACGTGCTCTTGCTGAATTGAAGGCGACAGATGAGGATATAGGAGAAAATGCAGCTATAATGTATTGGAGCAGGGAAGACAATCTGGCCGTTTCACCAAATACCGGTTTAGTTCACGTTCGCAATAACGCAAAATTGGAAAACAATTCCCGTTTAACAGTATATGCTATCGACCAAAACGGACAAGGGAACAACGGCTCTATAGTTATTGTGGTTAAATTATTAAATAAAAACCAAATTGCCGTCCTGACTATAAGAAACGCATTCTTGGAGGATGAATCTAACGTTTTAAATGACTTGAGTAATTCGGTGGGATACGACATTAAGGTTTTAAGGTCTATTGTGATTTCTGATAGCTATGAAGAATCCAATAGAACGAAACGAGACATAAATAGCAATAGTGGATCGTCTCTGCAACTATATGTTTATGGTTTAAAGGAAAGTGAACCAGTCGACATCAATCAATTGACCGGTGATATTAACAATAATAACGTAGCTACAATCACCATAGCCAGGATTCTATCTTTAGAAGATCATCTTGACAGTCTAGCGATATGTCCCGGACTGGAACGTGATATAGGCCTCCTAGCTACGACCATCGCATTATCTATCCTGATATTAATTTTAATTATTGCGATATCTGTCTTGTTCTTCCTTAAATGGAGAATAACAAGAAACTACGAGAGATTCAGTGATAGCAACAGTACTACTTCCCAGCTAGCATCACCCAAGCTTCCTGTTATTGAAGTTCCTCAGAAAACACGTCTCAATATGGAGGAAATAAAAAGGAGTGAGAAGAGACTACAAGAAATGTTAGAAGAACCGGTCGAAAGTCAACTCGATTCCGAAACAGACGATATTAAAGATGTAAATGAGACTCTTGATGAGGCCATTGTTAACATATCAAGTGATGTTCAATTACCTATTATAATACAATCAATAGATAAGCTGAAAGATGGTAATGACGAGTCTGATGATAATGACGAATATGGTGAAATGAAACAACCGCGGAAATCTGTTGTGACGTTCAATGAAAACGTTGAGAAAATCATTCATATAGAAGACGTTAATGAAGATGAAAGCTCAGAGCAGAGTTTTGAAGTTTATAAATTTTAA

Protein sequence:

>DPOGS207947-PA
MWKQAYTVVVILVASGFFHYSHACTVEDVDQSVPVTREIKDTFRGIFFSSNTQNIQEPVSLIQNEGLKEGPYLDIFLNNSLLTIGTNDNFANYEEVETETTMRYTVNLGCTSGSRLSFVLIINIRDTNNNNPIFLQPEYEFNIVLPVPPGFMVTNCENDIVVRDIDLTTRRIDYKLNGSDLFEISYDPSSKVPKEFKSILRTTTLIRYIPEQITLTLTATDVDETSDPARSNTTTVIIKADNQFQFPDEPIFSQPFYLASYERESDFVLQDTIYLEQGYDDQVKFSFESDYSEYFNMVVDGNKVQFNMNKSIPVQLYEKKQIYLVVKAEREYTSGATAAVILKLPIDTKLEFERAIYKGKITDNVLNLTDLILKQGYEHSNITVEISSEYSFKATVLSNIIKLSMDPLTEDVIRNNNFIGLEVLASNNRTTAITAVVLEIIKEDTITPVFEKYIYNEYITFFEISQDGARIRLKPLATPEELLKTNIILSILADKPRTVGAHATINIAVPSARSLVFDKDLYIGTIEKNNLTLNNIVLTEGYASDINFTLTGDLSNHFSVSNNQNILTVSVTTELPETVVLENDFIVLTLVAFGMKAITTSTNIVIRIIKEDYLTPIFNERIYSVKYENDQLNAVNMTLIQGFDETVTFELIGVHREFFSLETSLNTIKLVVNSTIPEDIIFNEKVIILNVVARKPLTVGANAAVYITFPPEMTELGVLNFTQNAYSGSLKEGVLIVEDIILEYGYTPQTTFILSGDYSDKFSLNYSSNVITVILKSNVSLEEIERQNFITLEVKATRRRTIPATTVVVVYINRTSIVSPIFEQAFYNGSYTNDGGLVFEQVISLQEGFDSTVEFSLEKEYSQWFVLEQNGNSVILKLNTSNPIPEAITEKNKQFLITIFARKPETVDGRAVIYIDLPKENSNVRILQFEHVSYLGSIESGVIQLEEIRLRTEITSAMDFNITGEYASYFTISKQTESIQIDIMDAPPEVFENNDFLVLNINVFEVGSVSGHTTAVLNIIKDRQYKNITPVFSEAYYTGQYSKDTGLLFSSIITLIQGYDETVTFLLDSEDSKGFELVETDVNNFTLTFNGSSSEGHKKNYLLFPVIALKPNSRQGSAAIFISMTGSPETNVFFDKILYNGKLEDDILSHDTITLTGFNGTNILITGENSRLFEAEFINGFVKVTTTSSSEFPRELTHIALELQAGSAKSVLLIDVSFSDNPDLPKISFKSESYFFWADVKQTGEIGKVEATVDNDEAVTYSLRVTNDHIASRLNIDESNGVLQLTNVAEKGIYNFNVKATSVQSKVEATASVLLRVDALPDCGGEVGLSPLIVIERVEEEAHYNLVVLNETEHEGCKYTLTNVFPEDQSWLYVENNGLHTKPIDREDKSIAFMTLSQIQVELTLKCDSDGVPAFAKRSLDADDSSYLYSYDYGPNKWVLTDTILYNARRSFVNLIVKDINDNSPKFNGKENDTIYVGYPMSEIEGLVLPRALAELKATDEDIGENAAIMYWSREDNLAVSPNTGLVHVRNNAKLENNSRLTVYAIDQNGQGNNGSIVIVVKLLNKNQIAVLTIRNAFLEDESNVLNDLSNSVGYDIKVLRSIVISDSYEESNRTKRDINSNSGSSLQLYVYGLKESEPVDINQLTGDINNNNVATITIARILSLEDHLDSLAICPGLERDIGLLATTIALSILILILIIAISVLFFLKWRITRNYERFSDSNSTTSQLASPKLPVIEVPQKTRLNMEEIKRSEKRLQEMLEEPVESQLDSETDDIKDVNETLDEAIVNISSDVQLPIIIQSIDKLKDGNDESDDNDEYGEMKQPRKSVVTFNENVEKIIHIEDVNEDESSEQSFEVYKF-
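
The lengths in the record header are consistent with each other if the 5583 bp transcript is a complete coding sequence ending in a single stop codon: 5583 / 3 = 1861 codons, that is 1860 amino acids plus the stop. The nucleotide sequence ends in TAA and the protein sequence ends in "-", which appears to mark that stop. The following is a minimal sketch of that check, not part of the database itself. The function names `read_fasta` and `expected_protein_length` are hypothetical helpers introduced here for illustration, and the sketch assumes the sequences are supplied as FASTA text in which a sequence may be wrapped over several lines.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line  # append each wrapped chunk to the current record
    return records


def expected_protein_length(cds_length_bp):
    """Amino acids encoded by a complete CDS that ends in one stop codon.

    Assumption: the transcript is coding sequence only (no UTRs).
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


def protein_length(seq):
    """Length of a protein sequence, ignoring a trailing '-' or '*' stop marker."""
    return len(seq.rstrip("-*"))


# Lengths taken from the record header: 5583 bp transcript, 1860 aa protein.
print(expected_protein_length(5583))  # 1860
```

To check the full record, the two FASTA blocks above can be passed through `read_fasta`, and `protein_length` of the `DPOGS207947-PA` entry compared with `expected_protein_length` of the length of the `DPOGS207947-TA` entry.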