Monarch geneset OGS2.0

DPOGS207352
TranscriptDPOGS207352-TA3807 bp
ProteinDPOGS207352-PA1268 aa
Genomic positionDPSCF300188 + 316455-331997
RNAseq coverage184x (Rank: top 49%)
Annotation
HeliconiusHMEL0127213e-2729.82% 
BombyxBGIBMGA010279-TA0.074.96% 
Drosophilapyd-PA1e-15665.86% 
EBI UniRef50UniRef50_E2B5H82e-16864.86%Tight junction protein ZO-1 n=2 Tax=Formicidae RepID=E2B5H8_HARSA
NCBI RefSeqXP_001120987.12e-16864.94%PREDICTED: similar to CG31349-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3072137157e-16864.86%Tight junction protein ZO-1 [Harpegnathos saltator]
NCBI nr blastxgi|3504094185e-17255.95%PREDICTED: tight junction protein ZO-1-like [Bombus impatiens]
Group
Gene OntologyGO:00055151e-24protein binding
KEGG pathwaydme:Dmel_CG313492e-154 
 K00942 (E2.7.4.8, gmk)maps-> Purine metabolism
InterPro domain[280-395] IPR0014781e-24PDZ/DHR/GLGF
[791-979] IPR0081456.4e-24Guanylate kinase/L-type calcium channel
[645-811] IPR0014521.6e-10Src homology-3 domain
[458-558] IPR0081442.7e-08Guanylate kinase
[699-760] IPR0115112.9e-06Variant SH3
Orthology groupMCL10655 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207352-TA
ATGAAGTTGGATAACCATGGATTTGAAAAATTGCAGACAGCCGAACGTAATTCTGGTTGGGAGACACACAGGGTGCGTCTGAACCGCGTGCCGGGGTACGGGTTCGGGATTGCGGTGTCCGGGGGCAGGGACAACCCCCACTTCGCAAGCGGTGACCCCTCAATCGCTGTTTCTGATGTCCTCAGAGGAGGACCCGCTGAAGATAAATTGCAAGTGAATGATAGAATAGTATCAGTGAACGGAGTGCCGTTGGAGAATGTGGAGTATGCGAGGGCAGTCCAGGTGTTACGAGAATCTGGGGCAACGGTGTCTCTGGTGGTGCGGAGACGAGCGCCGGCACCACCGCCCACCGCCCCCACCACCATTAAACTAGCTTTGACGAGGAACGGAAAGAAGGAAGATTTCGGAATAGTCCTTGGATGCAAAATTTACGTGAAAGAACTAACGATGCGGGCGAGGGATCAGTTGAACCCGTCGGGTCAAGGGTTGTGTGAGGGTGACGTCATAACTAGGATCAATAATACAGCCGTCACAGACGCCATGACGTTGAAAGAGGCCAAGAAGTTAGTGGACTCGTGCAAGGATCGTCTTAATCTGGTGATAACCAGAGAACTGATCAGAGAGGAGACCGTCACGAATGGAAATTACCAAAACAATTACAGTAGTCTAGAAGCCGCGCCTCACACTTACCCGAACAGCTCAGAGGCTATGTCGTCACCATATTCCATAAGCGGACAAAACCTCTACGTGGCAGCACCAGTCCGCGGAGACAGGCGACACGACGACCAGCCGCCAAGACCTCCGCCACCAAGGAACGACGACTATTACAGCAGTAGAAGACAGTTGTACGAAGAAGATGGTATGACGAATCAGAGGAACAAGCCACCGAGCGAGCCGAGGTTGATAAGTTTCCAGAAGGAGGGGTCGGTGGGCCTGCGTCTGTGTGGCGGGAACAGGTCGGGGGTTTTCGTATCAGGAGTACAGCCCACCAGCCCCGCCGCCATACAAGGTCTACAGCCCGCGGATAAGATTCTCAAGGTCAACGACATGGAAATGAAAGGCGTGACGAGGGAAGAGGCCGTTCTGTTCCTCCTGAGTCTTCAAGATCAGATAGACCTCATAGTACAACACTCGCCTGACGAATACAACGCCGTGGCCAGCGGACAAACACCGGGCGACTCGTTCCACGTCAAAACACATTTCCATTACACCGAGCCGACGGAGGGGGAGATGTCGTTTCGTTGCGGCGACGTCTTCCACGTGTTGGACACATTGCACAATGGAACCGTCGGCGCCTGGCAGGTCTACCGGATAGAAATGGACAGTACTCTGGAGGACAGCAAGTCGAAGCCGAGCGGCATCATCCGCCTCTCCAGCATTCGCAGCATCATGGAGCGAGGAAAACACGCGCTGTTGGACATCACGCCTAACGCCGTGGACAGGCTCAACTACGCGCAGTTCTATCCCATAGTCATATACCTGAAAGCCGACAACAAACATATCATCAAACAACTCAGAAGCGGCCTGCCTAAGTCCGCCCACAAGTCATCGAAGAAACTTCTAGAACAGTGTCAGCATATGGAGCGAGTGTGGGGTCACGTGTTCACACACACCATCACACTTAGTGATGCCAACATCAATACATGGTTCGGCAAACTCGGCGAGCTGGTGCAACGCACACAGCAACAACAGCTATGGGTGTCTGAGACTAAGGTTGGTATACCTGCAGACTATTACAGCAGTAGAAGACAGTTGTACGAGGAAGATGGTATGACGAATCAGAGGAACAAGCCACCGAGCGAGCCGAGGTTGATAAGTTTCCAGAAGGAGGGGTCGGTGGGCCTGCGTCTGTGTGGCGGGAACAGGTCGGGGGTTTTCGTATCAGGAGTACAGCCCACCAGCCCCGCCGCCATACAAGGTCTACAGCCCGCGGATAAGATTCTCAAGGTCAACGACATGGAAATGAAAGGCGTGACGAGGGAAGAGGCCGTTCTGTTCCTCCTGAGTCTTCAAGATCAGATAGACCTCATAGTACAACACTCGCCTGACGAATACAACGCCGTGGCCAGCGGACAAACACCGGGCGACTCGTTCCACGTCAAAACACATTTCCATTACACCGAGCCGACGGAGGGGGAGATGTCGTTTCGTTGCGGCGACGTCTTCCACGTGTTGGACACATTGCACAATGGAACCGTCGGCGCCTGGCAGGTCTACCGGATAGGTCGTAATAACCAAGAAGTTCAAAAGGGCACGATCCCGAACAAGGCTCGTGCTGAGGAGCTTGCTACAGCACAGTTCAATGCCACCAAGAAGGAGATGTCCGGCAACGATGCTAAGACTAACTTCTTCAGACGACGACGATCAACACACAGGCGCAGCAAGAGCCTTGGAAAGGAGCATTGGGACGAGGTGGTGCTATCAGACAGCATCAGCAAGTTCCCAGCATACGAGCGTGTGGTGCTCGCTCAGCCAGGGTTCGTGAGGCCTGTGATAGTGTTGGGAGCTCTCGCGGACGTGGCCAGGGAACGCCTGCTGGCTGAACACGGAGACAAATTCGCCTCGCCCAAAATGGACAGTACTCTGGAGGACAGCAAGTCGAAGCCGAGCGGCATCATCCGCCTCTCCAGCATCCGCAGCATCATGGAGCGAGGAAAACACGCGCTGTTGGACATCACGCCTAACGCCGTGGACAGGCTCAACTACGCGCAGTTCTATCCCATAGTCATATACCTGAAAGCTGACAACAAGCATATCATCAAACAACTCAGAAGCGGCCTGCCTAAGTCCGCCCACAAGTCATCGAAGAAACTTCTAGAACAGTGTCAGCATATGGAGCGAGTGTGGGGTCACGTGTTCACACACACCATCACACTCAGTGATGCCAACATCAATACATGGTTCGGCAAACTCGGCGAGCTGGTGCAACGCACACAGCAACAACAGCTATGGGTGTCTGAGACTAAGCACGTTGAAATGGTGTCGGACATCTACTTCCCCACCCCACCTTCTCCATACCAAACCTTCTATCCCATCTCCCATCCCCTAGGCTACTATCACAGCCCACAGAGATCTATCACGCCCAAAGCATCGCCATGTAACACTAACAGACTCGTCAAACGCCACGACCAAGTGTACGACGCTAACAGAAACCATGTTGTGCCATATTATTACTCGGAAACATTAACGCCGTGCATGCCCAATAATTTTTACACACCCAAATACAGTAACCGCAACAGGCATGGCCCATACGATGAGAGACACCCTCAGAGAACGAGATTGACACCGAGACTGCAAAGTTCCAGTGTCAGTCCTGAACTCACACAGATTTTCACAAAAACCAACACACCGCAGTCGTTCAGACCGTACAGTGTTCTAGAACATTCTACAGCGTCGCCTTTGCCTAAAATACAAAGGCCCACGTCTGTTGTTCCAATGTCAGATAACTTTAGACCGGCTTCCGTGTTCTTCAGAGATACTAAACCCAATGAAAAGTCAAACGAAGTATCGGAAATCAAACCGCTTCCCCAAAAGGAAACAACTCAACCAATTCTGGATAAATCTTTCAGACAGATCTGTAGAATACCAAATTGTAATTGCAATTTGAAGCCTATGAACAATCTGCCGAGATTTAGCGAGTCCCGTACTTTACCTAGCCTGTCTTCTAAATCGATCGACAAAGTTAAAAATTTATCTCTCCCAACACTCAAATTAGACGATTTAAAAGACTTTGGGAGAGATGTTCCTGAGATCGAAAGAGATCCGGAGTCCAAACCTGTGTCCGAGAAACATTTGGGTTTTGTGTAA

Protein sequence:

>DPOGS207352-PA
MKLDNHGFEKLQTAERNSGWETHRVRLNRVPGYGFGIAVSGGRDNPHFASGDPSIAVSDVLRGGPAEDKLQVNDRIVSVNGVPLENVEYARAVQVLRESGATVSLVVRRRAPAPPPTAPTTIKLALTRNGKKEDFGIVLGCKIYVKELTMRARDQLNPSGQGLCEGDVITRINNTAVTDAMTLKEAKKLVDSCKDRLNLVITRELIREETVTNGNYQNNYSSLEAAPHTYPNSSEAMSSPYSISGQNLYVAAPVRGDRRHDDQPPRPPPPRNDDYYSSRRQLYEEDGMTNQRNKPPSEPRLISFQKEGSVGLRLCGGNRSGVFVSGVQPTSPAAIQGLQPADKILKVNDMEMKGVTREEAVLFLLSLQDQIDLIVQHSPDEYNAVASGQTPGDSFHVKTHFHYTEPTEGEMSFRCGDVFHVLDTLHNGTVGAWQVYRIEMDSTLEDSKSKPSGIIRLSSIRSIMERGKHALLDITPNAVDRLNYAQFYPIVIYLKADNKHIIKQLRSGLPKSAHKSSKKLLEQCQHMERVWGHVFTHTITLSDANINTWFGKLGELVQRTQQQQLWVSETKVGIPADYYSSRRQLYEEDGMTNQRNKPPSEPRLISFQKEGSVGLRLCGGNRSGVFVSGVQPTSPAAIQGLQPADKILKVNDMEMKGVTREEAVLFLLSLQDQIDLIVQHSPDEYNAVASGQTPGDSFHVKTHFHYTEPTEGEMSFRCGDVFHVLDTLHNGTVGAWQVYRIGRNNQEVQKGTIPNKARAEELATAQFNATKKEMSGNDAKTNFFRRRRSTHRRSKSLGKEHWDEVVLSDSISKFPAYERVVLAQPGFVRPVIVLGALADVARERLLAEHGDKFASPKMDSTLEDSKSKPSGIIRLSSIRSIMERGKHALLDITPNAVDRLNYAQFYPIVIYLKADNKHIIKQLRSGLPKSAHKSSKKLLEQCQHMERVWGHVFTHTITLSDANINTWFGKLGELVQRTQQQQLWVSETKHVEMVSDIYFPTPPSPYQTFYPISHPLGYYHSPQRSITPKASPCNTNRLVKRHDQVYDANRNHVVPYYYSETLTPCMPNNFYTPKYSNRNRHGPYDERHPQRTRLTPRLQSSSVSPELTQIFTKTNTPQSFRPYSVLEHSTASPLPKIQRPTSVVPMSDNFRPASVFFRDTKPNEKSNEVSEIKPLPQKETTQPILDKSFRQICRIPNCNCNLKPMNNLPRFSESRTLPSLSSKSIDKVKNLSLPTLKLDDLKDFGRDVPEIERDPESKPVSEKHLGFV-