Monarch geneset OGS2.0

DPOGS203720
TranscriptDPOGS203720-TA3210 bp
ProteinDPOGS203720-PA1069 aa
Genomic positionDPSCF300010 - 1032406-1035892
RNAseq coverage64x (Rank: top 67%)
Annotation
HeliconiusHMEL0059480.077.80% 
BombyxBGIBMGA003506-TA0.066.09% 
Drosophilaa-PB2e-2961.29% 
EBI UniRef50UniRef50_UPI00020618CC8e-2963.92%UPI00020618CC related cluster n=1 Tax=unknown RepID=UPI00020618CC
NCBI RefSeqXP_392250.21e-4729.65%PREDICTED: similar to PDZ domain containing 3 [Apis mellifera]
NCBI nr blastpgi|3287804572e-4629.65%PREDICTED: hypothetical protein LOC408714 [Apis mellifera]
NCBI nr blastxgi|3407099381e-5329.29%PREDICTED: hypothetical protein LOC100650835 [Bombus terrestris]
Group
Gene OntologyGO:00055154.9e-27protein binding
KEGG pathwaynve:NEMVE_v1g2162822e-12 
 K06095 (MPDZ, MUPP1)maps-> Tight junction
InterPro domain[930-1062] IPR0014784.9e-27PDZ/DHR/GLGF
Orthology groupMCL24913 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203720-TA
ATGCGCCTCTTCCGCGGGCCGAAGCGCAGGGAAGTGACCGGTCCGCAGGCCGCGACGTCTCCGCGCAGAGACGCCTCGCACACCCCGCCAACATACACTCTTGTCACTGCAGTACCAGCAATGGTCATGGAGGACGATGCCAATGATATGAAAAAAGTGTTTGCTACGTGGGGTCGCCGAATGGGGAGGAAACTAGATTTATTAAAAAAGAATGAATCTAAAGAGATGCCTGAAGAACCAGGTACACACGAATCTGTAAGCAACGATTCTTCTATAGTTTTAACTGGACAGTATAAAAAGAAACAAAACTGGAAAATGGGAAGAAGTACATCAGACTCATCATCTCTAAAAAGAGACAGTGATACAGAATCAATACGCAGCGGATCCAGAGATAGGTCACCAAGCCCTTTTAAATCTTTTTTTCATAGAATGGGTTCAACGGGTATGTTAAATAGTTCTAAAACACAGTCTTTAAATTCTCCGAACCCACCAAACAGCTATTCTTTATCCAATGGTCCCGCTCTCTATAGAAGTTGTTCGACCTCACACTTATCGACATACGTAAAAGCTGATGACCCATCTGACGATATAGATTTACAAAATACTGGAGCCGAAACTAAAAATTCTCCTACAAAACAAAAGAACGCTAATCTTTTGACCGAAGACAATTTTGTATCTTCATCTACTAAAGCTATCAGTTGTGATAATATTCCAAATAAATTGGAAGCCCCACAAAATACGGGCACTTGCAAAAAACCTAACTTTCCATATGCTTTTTTAAGATCTAAATTATCAGTTTTACCAGAGGAAAATAGCTTGGCTTCACACCGCTCTGTTTCTGTCAGACAAAGCTTTTCCGAAAGGATAGATCGAAAATCTCCTAAATTTCGAAAAGAAAGGTTATACCTGTCAAACTCCCGATCAGAAGAACGATATCAAAGTAGCGATAACATATCCGTCCACGATGAAAGTATACTTATAAATTCAGGTCTGAGAAACTGCAACGACTTTGCGAGAAGTTCTATAAGAAGCATAAACGAGACTGATGTCTACCAAAATAAACATGTTGATGATGTACTTAGGAGATCATCTATGATATCTCACAAAGCACCAATTGATTATGACCCTATGTTAATTCCTCGAAATAGAAATAGCCTGCCAGTTTATGAGTACAGATCTACTATAGGTTCTGCACATGATCTTAAAGGTGACCTTGCTTCTTCTAATCAAAGCATTCATCAAGAAGTACTTCAAGCACATCGCCTCAGTAGCTATGTTAGTTCTAACGAATCTGGATATGACAGTGATGGAAGACCAACAGATGAGCACAGTAACCATTCACCTCCAGGTTATGGCAATAATATGGGAGGAATATCAATTACAATGAATGGGGAACATAATCTTGGTAACTTTTCACGACAATTAAGTTTAAATCACAAGATTAATGTCAACAAAGTTCAAGTACCCATGCGTAGAAGTTCTACTCCGTGTGCTTTATTTCCCATTGAAAAAAAGTCATACTATGAGTACGAAAATGAAAATAGAGATGACAATCAATATCAAAATAACGATTCAAATCTTATCATCATCGATTATAATGATGGAAAACCACCGCCACTGCCAAAGAAAACAGTCCATAAAAAGACACCATATATTTATAAAACCATCACTTTAGATGAACGTGTTCAAACAAGAAAAATAAAATTGCTACAGTCACAATTAGTTTCTGTTGATCACTCGTCTACGCCGAATCTCAGTGATAACTCTTTAAATACTTCTAAGGAATTGAACAATGTACAAAGAATTCATGAGACAGATATACTTGGAAGGGGTCCATGTACAAAACGATTTAGAAAAATAAGACTACTAAAATCAAGATTGGATGAAAGTTTGGGTGTTTATTTGGCACAAAATAGAGTGGATTTTGATAAAAGTGGAAATAATTATGAAATCCGTTATATAATTGTTAAACTTGATTTTGATGGAATTGCTCACCGAGATGGGCGACTCAGAATAGGTGATGAGATCGTTAACGTAAATGGGAAAGTCTTAAGAGGACTTTCTTCGCTCAGAGACGTACAACATATAGTAAATTCTTGCTCTACTGAAGCAACAATCGAGGACAGTGGTCTTTTTCAGAGGTATCAAGTTGATCTGGTCATGGCACATGATGAAATATCTCCCATTACCTTAAGTCGTATTATTAATAATAAATCTAGTGAACCCAGTTCTGTACACTCCTCTTCAAGTATTCCTCCTGATATTATTGCACAAACCCACAAACCATCACCCCCAAATCAGATGGTAATTGAAACTCATTTTCCAAGCACTGACAGCACATTAAATTTACGTAATAGCAATAATACCAGTGGATATGTGAATGATATGCCAGAAAAGAATCAAAAGTGTGAAGTCGGTATACAGGTCAACAATAATCTATTGCTTTCTAACCAAAAAAGTGACGACGAAAAATTGTTGGAGCGAAACTATTATACACAATCCCATTCATCTCTACAAAATATAACTAATATAGAAATTTCAATGAGATCATCGCCTACACCGAGACACGGCACCAATAGTTCATATCGTCCCATATCATTTCACAGCACTCGTCCTCAACATTTATGTGGAACAACACTGTGTAACGATAATACTACCAACGATATCAAAGAAAATTCTTCTCATTATATAAAGAACGTTCATAGGACATACCAAAATAGGTCTTATGAAAGTATACCGGAACAACTGAGAAGCACCAGTCGCAGTAGATTCTTTTCTAGAGTTGGATCACAAAGATCTTCACCTAATTATGGTTCTCATGTATCACGACAGCAGGAATACAGCTATCATGATATACAATCCTCTCAAAATACTTTGCATAGAGCAGAATTCTGGAAAGGTCCAGGTCACAAAAGTCTTGGCTTCAGTATCGTTGGAGGCACAGATTCTCCAAAAGGGCAAATGGGTATATTTGTTAAAACAGTCTTTCCAAACGGGCAAGCAGCCGATAAGGGAACTATTTATGAAGGTGACGAAATCTTATCAGTTAACAACGTGGCAACTCGAGGATTAAGCCATGCTGGAGCAATATCATTGTTCAAAAAAGTTAAAGAAGGAAAATTAGAATTGACGCTTTCAAGAAGAAGAGCTCCTAGGTCAAGATCTGTAGAACCTCTCGGTAACTTTCGCAACGATAGCAAAAGAGATTGA

Protein sequence:

>DPOGS203720-PA
MRLFRGPKRREVTGPQAATSPRRDASHTPPTYTLVTAVPAMVMEDDANDMKKVFATWGRRMGRKLDLLKKNESKEMPEEPGTHESVSNDSSIVLTGQYKKKQNWKMGRSTSDSSSLKRDSDTESIRSGSRDRSPSPFKSFFHRMGSTGMLNSSKTQSLNSPNPPNSYSLSNGPALYRSCSTSHLSTYVKADDPSDDIDLQNTGAETKNSPTKQKNANLLTEDNFVSSSTKAISCDNIPNKLEAPQNTGTCKKPNFPYAFLRSKLSVLPEENSLASHRSVSVRQSFSERIDRKSPKFRKERLYLSNSRSEERYQSSDNISVHDESILINSGLRNCNDFARSSIRSINETDVYQNKHVDDVLRRSSMISHKAPIDYDPMLIPRNRNSLPVYEYRSTIGSAHDLKGDLASSNQSIHQEVLQAHRLSSYVSSNESGYDSDGRPTDEHSNHSPPGYGNNMGGISITMNGEHNLGNFSRQLSLNHKINVNKVQVPMRRSSTPCALFPIEKKSYYEYENENRDDNQYQNNDSNLIIIDYNDGKPPPLPKKTVHKKTPYIYKTITLDERVQTRKIKLLQSQLVSVDHSSTPNLSDNSLNTSKELNNVQRIHETDILGRGPCTKRFRKIRLLKSRLDESLGVYLAQNRVDFDKSGNNYEIRYIIVKLDFDGIAHRDGRLRIGDEIVNVNGKVLRGLSSLRDVQHIVNSCSTEATIEDSGLFQRYQVDLVMAHDEISPITLSRIINNKSSEPSSVHSSSSIPPDIIAQTHKPSPPNQMVIETHFPSTDSTLNLRNSNNTSGYVNDMPEKNQKCEVGIQVNNNLLLSNQKSDDEKLLERNYYTQSHSSLQNITNIEISMRSSPTPRHGTNSSYRPISFHSTRPQHLCGTTLCNDNTTNDIKENSSHYIKNVHRTYQNRSYESIPEQLRSTSRSRFFSRVGSQRSSPNYGSHVSRQQEYSYHDIQSSQNTLHRAEFWKGPGHKSLGFSIVGGTDSPKGQMGIFVKTVFPNGQAADKGTIYEGDEILSVNNVATRGLSHAGAISLFKKVKEGKLELTLSRRRAPRSRSVEPLGNFRNDSKRD-