Monarch geneset OGS2.0

DPOGS214891
TranscriptDPOGS214891-TA3150 bp
ProteinDPOGS214891-PA1049 aa
Genomic positionDPSCF300487 + 27669-38409
RNAseq coverage57x (Rank: top 69%)
Annotation
HeliconiusHMEL0101420.085.08% 
BombyxBGIBMGA010946-TA0.088.27% 
DrosophilaGrip-PA3e-8837.84% 
EBI UniRef50UniRef50_D6X3C71e-13741.03%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X3C7_TRICA
NCBI RefSeqXP_973894.12e-13841.03%PREDICTED: similar to glutamate receptor interacting protein 1 [Tribolium castaneum]
NCBI nr blastpgi|910906465e-13741.03%PREDICTED: similar to glutamate receptor interacting protein 1 [Tribolium castaneum]
NCBI nr blastxgi|910906461e-13740.97%PREDICTED: similar to glutamate receptor interacting protein 1 [Tribolium castaneum]
Group
Gene OntologyGO:00055154.9e-24protein binding
KEGG pathwaycin:1001866314e-20 
 K06095 (MPDZ, MUPP1)maps-> Tight junction
InterPro domain[443-574] IPR0014784.9e-24PDZ/DHR/GLGF
Orthology groupMCL11494 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214891-TA
ATGCATGTCAGATACCGTTTGTTGTGTAAATCAGAATGGCACTCTATCTTCCCATTGTGTCGTTCCGAGCCGGGCGTAGTTCCATCAATAGTTGGGTTCACGAAAGACTCGGTAGCAAATGATTCGGATCGATTAGCTCCAGGTGATAGAATATGTAGTGTTAATGGTATTTCAACAGCAAGATTAACTAATGATGAAGTACTGAGGTTGTTAGATAATGTGGAAGAAAGGGCATCGTTGGAAGTAGAATACTATATGCCAAACTATGCTTCACAGAGTTCTCTTTATATTACAACCAAGCTAGCAGAGGTGCCGGTTGAAAAAATAAATGGCTCTCTGGGAGTAACAATACGAGGTGGTTTACCCGAAAACTCATCGTCTAGTGCTGATTTAGTTTTAAACAGCAGGTCTCTAGATGCATTACCTTTAGTCGTCACTCATATTAGGCCAGGTGGTGCTGCCTATAATACGTCTAGGATAAAACCTGGCGATAGGCTTTTAAAAGTCGACCATATCTCTTTAACGAATAAAACACTGTCAGAAGTTCATCATATACTCCAAAGTTGTCCTCAAGTGACGAGTCTTACAATTGAATACGATGTGTCAATAATGGAATCGGTTAAATTAGCAACCGGACCTCTACTAATTGAAATCGAGAGACCGTGTAATGAGGATTTAGGGTTATTTTTAAGCAATCAAAGATACTCTGACGATGTTTATAGTTCTGGATCTGATACTTATCAGAGAGTTGGAACTAGCAATGCTATTTACATAGACAGTATCCTTCCAGCTAGTATCAGTGATCGATGTGGTGCATTACATCCTGGAGATCAGCTCCTCGCATTTGATGACCACGTTATAGATGGTAACAACTACACAGCAGAGGAGGTTATGTGCTATCTTGAAAACTGCGAGGCTGGTTTTACAAGATTACATATTGCCCCACGACACATACTGAGCCACGGCGGAAGATTTACTAGAGATAATTCAATGTCGGGTTCATCAACTTTAAATCCTAAAAAGCATCGGCAATGGAATTATCGACAGAGCTCAATGCCTAAATTAGGTCCACAAGAAGATTACGATGGTCAGCAGAATTACATGAGTTTGGGCATGTGTCGTAGTGAATCGTTGAATATACAACTGGAAGTGCCTCCAGGACAAACCAGCGGTTTGGCAGTTCATGATGAAAACGCTATCTTAATTATATCACACGTTGCCACACAGTCACCGGCGTATCGAACCGGCTGCTTACAAGTTAGAGACAGAGTAATGTCTATTAATGGTCACGAGAATCTCACTTGTGATGTAGCCAATGAGATATTACAGAGGAGAAACGACTCTCACAATCCTAAATACCTCACCCTTAATATTGAATTCAGTATGCCTAATGCTATTGAAGCTTCAAGTGGAGTGTTTAACGTTAAGCTCGCCAAAACGTCAGCAGGTCTCGGCGTCACAATAACAGGCTGCAAACAGAAGTTGCTTACTAATGAAGAACCCATGGTAATATCAGATATCAAGCCAGGATCTGTTGCTCACCGAAGCGGCGCTTTGACACCAGGGGACCAACTTCTCGCCATAAACGGACAGCCTTTACATAATCTGTCCCTGGACACAGCCTTCAACATACTTCAGAATTCACCAGAAGACATAATAACTTTGAAAATACGAAAGCGTGATCTAACGGAAGACTGGTCCAATATTCACAAACATAACGCAAAATTGACATTACAGAGCTTCAGTAACATTGAAACAAAGGCTGTCGTTCATAGCGGGGAAGATTCAGGTCACCATACGGGTAGTCCCAACAATAGTGCCAAAGATAGTGAAAGAAGTCACGGCAGTGATAATGGAACTGTAGTTTTTATGGTTGAGTTGATAAGACAAGAAAACGGTCCCCTAGGACTTACGATAGCGGGAAGTGAAGACGTCACACAGGCCATCTTATTAAGTGGTTTGGTTGAAGGTGGACTAGCTGAAAAATGTGGGAAATTGTCAGTCGGGGACGAATTACTGAGTATTAATGGAGAAAGCGTGTTGAATAAACCATTATCGGAGGCCATTAAACTCTTACAACAGAGCGGGAAACGCGTTCAATTACAAATGTGTAGAAAAATAACTGGTTCCCTTGACTGTGCTGAATCTAGTGTACGCGATTCCAGTCATTCCACATCCAGTCCAGGGCTCTCAAACGACAGCGCTGTTGAATCCTGGGATCAAAACACGCCAGTTAGAGTTAGTGCTAATTGCGGTAATTCCGAAGTAATAGAATATGCGGTTCCCGACAAGAGCCGAATAATAGATAAGCAGCCCTACTCACCGACAGATGAGGATAAACTTTTAGCTTGTAGTTTCAATTCAACAACTCCGTACACGGTGCACGATTTGCCTCTCCCGAATTATTCACTCAATAATTCTCTGAAGACCTTCCACTACGAAAATACATGTATCATTCCAGAGAACACTTTGAAAAATAAACAGAATATAACAAGAGAAGATGATGTTCAACAAATTGAAATTCTGACAAGCAACATGAAAGACTGTCAGTTACATAATATGGAAAAAAGTTCGTGTAAATGTGATTATGTACAAATGGGACCTTATGGTATCGTGTCACCAAAAAATAGACGCCCAAATTGGGACAGCGATTACTTGAGTAATGGTATTTACACTGTCACAACTCCACAGAAGTCACCTTTAAAGCCAAATGTTCCCGGCCCCAGTTTTCAGTTTACAACAAGTCCTATTTATGAAAACGACGTACCCAGTATTTATGGTAGTGAAACTCTTTCACCAGCTCGAGGTTCTGTCCATCACGTAATTTTATATAAGGACGCAATTTATGATGACTATGGATTCTCTGTATCCGATGGGTTGTATGAAAGAGGAGTGTATATTAATCGTATAAGGAAGGGAGGGCCGGCTGATATAGTAGGGCTGTTGAGACCCTACGACAGAATTTTACAGGTGAACGGCACAAGAACTGTGGACTACGACTGCTGTCTTACAGTGCCTTTGATAGCAGCAGCCGGAGATAGGCTGGAAATTGTTGTCCAAAGGAATGTTACATCTAGAGATCTCAAAAACCAGAGACATGAAGACAGTTCAAGCCCTAGTGAAAGTAGTATCGTGACTAAGACCATATAG

Protein sequence:

>DPOGS214891-PA
MHVRYRLLCKSEWHSIFPLCRSEPGVVPSIVGFTKDSVANDSDRLAPGDRICSVNGISTARLTNDEVLRLLDNVEERASLEVEYYMPNYASQSSLYITTKLAEVPVEKINGSLGVTIRGGLPENSSSSADLVLNSRSLDALPLVVTHIRPGGAAYNTSRIKPGDRLLKVDHISLTNKTLSEVHHILQSCPQVTSLTIEYDVSIMESVKLATGPLLIEIERPCNEDLGLFLSNQRYSDDVYSSGSDTYQRVGTSNAIYIDSILPASISDRCGALHPGDQLLAFDDHVIDGNNYTAEEVMCYLENCEAGFTRLHIAPRHILSHGGRFTRDNSMSGSSTLNPKKHRQWNYRQSSMPKLGPQEDYDGQQNYMSLGMCRSESLNIQLEVPPGQTSGLAVHDENAILIISHVATQSPAYRTGCLQVRDRVMSINGHENLTCDVANEILQRRNDSHNPKYLTLNIEFSMPNAIEASSGVFNVKLAKTSAGLGVTITGCKQKLLTNEEPMVISDIKPGSVAHRSGALTPGDQLLAINGQPLHNLSLDTAFNILQNSPEDIITLKIRKRDLTEDWSNIHKHNAKLTLQSFSNIETKAVVHSGEDSGHHTGSPNNSAKDSERSHGSDNGTVVFMVELIRQENGPLGLTIAGSEDVTQAILLSGLVEGGLAEKCGKLSVGDELLSINGESVLNKPLSEAIKLLQQSGKRVQLQMCRKITGSLDCAESSVRDSSHSTSSPGLSNDSAVESWDQNTPVRVSANCGNSEVIEYAVPDKSRIIDKQPYSPTDEDKLLACSFNSTTPYTVHDLPLPNYSLNNSLKTFHYENTCIIPENTLKNKQNITREDDVQQIEILTSNMKDCQLHNMEKSSCKCDYVQMGPYGIVSPKNRRPNWDSDYLSNGIYTVTTPQKSPLKPNVPGPSFQFTTSPIYENDVPSIYGSETLSPARGSVHHVILYKDAIYDDYGFSVSDGLYERGVYINRIRKGGPADIVGLLRPYDRILQVNGTRTVDYDCCLTVPLIAAAGDRLEIVVQRNVTSRDLKNQRHEDSSSPSESSIVTKTI-