Monarch geneset OGS2.0

DPOGS212867
TranscriptDPOGS212867-TA1992 bp
ProteinDPOGS212867-PA663 aa
Genomic positionDPSCF300086 + 450030-452021
RNAseq coverage238x (Rank: top 43%)
Annotation
HeliconiusHMEL0173392e-3839.58% 
BombyxBGIBMGA000812-TA0.085.45% 
DrosophilaPatj-PA9e-15544.16% 
EBI UniRef50UniRef50_G6DHJ70.0100.00%Putative uncharacterized protein n=4 Tax=Coelomata RepID=G6DHJ7_DANPL
NCBI RefSeqXP_002425192.13e-18051.85%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3838588040.053.06%PREDICTED: patj homolog [Megachile rotundata]
NCBI nr blastxgi|3838588040.053.28%PREDICTED: patj homolog [Megachile rotundata]
Group
Gene OntologyGO:00055151.8e-25protein binding
KEGG pathwaytgu:1002302293e-83 
 K06095 (MPDZ, MUPP1)maps-> Tight junction
InterPro domain[1-105] IPR0014781.8e-25PDZ/DHR/GLGF
Orthology groupMCL10605 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212867-TA
ATGGTGCTGAGCACGGAGTGGGCGCAGGTGGAAGTGGTTGAGCTTACGAACGACGGGAATGGTTTAGGGTTTAACCTAGTGGGGGGACGTAGCACGGGTGTCGTGATTAAATACGTTCTTCCTGGAGGAATCGCTGACAAGGACGGACGACTGCAGAGTGGCGATCACGTACTTCAAGTCGGTTCAATTAACCTTCGCGGCTTCACGTCGGAGCAGGTAGCTGCAGTGCTGCGTCAGGCAGGGCCTACGGTGCGCCTTTTGGTCGCTCGGCCTGCAGATCCTGCCGCAGCACTCCGCGCTCCAATTCCTGGCATTGCACTCGTACCCACTAAACTCTTAGCTGATCCCGAACTTCTCGATCGTCACCTTATCGAAGCGGGTTATGGAGCAGTCTACGATTTGTCTCAGTGTTACAGTGACTACATCAATGGACAAAATGTAGAGATTGAAAATTTAGCCCTAGTGGCAGCAGTCAGTGTTATTGGTGAAAATCCTCAACAGATCCCAGACCACCCAATTGTTGCTGATACTCTTTCCCCGACAATAACTATAACAGTCCCAGTGGAGTTGCCTACACCTCCAGAAGTGGAAATAGTTCATGTAGACTTAAATAAGAATGTGTATGGTTTAGGTATAACAGTGGCCGGATATGTATGTGAAAAAGAAGAACTATCTGGAATCTTTGTTAAAAGTATAATTGAAGGTAGCAGTGCTGAACAGAGTGGTCAGATAAGGTTAAATGATAGAATTATTGAGGTTGATGGTGTCTCTCTAGCAGATAAAAGTAATCCTCAGGCTGTGGAGATATTAAGAAATACTGGTATATCAGTTCATTTAGTTTTAGAAAGGTATTTGAGAGGTCCAAAGTATGAGCACTTACAATTAGCTATATTTAATGAGGAACGACCAGCTTCACCTTCTCCTTCCGCAACTACCCTGACCTGGTTCCCAGTGCCTTCCCAAGCAGAAGATATCAGTACTACAGAAATTGAGCCCGAACCTGAATCAAATACCACCATAGATTCTAGTGTCTTAGAGGTTGGTGAATGTGATGCCAATGATCCTACTCAGGAAGAATTAGATGCAAAATTTGATGCAATTCTAGCTGTAAGTGAAGAGGAAATAAAAATTAGGTGGGAAGAGGAGATTGGTCCTGGTAAAGACATTATTGTGGCAGAGGTACACAAACTATCGGGCTTAGGAATCAGTTTAGAAGGAACTGTAGATGTAGAGGGGGGCCAAGAAGTGAGACCTCACCACTATATCAGATCAGTTTTGCCAGAAGGTCCCATAGGACAACAAGGCACACTTGCAGCAGGTGATGAACTTCTAGAAGTAAATGAACACAGGTTACATGGACTTACACACACTGAAGTGGTCAATATACTAAAGCAGCTCCCGAATAGGGTACGTCTGGTTTGTGCAAGAAGTAGCACAGAGAGTGGACCTCGCCCTGTTGTCAATCTAGCCCAAGATCGAGAAGGCTTTGAAGCACGAAAGATCATATCTGGTAGTTTAAATAATTTGACAACTATAGTCAAAGCTCAATCAGATACATCCATCAATACATCGAGTACTGCTACTCTCACTAACCAATCTAATCAATCAAAGAAATCTAAATCTCTAGAGTGTGTGTCAGGCCTAGCCATGTGGCAGAGTAAAGAAGACATTGTTAAGCTGATGAAGGGTGATCAGGGGCTTGGATTTTCTATACTAGATTACCAAGACCCAATTGATCCTAATGGTACTGTTATTGTGGTAAGGAGCTTAATTCCTGGAGGTGTGGCAGAAAAAGATGGTCAAATATCACCTGGAGATAGAGTGATGTCTGTGAATGGATCAAGTATCAAAAATGCTACTCTGGACCAAGCGGTTCAGGCTTTGAAAGGAGCTCCGAGGGGAGTTGTACGAGTCGGCATTGCGAGACCACTCCCATCCTGTGACTCCTCAAAATCAAAGAGCACAAGTACTCTCAACATCAAACCCAGTTAA

Protein sequence:

>DPOGS212867-PA
MVLSTEWAQVEVVELTNDGNGLGFNLVGGRSTGVVIKYVLPGGIADKDGRLQSGDHVLQVGSINLRGFTSEQVAAVLRQAGPTVRLLVARPADPAAALRAPIPGIALVPTKLLADPELLDRHLIEAGYGAVYDLSQCYSDYINGQNVEIENLALVAAVSVIGENPQQIPDHPIVADTLSPTITITVPVELPTPPEVEIVHVDLNKNVYGLGITVAGYVCEKEELSGIFVKSIIEGSSAEQSGQIRLNDRIIEVDGVSLADKSNPQAVEILRNTGISVHLVLERYLRGPKYEHLQLAIFNEERPASPSPSATTLTWFPVPSQAEDISTTEIEPEPESNTTIDSSVLEVGECDANDPTQEELDAKFDAILAVSEEEIKIRWEEEIGPGKDIIVAEVHKLSGLGISLEGTVDVEGGQEVRPHHYIRSVLPEGPIGQQGTLAAGDELLEVNEHRLHGLTHTEVVNILKQLPNRVRLVCARSSTESGPRPVVNLAQDREGFEARKIISGSLNNLTTIVKAQSDTSINTSSTATLTNQSNQSKKSKSLECVSGLAMWQSKEDIVKLMKGDQGLGFSILDYQDPIDPNGTVIVVRSLIPGGVAEKDGQISPGDRVMSVNGSSIKNATLDQAVQALKGAPRGVVRVGIARPLPSCDSSKSKSTSTLNIKPS-