Monarch geneset OGS2.0

DPOGS206150
Transcript          DPOGS206150-TA    1386 bp
Protein             DPOGS206150-PA    461 aa
Genomic position    DPSCF300028 + 1545489-1555039
RNAseq coverage     101x (Rank: top 61%)
Annotation
Heliconius        HMEL013473          4e-142    88.13%
Bombyx            BGIBMGA000711-TA    8e-139    91.13%
Drosophila        metro-PA            5e-153    62.30%
EBI UniRef50      UniRef50_A1Z8G0     7e-151    62.30%    Skiff, isoform A n=12 Tax=Pancrustacea RepID=A1Z8G0_DROME
NCBI RefSeq       XP_001362009.2      1e-153    63.70%    GA15582 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp    gi|198461418        3e-152    63.70%    GA15582 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastx    gi|332026659        3e-147    65.41%    MAGUK p55 subfamily member 7 [Acromyrmex echinatior]
Group
Gene Ontology      GO:0005515    1.4e-28    protein binding
KEGG pathway       xla:443569    1e-65
                   K06091 (MPP5, PALS1) maps-> Tight junction
InterPro domain    [210-310]    IPR001452    1.4e-28    Src homology-3 domain
                   [108-225]    IPR001478    1.2e-19    PDZ/DHR/GLGF
                   [238-301]    IPR011511    2.7e-10    Variant SH3
                   [68-122]     IPR004172    1.6e-08    L27
Orthology group    MCL13099    Single-copy universal gene

Nucleotide sequence:

>DPOGS206150-TA
ATGGATGGCTCAACTGAGAACTGGGACCCTGCTCTTACTCGCTTACTGTCTTCATTAAAAGATGTTCAGTCTGGTGGTGAAGATGTGGCTTTCCTCAATGAACTCCTTCAGTCAAAGCAGCTTCATGCACTCGTTCAAGTGCACAATAAGATTGTAGCAACTCAATGCAAGGATGATAAGTTTTATCCCTTATTATCTAATGCAATGCAGGTTACATTAGAAGTTTTAGAAATGTTTGTTGATGTAGTACAAATCTCCAGTGAATACAAAGAGTTGTTAGGTCTCCTACAAAAGCCTCATTTTCAGGCTATACTCTGTACACATGATGCTGTCGCGCAGAAAGACTATTACCCACATTTGCCAGACATACCTCCTGATGCTGACGATGAAGAGGAAACTGTTAAAATTGTCCAACTAGTGAAAAGTGACGAACCTTTGGGTGGAGCTCAGAGTGCAGAGCCTATAGTGGGTGCTACGATCAAAACCGATGAGGATACTGGTAAGATCGTGATAGCTCGTGTGATGCATGGCGGTGCCGCTGATAGATCTGGTCTTATACATGCTGGGGACGAAGTTATTGAGGTCAATGGCATCAGTGTAGAAAGCAAGACACCAGCTGATGTCCTCTCTATACTGCAAAATTCGGAAGGCACAATAACTTTCAAACTGGTACCGTCTTTCGGTAAAGGAGGTTCGAGGGAGAGTAAAGTAAGGGTGAGAGCACTGTTCAACTATAATTCCTCGGAAGATCCTTACATCCCTTGCAAAGAGGCTGGTCTTAACTTCAAAAAGGGTGACATTCTTCATATTGTGTCCCAAGACGATGCTTATTGGTGGCAAGCCCGTCGTGAGGGTGACAGAGTTATGAGAGCCGGTCTGATACCTTCGAGGACCTTGCAGGAGGGTCGCATTATACACGAGAGACAGACAGACCCTCAGACTGTTGACGGTAAACCAGCACTTTGCAGTCCAACAGCCGCAAATTCCGAATGCGGTCCAAAGACTCCCTGCTCGCCTACTCCTAACGCTGCCGCTCTATTGCCCTGCAAGTCCATGCCTAAGGTGAAGAAAATTATATATGATATAAAGGAGAACGATGACTTCGACAGAGAATTGATACCGACTTATGAGGAGGTGGCCAGATTATACCCACGACCCGGGTTTATAAGACCCATTGTTCTTGTCGGGGCTCCCGGTGTGGGCAGGAATGAATTGCGAAGGAGGCTGATTGCTTCCGATCCCGAAAAATACGTCACCCCCGTACCATGTTCGCCGCAATTAAGAACAGATGCCAGTACCAAATTTTATTATAGGGCCCATCTAGGGCAAGCCCTTGAAGATAGTATTGGTATTTCAATCGCTGTTGTCTTTTACGTGGTGTACTGA

Protein sequence:

>DPOGS206150-PA
MDGSTENWDPALTRLLSSLKDVQSGGEDVAFLNELLQSKQLHALVQVHNKIVATQCKDDKFYPLLSNAMQVTLEVLEMFVDVVQISSEYKELLGLLQKPHFQAILCTHDAVAQKDYYPHLPDIPPDADDEEETVKIVQLVKSDEPLGGAQSAEPIVGATIKTDEDTGKIVIARVMHGGAADRSGLIHAGDEVIEVNGISVESKTPADVLSILQNSEGTITFKLVPSFGKGGSRESKVRVRALFNYNSSEDPYIPCKEAGLNFKKGDILHIVSQDDAYWWQARREGDRVMRAGLIPSRTLQEGRIIHERQTDPQTVDGKPALCSPTAANSECGPKTPCSPTPNAAALLPCKSMPKVKKIIYDIKENDDFDRELIPTYEEVARLYPRPGFIRPIVLVGAPGVGRNELRRRLIASDPEKYVTPVPCSPQLRTDASTKFYYRAHLGQALEDSIGISIAVVFYVVY-
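
The reported lengths in this record are mutually consistent: a 461 aa protein plus one stop codon corresponds to 3 x (461 + 1) = 1386 bp of coding sequence, and the trailing "-" in the protein record appears to mark that stop. The following is a minimal sketch of such a consistency check, not part of the geneset itself. The function names `expected_cds_length` and `check_cds` are hypothetical, the short sequences in the usage lines are toy inputs, and the check assumes the transcript record is a complete coding sequence (start codon through stop codon, no UTRs) using the standard stop codons.

```python
# Standard stop codons (assumption: standard nuclear genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_cds_length(protein_aa: int) -> int:
    """Coding-sequence length in bp for a protein of protein_aa residues,
    counting one terminal stop codon."""
    return 3 * (protein_aa + 1)


def check_cds(nt: str, aa: str) -> bool:
    """Return True if nt looks like a complete CDS encoding aa.

    A trailing '-' or '*' on the protein is treated as a stop marker
    and is not counted as a residue.
    """
    nt = nt.strip().upper()
    residues = aa.strip().rstrip("-*")
    return (
        len(nt) % 3 == 0                  # whole number of codons
        and nt.startswith("ATG")          # begins with a start codon
        and nt[-3:] in STOP_CODONS        # ends with a stop codon
        and len(nt) == expected_cds_length(len(residues))
    )


# Lengths reported in the record: 461 aa protein, 1386 bp transcript.
print(expected_cds_length(461))           # 1386

# Toy input (hypothetical): ATG GAT TGA encodes M, D, stop.
print(check_cds("ATGGATTGA", "MD-"))      # True
```

To apply the check to this entry, pass the full DPOGS206150-TA and DPOGS206150-PA sequence strings (without their ">" header lines) to `check_cds`.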