Monarch geneset OGS2.0

DPOGS207355
Transcript: DPOGS207355-TA, 2694 bp
Protein: DPOGS207355-PA, 897 aa
Genomic position: DPSCF300188, + strand, 345639-349817
RNAseq coverage: 365x (Rank: top 32%)
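The lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database: the helper names are hypothetical, and it assumes that the protein length excludes the stop codon and that the genomic coordinates are 1-based and inclusive.

```python
# Values copied from the record above.
transcript_bp = 2694
protein_aa = 897
genomic_start, genomic_end = 345639, 349817


def coding_length(aa_count: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode aa_count residues, plus one stop codon if has_stop."""
    return 3 * (aa_count + (1 if has_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumption: 1-based, inclusive coordinates)."""
    return end - start + 1


cds_bp = coding_length(protein_aa)
genomic_bp = span_length(genomic_start, genomic_end)

# True if the transcript is exactly the coding sequence plus a stop codon.
print(cds_bp == transcript_bp)
# Genomic span, and how much of it is not covered by the transcript.
print(genomic_bp, genomic_bp - transcript_bp)
```

Under these assumptions the transcript length matches 897 codons plus one stop codon, and the genomic span is longer than the transcript, which would be consistent with the gene containing introns.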
Annotation
Heliconius: HMEL008858, E-value 0.0, 81.88%
Bombyx: BGIBMGA010280-TA, E-value 0.0, 76.34%
Drosophila: pyd-PA, E-value 4e-51, 63.77%
EBI UniRef50: UniRef50_Q17PB6, E-value 3e-52, 67.39%, Tight junction protein n=1 Tax=Aedes aegypti RepID=Q17PB6_AEDAE
NCBI RefSeq: XP_001953294.1, E-value 5e-57, 35.33%, GF17278 [Drosophila ananassae]
NCBI nr blastp: gi|194741634, E-value 9e-56, 35.33%, GF17278 [Drosophila ananassae]
NCBI nr blastx: gi|170041813, E-value 8e-63, 29.73%, tight junction protein [Culex quinquefasciatus]
Group
KEGG pathway: gga:415388, E-value 5e-26
 K05701 (TJP1, ZO1) maps to:
    Tight junction
    Gap junction
    Adherens junction
    Vibrio cholerae infection
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain: [764-856] IPR000906, E-value 4.8e-28, ZU5
Orthology group: MCL25725, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207355-TA
ATGCACAATCTGAGCTCCGACGTCATGAACTCCAGCGTGGAGGCGGACGAGAATTTCTCACCAAAAACTAAAAAGAAACTATCTAAAAAGCCATCGTTCATAAAAAATGCCGTCCTCGATCTGTTCAGATCAAAAACGAAAAATTCACAGAAAGTATCGCGACAGAAATCGTTGTGCGAAAGCGATTTGCAACAGAGATACGCCCAACAGATGCCGGCGTTGAGACGGGAGAAGTCCGATCTCAGTGATCTGAACACACACATGAGGAACACGCATATCAGGAACCAAATGTTGCAACGCAGCAACTCCTCGTCATGCGAAAAACCGAATCTAAAGCCGATACTCAAGAGACAGAGCTCGTTCTGTGACGACAGGAACGGTTCGATATGTACCGTCAGAGATTACGAAACGAAACCCGTGTCTAGGAACTTGATACGACAGAACTCCATGTGCGAAATGGAAACCGAACCTAAAGTTACCCCCTTACTACGTAGACAGAACTCACTCATAGAATATAGTAGGAGAGGAGTCTACCCACCTACCCCTAACTTTAATATGATCACCAAAATAGATCCCATTTACCAAAGACAACAGCAAATTAAACATGAACCTTTCTACCCAACTCAACCGCTACCTCCAGAGCCAATATACCAAAGCAAAGAACATATTGTATACGATCCTATCCCTAAGCCCAGGAGAACATATACTTCTCTTTACGATACCTCCAGCGAGAACCCTTACGCGAGTAGATCAGAAATTTCAAATCAAAGTTTCGAAAGCCCTTACGGCAATCGTGAAACTGTAGCAATGCCTCTACTAGATCCTCCCTATGCTACAAGAGCAGAATGCATGAGGGAAAGTAATTATGGTACTAGAAACGAAATGCCCGAAAATCCATACTCCCTGAAACAAGACGCGCCAAGAAAACCCGAATCACTCTATAGCGAATTGCCTTATGTGAGCAAAGAGCAGATGTTGAGACAAAGGATGACGCCTGAAAGTCCCTATGCCACTAAAGAGGATATGCTGAGACAAAGATTCTCAGAATCCCCGTACAGTCCAAAAGACGAAATGATTAAACAAAGAATGGATGGCTCGTTAACATGTAAAGAACCGATTTATGGTAAAAGAACATACGAGCCACCACCAGGCACGTCCTGGTCTGAAACTAATACGTATGGCGTACGGCCAGACAGACGGCTGCAAACACCAGTAAACTTCATAGGAAGAATATCGCCTATGAGCAAATGTATGAGTGAACCTCCGTATGTGAGTAGAACAGAAATGATGGCGCGAATCGCCCAGACAGAATCTCCATACTCAACCCGATCGGAACTGAAATCGAGTTGCTCGGATAACAGTTACAACAGGAACGACAGTCTCGATCACTCCTCAAACGTGAGGCCGGCTTCCGCGGAGTGTACATACGTTACCAAACAAGAAATACTATCACAAAAGGCAGCTTTACTGGCCAACAAAGAACCCATTTACGGTAGTAAGTTAGAGGAAAAGTTAGAAATCATAAAGCAAAGAAATCAAGCTAAAAAAGAAAAAATATATCAAACCAGACTCGAAACGAACGACAGTGACTCAATCAAAAGCAGAGAGCCGTTGTACGTTTCTAAGAGGGAATTAAAAGATAGTGTCATATATGAGTCTAACCAGGAAGCGAAAGAGGTACAGCCAGTCAGTCAGGAAGGCAGCGCTAGAAGCAGTCCTTACGAACTGAGTGACGCGAATCACCTGTCACGACGAGACTCGCTGTACCAAACGAAAGCAGAGGCATTGGACAGTGAAGTGGAAAAGTTCCAAGAAATCGATTTTTCAAAACTGAGATTAACAGAATCCAAAACCGATGCAGAAAAAGAAAAAACAAAGAATTTAGAGAACAATATATTAAAAGGCGAACCGCAGTACGCGCCAAGATTGCACGGAAAGACCGATCATATATCGAACGCACTGAAAGTAACCACATCCCCAACTCCCTACGATTCGACAAC
CTCAATGGAAACACATTACGCGTCCGAGTGCAGTATGAACTTCGAGAACAAGCCACAGAGCACGCCGTACACGTCACAAGATTTACAAGATAAATCGAAAGCTAACCGAACAGTGACTTTTTGTGAACAAATATTAGAAAAGAGTCCAGAGACTAGCGAGGATAAATCCGAAAACACGACCGTTGGAAACGAAACCACAGTCATCAGCAATGACAGTACAGTCATCAAGACTGAAGCGGGAGAGGCTGATGACTCCGCCAATAAGACGGAAGTGGAACCAGACGGTCCCCATACCACTTGGGGTATATTCGACAGTGAGGGCGGAGTTCTAGAGGACAGGCATTGGGGTGTCTCACTGATTATCCCGCCAAAAGCCATCGCACCGGGCATCAAGCAAAAGATCTATTTTACCGTGTCAGACCCTCGACTGAGCCAGCGGGTTGGTGGACCCCCCATCGACCTCGATAACGGTGAAGCGATGCTGTCCCCACTAGTGATGTGCGGTCCGCAGGGTCTGGTATTCTTAAAACCGGTGACGTTACGATTGCCGCACTGCGCTAACGCGGTCCCATCACTAGGACTCACAATCAAGGCGACAGACACGGAGGCACACTTGAGCACGGACTGGGATCAGATACATCTCCCAGCGACTACAACACTCAACACCGTCGCTGTCAAAGTAGATCATTTTTAA

Protein sequence:

>DPOGS207355-PA
MHNLSSDVMNSSVEADENFSPKTKKKLSKKPSFIKNAVLDLFRSKTKNSQKVSRQKSLCESDLQQRYAQQMPALRREKSDLSDLNTHMRNTHIRNQMLQRSNSSSCEKPNLKPILKRQSSFCDDRNGSICTVRDYETKPVSRNLIRQNSMCEMETEPKVTPLLRRQNSLIEYSRRGVYPPTPNFNMITKIDPIYQRQQQIKHEPFYPTQPLPPEPIYQSKEHIVYDPIPKPRRTYTSLYDTSSENPYASRSEISNQSFESPYGNRETVAMPLLDPPYATRAECMRESNYGTRNEMPENPYSLKQDAPRKPESLYSELPYVSKEQMLRQRMTPESPYATKEDMLRQRFSESPYSPKDEMIKQRMDGSLTCKEPIYGKRTYEPPPGTSWSETNTYGVRPDRRLQTPVNFIGRISPMSKCMSEPPYVSRTEMMARIAQTESPYSTRSELKSSCSDNSYNRNDSLDHSSNVRPASAECTYVTKQEILSQKAALLANKEPIYGSKLEEKLEIIKQRNQAKKEKIYQTRLETNDSDSIKSREPLYVSKRELKDSVIYESNQEAKEVQPVSQEGSARSSPYELSDANHLSRRDSLYQTKAEALDSEVEKFQEIDFSKLRLTESKTDAEKEKTKNLENNILKGEPQYAPRLHGKTDHISNALKVTTSPTPYDSTTSMETHYASECSMNFENKPQSTPYTSQDLQDKSKANRTVTFCEQILEKSPETSEDKSENTTVGNETTVISNDSTVIKTEAGEADDSANKTEVEPDGPHTTWGIFDSEGGVLEDRHWGVSLIIPPKAIAPGIKQKIYFTVSDPRLSQRVGGPPIDLDNGEAMLSPLVMCGPQGLVFLKPVTLRLPHCANAVPSLGLTIKATDTEAHLSTDWDQIHLPATTTLNTVAVKVDHF-