Monarch geneset OGS2.0

DPOGS207354
TranscriptDPOGS207354-TA1380 bp
ProteinDPOGS207354-PA459 aa
Genomic positionDPSCF300188 + 340804-345582
RNAseq coverage1126x (Rank: top 11%)
Annotation
HeliconiusHMEL0088583e-11665.78% 
BombyxBGIBMGA010280-TA8e-9353.01% 
Drosophila% 
EBI UniRef50UniRef50_B0WIW09e-2132.23%Tight junction protein n=2 Tax=Eumetazoa RepID=B0WIW0_CULQU
NCBI RefSeqXP_001848644.12e-2132.23%tight junction protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700418133e-2032.23%tight junction protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700418132e-4231.91%tight junction protein [Culex quinquefasciatus]
Group
KEGG pathway 
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207354-TA
ATGTCATCGAGCTCAGAGAATAGATTGTCGTACGCGTCTAGTCCTGAGAGCGACTTGGAAGTAAGTCCACCGCCCGCTCCTAGACTGGTGAAGTCTTCCTCGGACCCATCGATAGCGACCACCCAAGATAACATGGACCGCGACGACGACATCAACCTCATGGCTGAAGCTGTACCTCCTCCGTATTCGCAAGGTGGTTATGATAGCAAGTATGGGTTCGCTAACGGGAACAGTAATGGCACTCAGAGCCTGCCGACCGAGGCCCCGCCTTACCAGATGCAGTCCTCGCCCCCGAACTCACCTTTGTATGGAACAGTGCCCGAACTTCCTCCCCGAGGTCCGCCGCCGGGTGGAACCCCCTCCAGACCGACGGGTGGAGTACTACTGCCCGCACCTCCCCCCACGAGACACGGACTGCGACACACTAATAATCCTCGCCCGTCGGCCCAGGAGCGTCTGTTTGGGTCCAAGGAAACCGACGACACGTACAGCGCCCGGCCCCAGCAAGGATCGCTGGACAGGCACAGACATACTAACAACCCGCAGGGTTCATACGACGGTACATATGACTACAGCACCAACGCTCACAATCCATCTCGACTCCCGCCGAACGCGCCGGACGACCTTAAAGTTGCCCCCACCATTAAAATGGAACCTCAGCCGTCGGCGAATCCGACGAATATGGCAACACAGAGTCCACAAAGAAACTCAAATTCCCACGAACACAACTCGTTGGACTATAACAGGACACCGGAGAACAACTACAGACCCGACAACTACCGCACCCCACAAGTCCGACCGCCCATGAACGGAAATAGCCCGCACGCGCAGAACGCCCATAACGCCCACAACAACCATAACGCTCATAACGCCCAAGTCCAGAACGCTCACAACCCTCACAATGCCCACAACGCTCACAACAGTCAAATCGTGAACACGCCCATGCACGCCCGAGGACCCTCCTTACCAAATGTACCGACGAACGACCACGCGAAATACAGCGCACGCACGAACTCAGCGTCGCAGGCGGACTACGGCGGACGAGCGCCGCCCTACAAACCCGTACCGCCTCCGAAACCCAAACACTACCGACCTCCCGAACAACCGCTCCAACAACAGCACCCGCAACAACACTCGCAACAACACCAACAACAACACCCACAACAAATCCCTCAACATCTCCCTCCGCATCCACGGAATGGAAGTATGGACGTGTCTGTTCCCGGCACGCACTACTCCCACTCCCACTCCCACTCGCAGCCGCACCGCGCGCCCCACGGACACGGCTACCACCAGATGTCATCACATATGAGGCCATCTCGTGTTCATACAGGACATCATAGTCTCAGAGCTAAATACAATATTACCTTCCAGATTTAG

Protein sequence:

>DPOGS207354-PA
MSSSSENRLSYASSPESDLEVSPPPAPRLVKSSSDPSIATTQDNMDRDDDINLMAEAVPPPYSQGGYDSKYGFANGNSNGTQSLPTEAPPYQMQSSPPNSPLYGTVPELPPRGPPPGGTPSRPTGGVLLPAPPPTRHGLRHTNNPRPSAQERLFGSKETDDTYSARPQQGSLDRHRHTNNPQGSYDGTYDYSTNAHNPSRLPPNAPDDLKVAPTIKMEPQPSANPTNMATQSPQRNSNSHEHNSLDYNRTPENNYRPDNYRTPQVRPPMNGNSPHAQNAHNAHNNHNAHNAQVQNAHNPHNAHNAHNSQIVNTPMHARGPSLPNVPTNDHAKYSARTNSASQADYGGRAPPYKPVPPPKPKHYRPPEQPLQQQHPQQHSQQHQQQHPQQIPQHLPPHPRNGSMDVSVPGTHYSHSHSHSQPHRAPHGHGYHQMSSHMRPSRVHTGHHSLRAKYNITFQI-