Monarch geneset OGS2.0

DPOGS203808
TranscriptDPOGS203808-TA1254 bp
ProteinDPOGS203808-PA417 aa
Genomic positionDPSCF300010 + 1967696-1969821
RNAseq coverage175x (Rank: top 50%)
Annotation
HeliconiusHMEL0133110.084.67% 
BombyxBGIBMGA003714-TA8e-17276.54% 
Drosophilaa-PB9e-2757.47% 
EBI UniRef50UniRef50_UPI00015B435A2e-3534.30%UPI00015B435A related cluster n=1 Tax=unknown RepID=UPI00015B435A
NCBI RefSeqXP_001608193.14e-3634.30%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565379907e-3534.30%PREDICTED: hypothetical protein LOC100124270 [Nasonia vitripennis]
NCBI nr blastxgi|1565379903e-3334.30%PREDICTED: hypothetical protein LOC100124270 [Nasonia vitripennis]
Group
Gene OntologyGO:00055152.4e-28protein binding
KEGG pathwaydre:3687231e-15 
 K06092 (INADL, PATJ)maps-> Tight junction
InterPro domain[295-413] IPR0014782.4e-28PDZ/DHR/GLGF
Orthology groupMCL24936 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203808-TA
ATGGGTGCCCCAGCATGGGGGTCGCGTGGTGGCGATCGTCCCCGCCTGCTTCGGATGACGCCACTGCGACATAGCTTTGCCGCGCCAGCCACGCCGCCGGCGCAGCAACAACGGACACATGACTACGCATCTGATGCTGATAGTAACAAAAAAGAAGAAAGCTGGAATGCCAACTTGTCGCAGAGATGGAGAAAGTTACGACGTAGGTGTTCCCGACTACGGCCTGGATCTGGCAATCGTGAAGCAAGTCCCGTACGATGTTCTCCATCACCACCGCGACCTCAACCACAATGTTCACCTCCACCCCCATCAAAACTCTCATTTAGACATCGTGGTAAAGTTTACACAACCGCTTCTTTACGTGTCACCAGCGGAGCACCGGACTTGCTCAGAGCGCTGGGGAAATTAGGAGGAGGATTAAGAAGACGTGCTCTTTCCGCCCATGACGTGCTTACACCACCTCAACCGCAACAACCAGCAACTTTTTATGTTCCTAGTCCAAGTACAACTCGGTCTCCCTCGTCACCAATGTCACACAGAGAGGCTACAGTTCGCAGACGATGTAGCTCCCCTAATGTGTATAGACCTAAAGATGTACCTAGAGATCGAGCACCACTTATATCTCAAGAAGATGAAAGTCGAGACGTTGTTGATTACAACTACACGCCTGAAAGAAGGAAACCGCACAAAGAGATAAGAAGTCGACCATATAGCGAAAACGTAGAAATAGATCCTAAATTCAAAAATGGCTATAATCGTCTCACCCCAGAACCTACTGAGAGACTATGGGAAGAGCCTTATCGACTACCCCGTGTACAGGTTCGTCAAGAACCTATTCAACAATTAAGAAATTGTGTAGCTGAGTTAAGAGTTAGCACTACTCCACGGGCTCCACGCCCTCCTCCGGCTCCTACATCCCCGCAGAAAATAATTAAACGACCGCCAGCTCCTGAACCCCGAGATTCAAATACCTTTGAGGTGCGTTTCACCAAATCAGCTGGAGGTAAAGGACTCGGGTTTAGCATCGTAGGCGGACGTGATTCACCTCGTGGGGACATGGGCATATTTGTAAAAACTATATTTAATAACGGCCAAGCCGCTGAGTCTGGACTACTTCGAGAAGGGGACGAGGTTCTTTCAGTTAACGGTCGTGGTACGGCTGGTTTAACTCACAGCGAAGCTATAAGGCTGTTTAAGGACGTGCGTGCGGGACCGGTGTTGCTAAAAGTAACAAGACGTGCTCCAACTCGCTGA

Protein sequence:

>DPOGS203808-PA
MGAPAWGSRGGDRPRLLRMTPLRHSFAAPATPPAQQQRTHDYASDADSNKKEESWNANLSQRWRKLRRRCSRLRPGSGNREASPVRCSPSPPRPQPQCSPPPPSKLSFRHRGKVYTTASLRVTSGAPDLLRALGKLGGGLRRRALSAHDVLTPPQPQQPATFYVPSPSTTRSPSSPMSHREATVRRRCSSPNVYRPKDVPRDRAPLISQEDESRDVVDYNYTPERRKPHKEIRSRPYSENVEIDPKFKNGYNRLTPEPTERLWEEPYRLPRVQVRQEPIQQLRNCVAELRVSTTPRAPRPPPAPTSPQKIIKRPPAPEPRDSNTFEVRFTKSAGGKGLGFSIVGGRDSPRGDMGIFVKTIFNNGQAAESGLLREGDEVLSVNGRGTAGLTHSEAIRLFKDVRAGPVLLKVTRRAPTR-