Monarch geneset OGS2.0

DPOGS204072
TranscriptDPOGS204072-TA1851 bp
ProteinDPOGS204072-PA616 aa
Genomic positionDPSCF300200 + 119602-127921
RNAseq coverage92x (Rank: top 62%)
Annotation
HeliconiusHMEL0131378e-8671.96% 
BombyxBGIBMGA010813-TA2e-8585.71% 
DrosophilaTsp3A-PB4e-11071.01% 
EBI UniRef50UniRef50_Q9W4X66e-10871.01%MIP05777p n=41 Tax=Pancrustacea RepID=Q9W4X6_DROME
NCBI RefSeqXP_966752.11e-13077.78%PREDICTED: similar to tetraspanin, putative [Tribolium castaneum]
NCBI nr blastpgi|910901882e-12977.78%PREDICTED: similar to tetraspanin, putative [Tribolium castaneum]
NCBI nr blastxgi|910901883e-13078.06%PREDICTED: similar to tetraspanin, putative [Tribolium castaneum]
Group
Gene OntologyGO:00160212.1e-47integral to membrane
KEGG pathway 
InterPro domain[15-260] IPR0184992.1e-47Tetraspanin
[18-41] IPR0003013.2e-31Tetraspanin, subgroup
[116-239] IPR0089522.2e-10Tetraspanin, EC2 domain
Orthology groupMCL11689 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204072-TA
ATGCATCGTCGGCGGGGTCCCAATTTTACATATGTCAGCGGTTGTGTGAAATATATGATTTTTGTTTTGAATTTTGTCTTCTGGTTGTTTGGAGGATTGCTTATTGGAGTGGGACTCTATGCATTCATAGACAAATGGCAGGCCATGGGTCAAGTTAAGCTAGAGACAGTATATGATGTACTTCTAAACATTAGTCTACTGATAGCTCTGTTGGGAGGTATTGTCTTCATTGTCAGTTTTGCCGGCTGTATTGGTGCTTTGAGAGAAAATACATGTCTATTGAAATTCTACTCGCTGTGTTTACTAGTGCTGTTCCTGTTTGAGATGGGATCGGCTGTGTGTGGTTTCGTGTTCCCGCGTTCGCTGCACGGCCTGTTCGAGTTGGGCTTCACAGACCGAGTCGTACACTCATATAGAGATGATCCCGACTTGCAGAACTTTATCGACTTTGCTCAGAAGGATTTCCACTGCTGCGGACTAACATCAGACGGTTACATGGACTGGTCGAAGAACGAATACTACAACTGTACTTCCCCGTCAGTGGAGCGTTGTGGGGTTCCGTTCTCTTGTTGCATCAACGCTACGGACATCAGCTCGGGCCTCGTGAACATCATGTGTGGTTATGGTGTGCAGAATTATCCCGTTGCCGAAGCCAGCAAACGTGTTTGGACGAGCGGTTGTATAGAGATAGTCCGTTCCTGGGCCGAGAGGAATCTCTATACAATAGCCTCAGTGGCTCTGGGTGTAGCGCTGTCACAGTTGTTTGTTATCTACCTGGCTAAGACTCTGGAGGGACAGATTGAACTGCAAAAGGCTAGGTGGGGATTTTGGAAATGTAGGGAGGGTCTAGTCCATAGATTGTTTGGAGGATTGCTTATCGGAGTGGGACTTTATGCATTCATAGACAAATGGCAGGCCATGGGTCAAGTTAAGCTAGAGACAGTATATGATGTACTTCTAAACATTAGTCTACTGATAGCTCTGTTGGGAGGTATTGTCTTCATTGTCAGTTTTGCCGGCTGTATTGGTGCTTTGAGAGAAAATACATGTCTATTGAAATTCTACTCGCTGTGTTTACTAGTGCTGTTCCTGTTTGAGATGGGTTCGGCTGTGTGTGGTTTCGTGTTCCCGCGTTCGCTGCACGGCCTGTTCGAGTTGGGCTTCACAGACCGAGTCGTACACTCATATAGAGATGATCCCGACTTGCAGAACTTTATCGACTTTGCTCAGAAGGATTTCCACTGCTGCGGACTAACATCTGACGGTTACATGGACTGGTCGAAGAACGAATACTACAACTGTACTTCCCCGTCAGTGGAGCGTTGTGGGGTTCCGTTCTCTTGTTGCATCAACGCTACGGACATCAGCTCGGGCCTCGTGAACATCATGTGTGGTTATGGTGTGCAGAATTATCCCGTTGCCGAAGCCAGCAAACGTGTTTGGACGAGCGGTTGTATAGAGATAGTCCGTTCCTGGGCCGAGAGGAATCTCTATACAATAGCCTCAGTGGCTCTGGGTGTAGCGCTGTCACAGTTGTTTGTTATCTACCTGGCTAAGACTCTGGAGGGACAGATTGAACTGCAAAAGGCTAGGTGGGTATTTTGGAAATGTAGGGAGGATGGCAAACATGAAATAGGGGCTCAGCGGGCGCGTACCGGGACGAGTGTCGCCGAGCGCTGCTGTAGCCCTTCCCCCACCACGCGCCCCACCACATGCATATATAATGCGGCGCGGGCGGCGGCTACCGTCAGTCGGTGTCGCGACATCGCTACAGGACTTGCTGCGACGCAGCATACAGTGCTGTGTCAGATCATAGTTCAGCCATATATGTGTGTTGTGCTGAGTTTGTAA

Protein sequence:

>DPOGS204072-PA
MHRRRGPNFTYVSGCVKYMIFVLNFVFWLFGGLLIGVGLYAFIDKWQAMGQVKLETVYDVLLNISLLIALLGGIVFIVSFAGCIGALRENTCLLKFYSLCLLVLFLFEMGSAVCGFVFPRSLHGLFELGFTDRVVHSYRDDPDLQNFIDFAQKDFHCCGLTSDGYMDWSKNEYYNCTSPSVERCGVPFSCCINATDISSGLVNIMCGYGVQNYPVAEASKRVWTSGCIEIVRSWAERNLYTIASVALGVALSQLFVIYLAKTLEGQIELQKARWGFWKCREGLVHRLFGGLLIGVGLYAFIDKWQAMGQVKLETVYDVLLNISLLIALLGGIVFIVSFAGCIGALRENTCLLKFYSLCLLVLFLFEMGSAVCGFVFPRSLHGLFELGFTDRVVHSYRDDPDLQNFIDFAQKDFHCCGLTSDGYMDWSKNEYYNCTSPSVERCGVPFSCCINATDISSGLVNIMCGYGVQNYPVAEASKRVWTSGCIEIVRSWAERNLYTIASVALGVALSQLFVIYLAKTLEGQIELQKARWVFWKCREDGKHEIGAQRARTGTSVAERCCSPSPTTRPTTCIYNAARAAATVSRCRDIATGLAATQHTVLCQIIVQPYMCVVLSL-