Monarch geneset OGS2.0

DPOGS204222
Transcript: DPOGS204222-TA  1341 bp
Protein: DPOGS204222-PA  446 aa
Genomic position: DPSCF300046 - 760210-765516
RNAseq coverage: 1083x (Rank: top 12%)
Annotation
Heliconius: HMEL015138  0.0  84.30%
Bombyx: BGIBMGA007500-TA  6e-163  68.70%
Drosophila: TMS1-PA  4e-125  48.84%
EBI UniRef50: UniRef50_Q178Z7  3e-147  57.54%  Membrane protein tms1d n=6 Tax=Neoptera RepID=Q178Z7_AEDAE
NCBI RefSeq: NP_001037624.1  0.0  75.55%  membrane protein TMS1 [Bombyx mori]
NCBI nr blastp: gi|112983356  0.0  75.55%  membrane protein TMS1 precursor [Bombyx mori]
NCBI nr blastx: gi|112983356  0.0  75.55%  membrane protein TMS1 precursor [Bombyx mori]
Group
Gene Ontology: GO:0016020  5.9e-228  membrane
KEGG pathway 
InterPro domain: [2-446] IPR005016  5.9e-228  TMS membrane protein/tumour differentially expressed protein
Orthology group: MCL11666  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204222-TA
ATGGGAGCCGTGTTAGGAATTTGTTCAGCAGCACAGCTGGCCTGCTGCTGTGGCAGTGCTGCGTGCTCATTGTGCTGCTCAGCATGTCCATCATGTGCCAATTCCACATCCACCCGACTGATGTACACTGTAATGTTGCTACTTGTCATGGTAGTGGCCTGCATCACTCTTGCTCCAGGATTACATGAACAGATGAAAAAGGTACCCTTTTGTGAAAATTCAACATCAATGGTGCCAGGCACCTTCAAAGTTGACTGTGATAATGCTGTTGGTTATTTGGCTGTTTATAGAATCTGCTTCGCAACATGTCTCTTCTTTATACTGATGGCGTTACTCATGATTGGAGTGAGATCTTCCAAAGATCCTAGAGCTGGTATTCAAAATGGGTTCTGGGGTATAAAATATCTTATTGTTATTGGCGGTATCATAGGAGCTTTCTTCATACCAGAAGGATCATTTGCATCAACCTGGATGGTGTTTGGAATGATTGGCGGCTTCTGCTTCATTGTTATACAGCTTATTTTGATCATTGACTTTGCTCACTCTTGGGCCGAAAAATGGGTTTCCAATTACGAAGAAACACAATCTCGAGGTTGGTACTCGGCGTTGCTGTTGGCCATGTTGTCATGCTATGCGCTTACATTGACTGGCATTGTTTTACTTTATGTCTTCTATACTAAGCCAGATGGATGTGACCTATCAAAGTTCATCATCTCCTTCAATCTGATCCTGGTTGTGGTAGCAAGCGCTATATCAATCCTGCCATCAGTCCAGGAATACCAACCCAGGTCTGGTCTGCTACAGTCAGCGGTTGTCTCATTGTATGTGATGTACCTGACTTGGTCAGCCCTGTCCAACTCCGCTGCCCCATGTAATGCCAGTATCACTGACGAAAATGAGTCGTCATTCGACAAGCAGTCTATCATTGGCCTGGTGATTTGGGTGTGCAGCGTCCTATACTCGTGCGTAAGGACAGCATCTTCCTCTTCCAAGATTACAATGTCTGAGCACATTCTGGCCAAGGATGGAGCTACAGGCGAGGGAGGGCTGATTGCTAATGAAGAAGGAGATGGCGGCGAGGCCGGCGCCAAGGAGACCAAGGTGTATGACAACGAAGATGACGCCGTCGCCTACTCCTGGAGCTTCTTCCACGTCGTCTTCGCTCTAGCCACCCTGTATATCATGATGACACTCACCAACTGGTACAACCCGAGTTCTCAGCTGTCCAAGTCTAATGTGGCGTCCATGTGGATCAAGATTACATCATCTTGGTTGTGCATCGGTCTATACATTTGGACCCTCGTCGCACCAGCAGTTCTGCCTGATCGCGACTTTAGTTAA

Protein sequence:

>DPOGS204222-PA
MGAVLGICSAAQLACCCGSAACSLCCSACPSCANSTSTRLMYTVMLLLVMVVACITLAPGLHEQMKKVPFCENSTSMVPGTFKVDCDNAVGYLAVYRICFATCLFFILMALLMIGVRSSKDPRAGIQNGFWGIKYLIVIGGIIGAFFIPEGSFASTWMVFGMIGGFCFIVIQLILIIDFAHSWAEKWVSNYEETQSRGWYSALLLAMLSCYALTLTGIVLLYVFYTKPDGCDLSKFIISFNLILVVVASAISILPSVQEYQPRSGLLQSAVVSLYVMYLTWSALSNSAAPCNASITDENESSFDKQSIIGLVIWVCSVLYSCVRTASSSSKITMSEHILAKDGATGEGGLIANEEGDGGEAGAKETKVYDNEDDAVAYSWSFFHVVFALATLYIMMTLTNWYNPSSQLSKSNVASMWIKITSSWLCIGLYIWTLVAPAVLPDRDFS-
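
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a plain coding sequence (no UTRs) translated with the standard genetic code, which the record does not state explicitly. The helper names `translate` and `lengths_consistent` are hypothetical and not part of any database tooling. The sketch writes the stop codon as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: the record uses the standard code; it does not say so explicitly.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript encodes exactly protein_aa residues plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First six codons of DPOGS204222-TA
print(translate("ATGGGAGCCGTGTTAGGA"))   # MGAVLG
# Last four codons of DPOGS204222-TA, ending in the TAA stop codon
print(translate("GACTTTAGTTAA"))         # DFS*
# Lengths stated in the record: 1341 bp and 446 aa
print(lengths_consistent(1341, 446))     # True
```

Both translated fragments match the start (`MGAVLG`) and end (`DFS`) of the protein sequence listed above, and 1341 bp corresponds to 446 codons plus one stop codon.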