Monarch geneset OGS2.0

DPOGS210974
Transcript: DPOGS210974-TA (1092 bp)
Protein: DPOGS210974-PA (363 aa)
Genomic position: DPSCF300004, 259691-260782
RNAseq coverage: 168x (Rank: top 51%)
Annotation
Heliconius: HMEL025025, E-value 1e-104, identity 93.88%
Bombyx: BGIBMGA006403-TA, E-value 0.0, identity 87.05%
Drosophila: Sur-8-PA, E-value 1e-158, identity 73.28%
EBI UniRef50: UniRef50_Q9UQ13, E-value 3e-141, identity 68.32%, Leucine-rich repeat protein SHOC-2 n=90 Tax=Eumetazoa RepID=SHOC2_HUMAN
NCBI RefSeq: XP_396017.2, E-value 3e-172, identity 81.82%, PREDICTED: similar to Sur-8 CG5407-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|383865861, E-value 2e-171, identity 82.09%, PREDICTED: leucine-rich repeat protein soc-2 homolog [Megachile rotundata]
NCBI nr blastx: gi|91091582, E-value 5e-173, identity 82.64%, PREDICTED: similar to shoc2 [Tribolium castaneum]
Group
KEGG pathway: ana:alr0124, E-value 5e-39
  K13730 (inlA) maps to: Bacterial invasion of epithelial cells
Orthology group: MCL11889, single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210974-TA
ATGTTGAGCTTAAGAGAAAATAAAATAAAGGAACTCTCTTCTGGTATTGGAAAACTTGTAAATTTGGTGACTTTTGATGTTTCCCATAATCATTTAGAGCATTTACCACAAGAGATTGGTAATTGTGTTAATTTATCTACATTAGATTTGCAGCACAATGAGTTGTTAGACATTCCAGATACAATTGGCAACTTACAGGCACTAAATCGTATTGGGTTGAGATATAATAGACTCAATGCTATACCAGCTTCTTTAAGTAACTGTAAACATATGGATGAATTTAATGTAGAGGGGAACTCAATATCTCAACTTCCAGATGGACTACTGTGTAGTCTGACTGAGCTTACAAGTCTTACATTGTCCCGAAATTCATTTATGAGTTATCCTAGCGGTGGGCCAGCACAATTCACTAGTGTTTCATCCATAAATCTAGAACACAATCAAATAGATAAAATACCCTATGGAATATTTTCGAGGGCAAAAAATTTAACCAAATTGATAATGAAAGAAAATTTGTTGACATCCCTTCCTTTGGACATTGGAACATGGACAAACATGGTGGAACTTAATTTGGGTACAAACCAACTGGTAAAACTACCTGATGACATACAAAGCTTAATAAACTTAGAAGTGTTGATTCTATCAAACAATCTTCTTAAGAGGATCCCACCAAGCATTGGCAACTTGAGAAAGCTAAGGGTATTGGATTTGGAAGAAAACAAGATTGAAATCCTTCCCAATGAAATAGGATTTCTACAGGAATTGAAAAAACTGATTGTTCAATCTAATCAATTGACTTCATTGCCGCGCTCAATTGGACATCTGATAAATCTCACATATCTTAGTGTTGGTGAAAATAATTTGCAATATTTACCTGAGGAGATAGGCACCCTTGAGAATTTAGAATCACTGTACCTCAATGATAATCCTAATTTATGCAATCTGCCATTTGAGCTAGCTCTATGTGTTAGCTTGCAAATTATGAGTATTGAGAATTGTCCTTTGACCTCTCTACCACCAGATGTAGTATCTTCAGGGCCATCTTTGGTAATCCAATATCTCAAATCACAAGGGCCTTATAGATCCATGTGA

Protein sequence:

>DPOGS210974-PA
MLSLRENKIKELSSGIGKLVNLVTFDVSHNHLEHLPQEIGNCVNLSTLDLQHNELLDIPDTIGNLQALNRIGLRYNRLNAIPASLSNCKHMDEFNVEGNSISQLPDGLLCSLTELTSLTLSRNSFMSYPSGGPAQFTSVSSINLEHNQIDKIPYGIFSRAKNLTKLIMKENLLTSLPLDIGTWTNMVELNLGTNQLVKLPDDIQSLINLEVLILSNNLLKRIPPSIGNLRKLRVLDLEENKIEILPNEIGFLQELKKLIVQSNQLTSLPRSIGHLINLTYLSVGENNLQYLPEEIGTLENLESLYLNDNPNLCNLPFELALCVSLQIMSIENCPLTSLPPDVVSSGPSLVIQYLKSQGPYRSM-
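
Consistency check (illustrative sketch, not part of the original record): the 1092 bp transcript and the 363 aa protein listed for this gene agree if the transcript is a complete coding sequence whose final codon is a stop. The nucleotide sequence ends in TGA and the protein sequence ends in "-", which appears to mark that stop. The Python below assumes the standard genetic code. The function names and the five-entry codon table are hypothetical helpers for this example only.

```python
# Partial standard codon table: only the codons needed for the first
# five codons of DPOGS210974-TA (an assumption made for brevity).
PARTIAL_CODON_TABLE = {"ATG": "M", "TTG": "L", "AGC": "S", "TTA": "L", "AGA": "R"}


def expected_protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


def translate_prefix(nucleotides: str) -> str:
    """Translate a short in-frame prefix using the partial table above."""
    codons = [nucleotides[i:i + 3] for i in range(0, len(nucleotides) - 2, 3)]
    return "".join(PARTIAL_CODON_TABLE[codon] for codon in codons)


print(expected_protein_length(1092))        # 363
print(translate_prefix("ATGTTGAGCTTAAGA"))  # MLSLR
```

The first five codons of the transcript (ATG TTG AGC TTA AGA) translate to MLSLR, which matches the start of DPOGS210974-PA.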