Monarch geneset OGS2.0

DPOGS205077
TranscriptDPOGS205077-TA1896 bp
ProteinDPOGS205077-PA631 aa
Genomic positionDPSCF300074 + 65799-69811
RNAseq coverage105x (Rank: top 60%)
Annotation
HeliconiusHMEL0121220.064.58% 
BombyxBGIBMGA006873-TA0.052.56% 
DrosophilaCG34114-PB6e-2323.41% 
EBI UniRef50UniRef50_E0VMK92e-2527.64%Sidestep protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VMK9_PEDHC
NCBI RefSeqXP_002427353.14e-2627.64%sidestep protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420133099e-2527.64%sidestep protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420107711e-2625.80%sidestep protein, putative [Pediculus humanus corporis]
Group
KEGG pathwaymdo:1000157271e-10 
 K06467 (CD22, SIGLEC2)maps-> Cell adhesion molecules (CAMs)
    B cell receptor signaling pathway
    Hematopoietic cell lineage
InterPro domain[82-137] IPR0137831.1e-07Immunoglobulin-like fold
Orthology groupMCL25285 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205077-TA
ATGGTAGTGCCGCCGCAGCTGGTGTCCGTGAGGGCGCTGGGTGTGGAGGGTGCGATAGAATCGTCTCAGCGGCTGGTTGTCAGTGAGATCGAATTAAGTGTATCGCACAAGCACGACGAGTCGCAAGTTACTTGCTGCGCGCCCGCACACCGCCGCGCGGACGAACAGTACGTGTGCGCACCCAGCCTGCCCATTACTGTGCTCTATCCGCCCGTCCTTGAAATAATGACAGAGGAAGTTCTAATAAACAATACGCTGTCCGTCGTGAAAGGATCCAACGTGACACTCAACTGTAGTTACCAAGCGAACCCCGCTGTGTACCAACTCATATGGTTCCATGAGGAAGATCTCCTGAACTCCGAGGAGTCGGTCGCTCCATCCCTGGTGGTCCACGAGGCTGGTGAATACGTGTGCGCCGCCACTAACGACCAGGGCTCCGCTTACAGCGACCCCGTCTTCATCGACGTTATATATCCTCCATACTGCGAGGACGAAACTATCGTGGAGTATGGTATCGGGGACAACGATTGTCTGAACCTCACCTGCAAAGTCAAAGGCAACCCTGAACCGACAGCGTATCGCTGGCTCTTGATCAGTGAAATTAATGGAATCAAACTCAGGAACAACCACACGCTGAGCTTGGAGACGCAGGACGCCACTTTGCAATACCAGAGACCGAATGGAACTAACACTCTTATATATTGTTGGGGTTTAAATGGAGTTATCAACAACGAGCTGGAACACAAACGCTGCTCGTTCATGGTAACCGACGAGACTGTGCCGCGGCCTCCTGCTAATTGTGTAGCAGAAAAGAATATTATGAAGGAGATCACAGTCGTCTGCGAGGAAGGACACGATGGAGGCTTACAGCAGAAATTCAAGTTCACAGTAAACGACTTGGACACCGATGATCAACTAGTGTCCATTATCAACCAAGAACCGAAATTTATGATCCAAGAACCGAAAAAAGAAAACTATAAATTCGTGATCAATGCCTTCAACGAGAAAGGTGACAGTGAGACCGTGGAAATAGACAAGGATAGTATTGTGGATGAGAGCGCAGGCTCTTTGGAAACAATAAGCGCGGTGACAAACATAACGACTTTAGCTCTGTCATTATGCGGGGGCGTAGCGTTACTGGCACTGGCGGCGTGCGGCCTCGTGCTTTGTGCACACGAGCGACCGCCGCATAACAAGGATACACTCTGTGCCTACACTGACGATACCAACTGTGAAACCTTCCATGACAGTGAAGATGACAGCGAATGTAACGTGCGACGGACGGAGTCGTTCCGACGCGCCATGACGAAATATCCTATGAAAAATTACGACGTTAGAAGGACAAGTTCCTTCCACTCCGCACGTTATATACACGACATGCAAGAACACGATAGCCCCAAGTGTAATGACTTCGCAAAGCGCAGTGCCAGCTGCAGGGTTCACTCATTACAAAATATCAGCAGGAAGAGAGACGCGGACATACTTTGTGACCATTTGGTGATGCATCTTCCGCCAGAAACGGGCTACAATGTGCCTAAACCGATGAACACATTTTATACAATGCCAAGAAAAATGCGCCACAAAGCCAAAGAAATAAGCGATGAGACCTCGGAGATAACGCAAACATCAGACGGATTCTCTTTACCGCCGCCGCCAGACGAATTTGGATCTTATAGGGCGGGGACGAGGATTCGAGACGTACCAACAAAATCTACTCCCTCCTACACGACGATTATTAGGCACGAGCCTAATAAAGATTCTGTAAAGTATAGTAACGTTATAGTATCTCCTATGAATACAGTAGGACTACCCACCGTCAGTGGTGCCCACAATAGCGTGTACTCCTACCCCGAAGACGATCAGGTAACGACGAATCCCTTTGATGAATCGCCTTAG

Protein sequence:

>DPOGS205077-PA
MVVPPQLVSVRALGVEGAIESSQRLVVSEIELSVSHKHDESQVTCCAPAHRRADEQYVCAPSLPITVLYPPVLEIMTEEVLINNTLSVVKGSNVTLNCSYQANPAVYQLIWFHEEDLLNSEESVAPSLVVHEAGEYVCAATNDQGSAYSDPVFIDVIYPPYCEDETIVEYGIGDNDCLNLTCKVKGNPEPTAYRWLLISEINGIKLRNNHTLSLETQDATLQYQRPNGTNTLIYCWGLNGVINNELEHKRCSFMVTDETVPRPPANCVAEKNIMKEITVVCEEGHDGGLQQKFKFTVNDLDTDDQLVSIINQEPKFMIQEPKKENYKFVINAFNEKGDSETVEIDKDSIVDESAGSLETISAVTNITTLALSLCGGVALLALAACGLVLCAHERPPHNKDTLCAYTDDTNCETFHDSEDDSECNVRRTESFRRAMTKYPMKNYDVRRTSSFHSARYIHDMQEHDSPKCNDFAKRSASCRVHSLQNISRKRDADILCDHLVMHLPPETGYNVPKPMNTFYTMPRKMRHKAKEISDETSEITQTSDGFSLPPPPDEFGSYRAGTRIRDVPTKSTPSYTTIIRHEPNKDSVKYSNVIVSPMNTVGLPTVSGAHNSVYSYPEDDQVTTNPFDESP-