Monarch geneset OGS2.0

DPOGS214704
Transcript         DPOGS214704-TA   2211 bp
Protein            DPOGS214704-PA   736 aa
Genomic position   DPSCF300022 - 1212757-1231373
RNAseq coverage    41x (Rank: top 72%)
Annotation
Heliconius       HMEL011587         2e-170   50.56%
Bombyx           BGIBMGA004752-TA   0.0      58.19%
Drosophila       CG14372-PC         0.0      52.73%
EBI UniRef50     UniRef50_B4R186    0.0      52.89%   GD18910 n=10 Tax=Endopterygota RepID=B4R186_DROSI
NCBI RefSeq      XP_001999129.1     0.0      54.50%   GI24340 [Drosophila mojavensis]
NCBI nr blastp   gi|195109102       0.0      54.50%   GI24340 [Drosophila mojavensis]
NCBI nr blastx   gi|195109102       0.0      54.14%   GI24340 [Drosophila mojavensis]
Group
KEGG pathway      cfa:476485   3e-18
                  K06467 (CD22, SIGLEC2) maps-> Cell adhesion molecules (CAMs)
                                                B cell receptor signaling pathway
                                                Hematopoietic cell lineage
InterPro domain   [192-291]   IPR013783   4.7e-21   Immunoglobulin-like fold
                  [201-276]   IPR013162   4.6e-12   CD80-like, immunoglobulin C2-set
                  [299-383]   IPR013098   9.1e-11   Immunoglobulin I-set
                  [300-384]   IPR003599   5.9e-09   Immunoglobulin subtype
                  [475-578]   IPR008957   8.2e-07   Fibronectin type III domain
Orthology group   MCL10896    Insect specific

Nucleotide sequence:

>DPOGS214704-TA
ATGGTGCTGTGGTTCAAGGAAGCTGACTGGGAACCATTTTACAGTTACGATGTTCGTGGACGTAGCGTCAACCAGCCCAAGCTCTGGTCGTCTCCAACGGGTTTCGGTTCACGAGCTTACTTCCGAGCCACAGCGACCCCGGCCATTCTTCTTGTAGATAATGTCGGTACTGCTGACAGTGGAGTGTACCGATGTCGAGTGGACTTCAAAAACTCACCGACTAGAAATTTGAGAATTAATTTCACGGTTATTACTCCTCCGAATCGGCCAATTATAATGGATGCAAGGACAAGGGACCACACTAGGCTTTTGGAACCCTATAACGAGGGAGACACATTGGAGTTGTTATGTGAAGTTTATGGAGGTGATCCAAAGCCATCTTTGATATGGTATTTAGAAAACACAATAATTGATGAATCGTTTGAACAAAAATCTGACGGAAAAACGATAAATACTCTTACTTTTCCAAGTATTGGTCGGCAACATCTCAACTCGAGACTTATTTGCCAAGCGTCGAACACAAACCTGACTCCTCCGCAATCGAAACTACTTATATTGGATATAAATCTAAGACCATTGACAGTGCAGATTCTCAATAAAAACCGACACCTCTCAGCTGATAGGAGTTACGAGGTGGAGTGTAGAACGATTGGTTCCCGTCCAGAGGCTCAAATAACTTGGTGGAAAGAAAAGAAGCCAATGAGAGGCAAGGCCAGAAATTATTCCGATACGAACACAACTACAAGTGTTCTAGTATTTACACCTGAAGCAGAACATCATGATAGTCAGCTAACATGTCGGGCTGAAAATACGAGACTTGAAAATTCAGCCATAGAGGATACATGGAAGTTAAATGTACATTATGTTCCTGTTATAACTCTGAAAATGGGATCTAATCTAAACCCTAGATATATAAAAGAAGGTGACGACATCTACTTTGAATGCAGCGTTCAATCAAATCCTAAAGTAACGAAATTATCTTGGTTCAAAGATTCTCTCAAAATCCAGCAAAATCCTAGTTCGGGTATAATTTTAAGCGACCAAAGTCTCGTCCTGCAGCGCGTCAATCGTAACGCATCTGGTGACTACATCTGCTCCGCACAGAACAGTGAAGGCAGTGCATCAAGTAATCCTGTTTCTTTGCAAGTTAGATATAGCCCAGTTTGCAAGTCAGACGAGGAACAAGTGTTTGGAGCGTCGGTACTCGAGCCGATCGAGTTATCTTGCGTCGTCGATTCTAGTCCACAGCCGACGAGCTTTGAGTGGATATTCAATAGAGATGGTGACCGTAGCGAGTTGCCACCAAGCTTATATAACGTGTCTGGTCACAAATCAGTACTTCGATATATTCCTACTGAAGATAAAGACTTTGGAACATTATCTTGCCTGGCCACTAATTCAATTGGGAGGCAGGAAATTCCGTGCGTGTATAGTATAATTGCTGCAGGACGACCGTCATCTTTGAAAAATTGTAGCATAGTCAACGAAAGCGTCGATAGCATTTTGGTGGATTGTATTGAAGGATTTGACGGAGGAATAAGTCAAGTGTTCACTATGGAAGTACTGGAACTACCTTCGTATACTATGCGAGCGAACATAACATCAAATTCAACACCCAATTTTGAGGTCACAGGTTTAAACCGAGCTTTAAGTTACGCAGTTAATCTTTATGCGAGTAATCCTAAGGGCCGTAGTGAGATAGTTACTGTGTATTCAGTGGCTTTACGCTCTCCGGATAAGTACACAGGTGTGGGCAGTACTTTCAGCTTATCTCCGCTAATAGCATCGCTCCTCACAGTCATAGTGTTGCTTAGCGCCGCGACTTGTGCAATCATATTCGCTGTGTACCGCAGACATTTCTTACAACGGCATGTTAAACAACCCATAAACCCGCATTACTTGAACGACAGTATAGAATCATTGCCAAAAAATGTTTTGTCAACGTACTCGTCTTCCCCGAAAGTAGACTTCAATACGCAATACGAACTCAAGATCAATAG
CGAAGTGGAAGACGACCCTGATATCATCGCCGTACATTATGATAAGAAGACACTAGACGATTATTGTAAGTCCAAGGTTGGAGAAGGTGACACGGCCAAAGTGTTCAATGACAATGACACCAGTGTACCGAATACGGGGAACATATCATTTGTGAACCGAGGTGTAACGGCGCGCGTCTCGGATCTACGAGTGAGGGAAAGTTGCATATAG

Protein sequence:

>DPOGS214704-PA
MVLWFKEADWEPFYSYDVRGRSVNQPKLWSSPTGFGSRAYFRATATPAILLVDNVGTADSGVYRCRVDFKNSPTRNLRINFTVITPPNRPIIMDARTRDHTRLLEPYNEGDTLELLCEVYGGDPKPSLIWYLENTIIDESFEQKSDGKTINTLTFPSIGRQHLNSRLICQASNTNLTPPQSKLLILDINLRPLTVQILNKNRHLSADRSYEVECRTIGSRPEAQITWWKEKKPMRGKARNYSDTNTTTSVLVFTPEAEHHDSQLTCRAENTRLENSAIEDTWKLNVHYVPVITLKMGSNLNPRYIKEGDDIYFECSVQSNPKVTKLSWFKDSLKIQQNPSSGIILSDQSLVLQRVNRNASGDYICSAQNSEGSASSNPVSLQVRYSPVCKSDEEQVFGASVLEPIELSCVVDSSPQPTSFEWIFNRDGDRSELPPSLYNVSGHKSVLRYIPTEDKDFGTLSCLATNSIGRQEIPCVYSIIAAGRPSSLKNCSIVNESVDSILVDCIEGFDGGISQVFTMEVLELPSYTMRANITSNSTPNFEVTGLNRALSYAVNLYASNPKGRSEIVTVYSVALRSPDKYTGVGSTFSLSPLIASLLTVIVLLSAATCAIIFAVYRRHFLQRHVKQPINPHYLNDSIESLPKNVLSTYSSSPKVDFNTQYELKINSEVEDDPDIIAVHYDKKTLDDYCKSKVGEGDTAKVFNDNDTSVPNTGNISFVNRGVTARVSDLRVRESCI-
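
The transcript and protein lengths in this record are consistent with each other: 2211 bp is 737 codons, that is, 736 residues plus one stop codon, which the protein sequence writes as a trailing "-". The following is a minimal sketch of how that check, and the translation itself, could be reproduced. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1; the function names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tooling.

```python
# Standard genetic code (NCBI translation table 1), built from the usual
# T, C, A, G codon ordering. Stop codons are written as "-" to match the
# convention used by the protein sequence in this record.
BASES = "TCAG"
AMINO = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Whitespace is removed first, so a FASTA body wrapped over several
    lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS of cds_bp bases that ends in one stop codon."""
    codons, remainder = divmod(cds_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    # First 30 and last 21 bases of DPOGS214704-TA, copied from the record.
    print(translate("ATGGTGCTGTGGTTCAAGGAAGCTGACTGG"))
    print(translate("GTGAGGGAAAGTTGCATATAG"))
    print(expected_protein_length(2211))
```

Run on the two fragments shown, the sketch yields MVLWFKEADW and VRESCI-, which match the start and end of DPOGS214704-PA, and the length check returns 736. Passing the full nucleotide sequence (both wrapped lines) to `translate` is the natural next step for a complete comparison.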