Monarch geneset OGS2.0

DPOGS216214
Transcript        DPOGS216214-TA  2553 bp
Protein           DPOGS216214-PA  850 aa
Genomic position  DPSCF300368 + 344-29974
RNAseq coverage   81x (Rank: top 64%)
Annotation
Heliconius      HMEL004362              0.0     70.86%
Bombyx          BGIBMGA004752-TA        7e-103  35.64%
Drosophila      CG12484-PB              2e-125  40.70%
EBI UniRef50    UniRef50_UPI00017584CF  5e-138  40.35%  UPI00017584CF related cluster n=1 Tax=unknown RepID=UPI00017584CF
NCBI RefSeq     XP_974285.2             1e-138  40.35%  PREDICTED: similar to AGAP002104-PA [Tribolium castaneum]
NCBI nr blastp  gi|189239246            2e-137  40.35%  PREDICTED: similar to AGAP002104-PA [Tribolium castaneum]
NCBI nr blastx  gi|189239246            2e-140  38.89%  PREDICTED: similar to AGAP002104-PA [Tribolium castaneum]
Group
KEGG pathway     ecb:100146402  7e-12
                 K06467 (CD22, SIGLEC2) maps-> Cell adhesion molecules (CAMs)
                                               B cell receptor signaling pathway
                                               Hematopoietic cell lineage
InterPro domain  [168-267]  IPR013783  2.3e-18  Immunoglobulin-like fold
                 [442-555]  IPR008957  3.2e-10  Fibronectin type III domain
                 [181-249]  IPR013162  1.7e-07  CD80-like, immunoglobulin C2-set
Orthology group  MCL17091  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216214-TA
TTGGATACGAGGGAGGGTGTGACGTCACACTGGTCGGACCCCACGACCCTCGGCTCCAGGGCCACCTTCCGCAGCAGCACAACGCCGGCTGTGCTACTTCTTACCAAGCTTAGGCCTGAAGACAGCGGACAATATAGATGTCGTGTGGATTTCATCAGATCACCCACCAAGAACACGAGATTGAACCTCACTGTACTCATTCCACCGGAACGTTTAATTATTCTAAATCAAGAAGGTGATGAGATCAAAGGCGGAGTACTCGGTCCATATGATGAGGGGACTGAAGTCAATCTAACTTGCGTTGCCGTTGGAGGTCGGCCTCCAGCGAGGGTGTCCTGGTGGAAGTCGCACGCACTCCTGGCTAATTCAGAGGCGAGAGCCGCTGTTACTTTCACGCTACAGAGGTCTGATTACGGAACCGATATCACGTGTCAGGCTGTCACTGATCCGACCATAACGCCGCTCTCCGAGAATGTCTCCATAGACGTGAATTTGCGACCTCTATGGGTAAGGCTGTTAGGGGGTAAGCGACCTCTAGTGGCGGGGCAGAGTACTGAATTAGTCTGCCAAGCGGTCGGAGCGCGACCAAAACCTAACATATCATGGTGGAAAGGAGGCACAAGATTGAAGAATGTTAGAGAAACAATTTCATCGGATGGCAATGTTACCAGCAGTATCCTGACCTTTGTGCCCTCGATCGATGATGCTGGACGGGTTTTATCGTGCAGAGCTATACAACCTCGTTTACCACACTCTACTCACGAAGACGGTTGGAAACTAGAAATACAACATTTGCCAGTGGTCAAATTGGAATTAGGTGCTAACTTGGACGCGGATAAAGTGATAGAAGGTTCTGATGTGTATTTAGATTGTATGGTGCGAGCAAATCCATGGCACAGTCATGTATATTTTACTCACAATGGTGCAATAGTAAAACCCGGCCCAGGAGTTGTTTTGGCCAACCAGAGCCTCGTTCTTCAGCGTATGTCTAGAAAAGCTACGGGCGGTTATGTTTGCGTCGCAAGGAATGCGCTAGGCGAGGGCTACAGTGACCCACTAGTGTTGGAAGTGAAATATGCTCCTACATGCAAATCTCATCAAGCGACCGTCATCAGAGCAGCTCGTGGGGAGGTCGTCGATATTATGTGTGAAATTGATGCCAATCCTATGGAACCGATGACATATCAGTGGTGGTTTAATAGCAGTACGCAAACAAAACTGGAACTCAATACGTTTTCAACAAATTCTCAGAATAACCTCGGAAGGTATTTGTACACAGTGAACACTTCATCTGACTACGGCTGGGTTCAATGCACTGGTACGAATTCTGTTGGAAGACAGAACACTCCTTGCTTGTTTCACATCCTTCCTGCTGAGAAACCGTCGTCGGTGAAAAACTGCGAAATCACAAACATGACGTATGACTCACTAACTCTTGGATGTTCCCCCGGACACGACGGTGGCATGAAACAATCGTTTCTATTACAAGTATATGACATATCAACCGGTATTCTGCTTCGTAATATCACAAGCGAAGAGGCTCAATTCATAGTTTGGGGTCTGTCCGGAAGTACAGCAGTTGGTATATCCGTTAGAGCCTTCAATAAGAAAGGCTTAAGCGAGCCCTTTACCCTCACATCAAACTTATTGAAACATCCACAACGACATACAGCCAACGTTCCGGTTCGTGTGGAACTAACAACTGTGTTGGTGAGCGTGCTCGTTGCTGTGGCCGTTATAGCAATACTCACAGCAGTATCCGCTGTCATTTTCTGTTGGAAATACTGTAATAAAGACGACAAAAATGAAAAGACAAATCGAAAACAGGACGAGATATCAAATATCCCGTTGACTGGTACCAACGAGTGTGAGAGCGTTGATAGCTTGGATAAAAATCCAGACATAATACCAATCGAGGGTAAAATAAGCGATAATTGTTCGAACAAATCCTCCGCGACCGATTATAGCTCCATAAGACCGTTGTTATCAAAATCGGACGA
CAAATACGATCACAATCCGACCGATCGGCACTACGACCACGGCGATCCGTGTAGCCAATACGTCCAAGTACGACCGTTAGACTTCCGAGTACATCAGTACCAGCATGGTCTACAAATGCCCAACGTGGGCCAAGCTATGCCAGGCCTTGGGATGCACATGCCAAATATTGGACCAAACGAGTGCCGGCCGTATGATAAGGTTTACGAGAATTGGCTGAAGTACAAGAATGCCTTACCGCTAAATACTAGCGGGCTGCTGCCGGTAGAACAGCTCATGCCACCGGAACTGTACACTCCATCGATATACACAACCTCCAGCCGGAATCCTCTCAACCTGTCTCAGATGCCGGAACTCGAGCCTTTCTGTGACAGAAGGTCGCCCCCCGTTAGGGTCAACGCGGATAATAACAGCGCCTACTATAATGTTGACAGACATACACACACCCTGCCTTTGAGGCGACCTGCCAACCATCAATTAGATACGAAGCAGAATCAATTGAAAACAGCGGAAAACATCATGAAGCAACCGGAAACGAGCCAAGAGTGTAGATGA

Protein sequence:

>DPOGS216214-PA
LDTREGVTSHWSDPTTLGSRATFRSSTTPAVLLLTKLRPEDSGQYRCRVDFIRSPTKNTRLNLTVLIPPERLIILNQEGDEIKGGVLGPYDEGTEVNLTCVAVGGRPPARVSWWKSHALLANSEARAAVTFTLQRSDYGTDITCQAVTDPTITPLSENVSIDVNLRPLWVRLLGGKRPLVAGQSTELVCQAVGARPKPNISWWKGGTRLKNVRETISSDGNVTSSILTFVPSIDDAGRVLSCRAIQPRLPHSTHEDGWKLEIQHLPVVKLELGANLDADKVIEGSDVYLDCMVRANPWHSHVYFTHNGAIVKPGPGVVLANQSLVLQRMSRKATGGYVCVARNALGEGYSDPLVLEVKYAPTCKSHQATVIRAARGEVVDIMCEIDANPMEPMTYQWWFNSSTQTKLELNTFSTNSQNNLGRYLYTVNTSSDYGWVQCTGTNSVGRQNTPCLFHILPAEKPSSVKNCEITNMTYDSLTLGCSPGHDGGMKQSFLLQVYDISTGILLRNITSEEAQFIVWGLSGSTAVGISVRAFNKKGLSEPFTLTSNLLKHPQRHTANVPVRVELTTVLVSVLVAVAVIAILTAVSAVIFCWKYCNKDDKNEKTNRKQDEISNIPLTGTNECESVDSLDKNPDIIPIEGKISDNCSNKSSATDYSSIRPLLSKSDDKYDHNPTDRHYDHGDPCSQYVQVRPLDFRVHQYQHGLQMPNVGQAMPGLGMHMPNIGPNECRPYDKVYENWLKYKNALPLNTSGLLPVEQLMPPELYTPSIYTTSSRNPLNLSQMPELEPFCDRRSPPVRVNADNNSAYYNVDRHTHTLPLRRPANHQLDTKQNQLKTAENIMKQPETSQECR-
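
The transcript and protein lengths in this record are consistent with each other: 850 codons plus one stop codon give 851 x 3 = 2553 bp. The sketch below shows one way to check a FASTA pair like this. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1, neither of which the record states. It writes the stop codon as "-", as the protein sequence here does. The function name `translate` and the table constants are illustrative only and are not part of the geneset tooling.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a nucleotide string in frame 1.

    Unknown codons (for example those containing N) become 'X';
    stop codons are written as stop_symbol. Trailing bases that do
    not fill a complete codon are ignored.
    """
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last four codons of DPOGS216214-TA, copied from the record.
print(translate("TTGGATACGAGGGAGGGT"))  # LDTREG
print(translate("GAGTGTAGATGA"))        # ECR-
```

Translating the full DPOGS216214-TA sequence the same way and comparing the result with DPOGS216214-PA would confirm the whole record, provided the nucleotide sequence is first joined into a single string.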