Monarch geneset OGS2.0

DPOGS211458
TranscriptDPOGS211458-TA1071 bp
ProteinDPOGS211458-PA356 aa
Genomic positionDPSCF300223 + 143926-145421
RNAseq coverage169x (Rank: top 51%)
Annotation
HeliconiusHMEL0138222e-7353.08% 
BombyxBGIBMGA002163-TA9e-8047.54% 
DrosophilaCG14866-PB3e-4135.78% 
EBI UniRef50UniRef50_D6WKQ76e-4936.41%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKQ7_TRICA
NCBI RefSeqXP_972559.11e-4936.41%PREDICTED: similar to C-type lectin (AGAP002625-PA) [Tribolium castaneum]
NCBI nr blastpgi|910948672e-4836.41%PREDICTED: similar to C-type lectin (AGAP002625-PA) [Tribolium castaneum]
NCBI nr blastxgi|910948672e-4838.49%PREDICTED: similar to C-type lectin (AGAP002625-PA) [Tribolium castaneum]
Group
Gene OntologyGO:00054887.3e-20binding
KEGG pathway 
InterPro domain[199-321] IPR0161877.3e-20C-type lectin fold
[193-320] IPR0161861.3e-14C-type lectin-like
[199-319] IPR0013045.7e-06C-type lectin
Orthology groupMCL16611 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211458-TA
ATGGCGCTGCTGTTGTTCGGAGTGGCTTTATTAATAGCCGGCGTGAGTAGTGACAACTTCACGGAGTTAGAGTTATGTGCGGTGGAGGGTGACGGCCCCTGGAGCCGAGCTCTGGAAGACTGGAGCATCTCGGCCGGCTCCGGCAGACACGGTAAGAACTTCAACACAGGTCGCATCATGGCGCTGCCGCAGAGTAAGGGCTACTACGTGTCGGATCGTGTCGAGACGCCCTTCCTGTCCCCCCCGCCGCCTCTGTACAACCCTCAGGGGTACGCGCCCCCGGCGCAGGCCATGCAGGTGCCGCACGGGTACAAGGAGTGGGAAGGGAAACCTTCACATCCCGGAAATGGCAAGATAGTCAACCGGCCGCCCAATAAAGATAAATTTGCGCCGAGTTACCCCCCTCAGGGTCACCGGCCCTCCATCGACCGCGTGGACGACCCCCCGCGGCGTCAGGTCACCGAGACAGACCTGTACCTCCTGAGCGCGGTCGAGAAACTCGCGCACCGAGCGGACTTCATGGAGAAGAGACTCCGGCGACTCGAGGAAAGCCTGTACCACGCGCTGCAGAACAAGGAACCCGCCCCCGCGCCGTGTCCCGGGAACTTCACCCGCGTGGGCTCGTCCTGTTACTCCGTGTCGTCCAGCCAGCGGGACTGGAAGGACGCGTCCCTGGCGTGTCGGGCGACACACGCCGCCCTGCTCGAGCTGGCCGACGAGAAACAGAAGAAGATCGTGCTGGCCTGGATGCTGGCCGACACGGACCGCAGAGGCGTGGACTACTGGACGGGCGGCCTCAACCCCGGCCTGCTGTGGATCTGGTCGCACTCCGCCCGCCCTGTCAACGGCAGCGTGGCGGGCGACGGGCGGTGCCTGGCCGCCGTCCACGACCCCGCCCTCAACACGCACGTCTACAGGGGGAGGGACTGCGCCGCCAGGCTGCACTACATCTGTGTCAAGGAGGACGACGGACATCTCAGTAACGAGGTGCAGAGAGCGGCCAGGGAACTCACGAGGAGGAGAGAGGGAACCGGGGGAGAGACGGCGGGAGCGGGGGCGCGTACATTATAA

Protein sequence:

>DPOGS211458-PA
MALLLFGVALLIAGVSSDNFTELELCAVEGDGPWSRALEDWSISAGSGRHGKNFNTGRIMALPQSKGYYVSDRVETPFLSPPPPLYNPQGYAPPAQAMQVPHGYKEWEGKPSHPGNGKIVNRPPNKDKFAPSYPPQGHRPSIDRVDDPPRRQVTETDLYLLSAVEKLAHRADFMEKRLRRLEESLYHALQNKEPAPAPCPGNFTRVGSSCYSVSSSQRDWKDASLACRATHAALLELADEKQKKIVLAWMLADTDRRGVDYWTGGLNPGLLWIWSHSARPVNGSVAGDGRCLAAVHDPALNTHVYRGRDCAARLHYICVKEDDGHLSNEVQRAARELTRRREGTGGETAGAGARTL-