Monarch geneset OGS2.0

DPOGS215784
TranscriptDPOGS215784-TA1044 bp
ProteinDPOGS215784-PA347 aa
Genomic positionDPSCF300041 + 1896325-1907145
RNAseq coverage6859x (Rank: top 2%)
Annotation
HeliconiusHMEL0085436e-17390.21% 
BombyxBGIBMGA003660-TA4e-15776.82% 
DrosophilaCG43164-PC3e-7664.29% 
EBI UniRef50UniRef50_D6W6W73e-8251.52%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W6W7_TRICA
NCBI RefSeqXP_967981.12e-7672.43%PREDICTED: similar to CG18431 CG18431-PA [Tribolium castaneum]
NCBI nr blastpgi|2700151581e-8151.52%hypothetical protein TcasGA2_TC014184 [Tribolium castaneum]
NCBI nr blastxgi|2700151581e-8052.13%hypothetical protein TcasGA2_TC014184 [Tribolium castaneum]
Group
Gene OntologyGO:00054888.8e-20binding
KEGG pathway 
InterPro domain[23-187] IPR0161868.8e-20C-type lectin-like
[4-191] IPR0161872.9e-14C-type lectin fold
Orthology groupMCL15754 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215784-TA
ATGCTTCTACTCGCCAGCTTTGTCTTGGTCTTCGCAACCGCTATTGGGTCTTCAGCGGCCCAAAGGATCACAACTATTCAACTGGACGGAGTGCAGTATTTTATATCAAGAATGAACCCCTACAGCCCCGAACTAAATTATTTCCTCGCTTATCAGTATTGCAGGTCTCTGGGACTGCAGCTAGCATCATTCGAAACCAAGGAAAAGGCAGATTCAATTACTACATATCTAACAAACGCAGGCTACAACAAGTATGATTTCTGGACGTCCGGCAACAACCTCGGCACTGACATGTACTTGTGGATGAGCACCGGTCTGCCATTCAACGCGACATTTAACTACATGAGAAGAGTAACAATGGATCAACCCAACCACCACGATGACGATAGCATGGACCCATTAGATATGCCCCAGGGAAGCACAGCCCCTCAACGCACCGCCAGACATGGGACTGAGCATGTGATGACAAACGGCTGCGTATCTCTCAAGGCGCCATCCTTCCATTGGGAGCCGCAGCACTGCGGAGAGATCAAGGACTTTATCTGCGAGCAGACACGTTGCTACTATTACAACTACGGCTCCATCCCAGTCTCCTCGGCGCAGGGGAAACCGCTGTCGATGACGACAACGACGACGGCCCACCCGCTAACCTCGTCGTCGCCGCCACCGCCACGCTCCGAACACCTCCCGACACATTTCACTCTTAACGACCTCATCGGCAAGCTGCGACCATCATCCCTCGACGGCCTCCAGTCACCTCACCTCAAGACCGGAGCACTATTAAAATCCCCACCTCAGATCGACTCGCATTACTCGCGGGCTTCCGATGCCCACTCGAAGGAGGTAAAAGAGCCCGAACAGAAAGAGGACACAACTCAAGGTCCTTACGAGGCCGACGAGGGAATGGCGGGCGATGATCCCGCTGCGTATCTCACCCACGAGGCCCGGGAGCTATCCATCGAGGACGCTCCCTCCACCGTCGAGCCGGACGCCACAGAGCACTCCGTCCACGCCAGCGGCATGTTGGCACCCCCCGCCTATTAG

Protein sequence:

>DPOGS215784-PA
MLLLASFVLVFATAIGSSAAQRITTIQLDGVQYFISRMNPYSPELNYFLAYQYCRSLGLQLASFETKEKADSITTYLTNAGYNKYDFWTSGNNLGTDMYLWMSTGLPFNATFNYMRRVTMDQPNHHDDDSMDPLDMPQGSTAPQRTARHGTEHVMTNGCVSLKAPSFHWEPQHCGEIKDFICEQTRCYYYNYGSIPVSSAQGKPLSMTTTTTAHPLTSSSPPPPRSEHLPTHFTLNDLIGKLRPSSLDGLQSPHLKTGALLKSPPQIDSHYSRASDAHSKEVKEPEQKEDTTQGPYEADEGMAGDDPAAYLTHEARELSIEDAPSTVEPDATEHSVHASGMLAPPAY-