Monarch geneset OGS2.0

DPOGS210065
Transcript        DPOGS210065-TA  1137 bp
Protein           DPOGS210065-PA  378 aa
Genomic position  DPSCF300017 - 620431-645752
RNAseq coverage   905x (Rank: top 14%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL015469        2e-94    75.43%
Bombyx          BGIBMGA012705-TA  3e-48    56.19%
Drosophila      rgn-PA            3e-25    38.13%
EBI UniRef50    UniRef50_D6WN77   1e-36    41.15%    Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WN77_TRICA
NCBI RefSeq     XP_970274.2       2e-37    41.15%    PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp  gi|189236990      5e-36    41.15%    PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx  gi|189236990      2e-41    34.05%    PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology    GO:0005488          2.4e-24  binding
KEGG pathway
InterPro domain  [39-169] IPR016187  2.4e-24  C-type lectin fold
                 [31-172] IPR016186  1.6e-18  C-type lectin-like
                 [57-167] IPR001304  1.4e-10  C-type lectin
Orthology group  MCL26000            Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210065-TA
ATGGAGGCTCTTGAAAGGAGGGGCAATCTCGCCGTCATATTAGCGACAGTCTGCATCGCTTTGATTTACAATTGTGATAGTGTTGCTGCTAGGAACCACAGCATGTCGTACGTGTGTCCGCCCCAGTTCATCCGACTCGGTCACAGCTGTTACTTCTTCAGTGAGAACAAAGCCACGTGGCAGTCGGCTCTTTTCTTCTGCAAGGATCGTGATAGCAACCTAACAGTGCCAGCTAGGTGGGAGGACAGGAATCTGAGAAACTACCTCAATAAACCAAACGTCGACAAAGCAAGTCGCTGGATTGGTGGAATATATGACGCAGCGTCGCGCTCCTGGAAATGGGGAGGTGAACTTCGACCGATGCATTATCAGTCGTTCTCCAAGATGAAGAAGCTGACCCCCGAGCAATTGCAGTTCCATTGCATTGCTATGATACCCGAGCTTCTCTACAGATGGGCACCAAGAAGCTGTTTCGAGCCGAGGGAGTTTATTTGTCAAACTAAACTAAAGAAAGTATCGAAGGCGAAAGTTAAAGAACTACAAAAGAGATACCAACGAATGGGAAAAATCAATGAGATAACAGCGCCGAGCGCGAGCAGAGAAGTTGAAGATGGCCATATCAATGACGTCACCAGTAACCCTGTGTTGAACCCCAAGTCCTTCGACCTTCACCCGAGTCCTCTCAGACGAGGGCCGAAGAAGAATCCGATTCGAAACAAGACACACGGCTTTCGACCCAACGAGTTGAAGAAACGATCGCGTCCCCATATTGAGAGACGGCTCAAGAGACCGTTCCCAGGTTATCAATGGAACCGCCGCAAACCCGAGGAATCGTACAAATACAACATGGAGCTCCTACTCTCTGGTCGCACGGGGCTCAGTCCACAGCAAGTGAAGTTTCACCTGTCTCGTCTGCAGCGTCTGCGGGACAGGCAGCTAAACCGGATGAGGGAACGCGACGATTGGCTTGTCAACGAAACGAAGCCCGCCATAGTCCGCACCCACGCGAGGACATACACACTCGATAATAACATCAGCGCTCTCCACCCTAAGACCATTGTTGAGGAGTTTGATATGCTTCCACCCCCAGCTCCCGCACCCGTCGTAGTGCCAAGACCCACTCGTGGTATTTGGTAA

Protein sequence:

>DPOGS210065-PA
MEALERRGNLAVILATVCIALIYNCDSVAARNHSMSYVCPPQFIRLGHSCYFFSENKATWQSALFFCKDRDSNLTVPARWEDRNLRNYLNKPNVDKASRWIGGIYDAASRSWKWGGELRPMHYQSFSKMKKLTPEQLQFHCIAMIPELLYRWAPRSCFEPREFICQTKLKKVSKAKVKELQKRYQRMGKINEITAPSASREVEDGHINDVTSNPVLNPKSFDLHPSPLRRGPKKNPIRNKTHGFRPNELKKRSRPHIERRLKRPFPGYQWNRRKPEESYKYNMELLLSGRTGLSPQQVKFHLSRLQRLRDRQLNRMRERDDWLVNETKPAIVRTHARTYTLDNNISALHPKTIVEEFDMLPPPAPAPVVVPRPTRGIW-