Monarch geneset OGS2.0

DPOGS207666
Transcript        DPOGS207666-TA  1530 bp
Protein           DPOGS207666-PA  509 aa
Genomic position  DPSCF300133  +  164053-168212
RNAseq coverage   709x (Rank: top 18%)
Annotation
Heliconius      HMEL009305        3e-180  63.96%
Bombyx          BGIBMGA010527-TA  2e-165  62.59%
Drosophila      CG6766-PA         2e-96   37.65%
EBI UniRef50    UniRef50_F4WB20   8e-136  47.92%  Endoplasmic reticulum lectin 1 n=5 Tax=Myrmicinae RepID=F4WB20_ACREC
NCBI RefSeq     XP_971325.1       4e-130  49.04%  PREDICTED: similar to xtp3-transactivated protein b [Tribolium castaneum]
NCBI nr blastp  gi|332028603      3e-135  47.92%  Endoplasmic reticulum lectin 1 [Acromyrmex echinatior]
NCBI nr blastx  gi|332028603      5e-136  48.02%  Endoplasmic reticulum lectin 1 [Acromyrmex echinatior]
Group
KEGG pathway     tca:659968  1e-129
                 K14008 (ERLEC1, XTP3B) maps-> Protein processing in endoplasmic reticulum
InterPro domain  [101-186]  IPR012913  2.4e-17  Glucosidase II beta subunit-like
                 [286-463]  IPR009011  5.7e-12  Mannose-6-phosphate receptor, binding
Orthology group  MCL13969  Single-copy universal gene

Nucleotide sequence:

>DPOGS207666-TA
ATGAAACCTTTATGGGTAGTTTTATTAAGTGTAATAGGAGTTTTAAGTATTGAACATGATTTAAAAGGTTTTGATGACAGTATATTATTTCGTATAAATTTCGATCATCCTTCGGGGGAAGGTATAGTAGGTGGAGAAGAAAAATTTAAAAATAAAGATTTTGTGAAAGTTGTAACATCACACAAAGAAAGATATGATTGTTATTTACCAGAGCTGCAGGAGAAAGAATCCACAGGGATACAGGATTATGACGGTCCATCGCCCATAAACCTCATGAAACCTTTGTTCGGTCAAAAAATTTGTTCATACCGATTGGAAAGTTATTGGAGCTATGAAGTATGCCATGGACGGTACATTCGACAATACCACGAAGAGAGAGAAGGGAAACAGATCAAGACACAAGAATATTTCCTCGGTCACTGGAGCGCGGAGAAACAGACAAAATTGGAGGAGGAATTAAAGGCTAAGCAGGAAAGCAAATCAAGTCTTAAAACTACTAAGGTTGAGGGATTGAACCTGCCTTATATAGAGCTGAAGATGGATGATGGCACAGTCTGTGATCTCAGCGGTAAGCCCCGTCTCACAAGGGTGTTATATGTTTGCTTCAGTCACGGCAAACATGAGGTATATTCATTCAAAGAGATAGCAACCTGCGAATATGAAATGATAATATTATCACCGTTACTGTGTGAACATCCTCTATACAAACCGAAGGATGTAGGGCAAAATGATATCGATTGCATCCCAAGAGACGGTGCCCCTATAAGGCCGAGGAATATGTTAAAGAATGAGATTGAGAGAGTGAAATTCCAACATCAGACACTGAAATTACTTAGTGAGGATAAAGAAGCTAGGGATGTAGTTGCTGTGTTGAAAGTTGAAAAAATTGATAAGGATGGTGAAACTCATCTGAAATTCGAGCTGCATCCGCTAGAAGATCCGATTACTGAAAAGGTTCCTGTCATAGAAGCGCCAAAGACCGTCAGAGATGAAGCGGCAATAAAAGCTTTCCTGAACGGAGAGACGTGTTTGAATGGAGGTACAGGTTGGTGGAAATACGAATTCTGTTACGGCCGTCACGTGATTCAGTATCACGAACATCGAGGAGGTGATACCGAGAAGCTTCTGCTGGGGTCCTTTGATGAGGCGGAACACTTGCAATGGATCAAGGAAAACAGGAATAAGGCACCAAAACCTATTGACGAGCGTACATCCGTTTCCCATTTCTACAGTGGCGGTGACATCTGTCAGAAGAGCGGCAAACGGAGACAGACCGAAGTAAAGCTGAAATGTCTCCAGAATTCTTCGAGTCCAGCCCAAGTGTCACTGTATCTCTTGGAACCGAGGACCTGCCACTACATCCTTGGCGTGGAATCGCCATTGATCTGCGACATTCTGCCAATGGCCGACGACAATGGCCTGATCAAATACACCCAACCAGCCGTGGAAACGCCCAGTAAGGTTGTTATCAAGGAAGAAGAGGATGACGTCATCAAACTAAAAGAGTTAAACAAATTCGGTCTAGATTGA

Protein sequence:

>DPOGS207666-PA
MKPLWVVLLSVIGVLSIEHDLKGFDDSILFRINFDHPSGEGIVGGEEKFKNKDFVKVVTSHKERYDCYLPELQEKESTGIQDYDGPSPINLMKPLFGQKICSYRLESYWSYEVCHGRYIRQYHEEREGKQIKTQEYFLGHWSAEKQTKLEEELKAKQESKSSLKTTKVEGLNLPYIELKMDDGTVCDLSGKPRLTRVLYVCFSHGKHEVYSFKEIATCEYEMIILSPLLCEHPLYKPKDVGQNDIDCIPRDGAPIRPRNMLKNEIERVKFQHQTLKLLSEDKEARDVVAVLKVEKIDKDGETHLKFELHPLEDPITEKVPVIEAPKTVRDEAAIKAFLNGETCLNGGTGWWKYEFCYGRHVIQYHEHRGGDTEKLLLGSFDEAEHLQWIKENRNKAPKPIDERTSVSHFYSGGDICQKSGKRRQTEVKLKCLQNSSSPAQVSLYLLEPRTCHYILGVESPLICDILPMADDNGLIKYTQPAVETPSKVVIKEEEDDVIKLKELNKFGLD-
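
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes the transcript is coding sequence only (it starts with ATG and ends with the stop codon TGA, shown as the trailing "-" in the protein) and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical, chosen for illustration.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals 3 * (residues + 1 stop codon).

    Assumes the transcript contains only coding sequence (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
print(cds_matches_protein(1530, 509))  # True: 3 * (509 + 1) = 1530
print(genomic_span(164053, 168212))    # 4160
```

Under these assumptions the genomic span (4160 bp) is longer than the 1530 bp transcript, which would be consistent with the gene containing introns.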