Monarch geneset OGS2.0

DPOGS213306
TranscriptDPOGS213306-TA1338 bp
ProteinDPOGS213306-PA445 aa
Genomic positionDPSCF300130 - 324352-332098
RNAseq coverage50x (Rank: top 70%)
Annotation
HeliconiusHMEL0048384e-6051.61% 
BombyxBGIBMGA005611-TA1e-5452.22% 
DrosophilaCG42402-PC6e-5434.03% 
EBI UniRef50UniRef50_D6W7U04e-6333.09%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W7U0_TRICA
NCBI RefSeqXP_974111.26e-7235.07%PREDICTED: similar to AGAP003572-PA [Tribolium castaneum]
NCBI nr blastpgi|1892339471e-7035.07%PREDICTED: similar to AGAP003572-PA [Tribolium castaneum]
NCBI nr blastxgi|1892339473e-7036.87%PREDICTED: similar to AGAP003572-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055292.9e-13sugar binding
KEGG pathwaymcc:7086816e-06 
 K03681 (RRP40, EXOSC3)maps-> RNA degradation
InterPro domain[122-207] IPR0009222.9e-13D-galactoside/L-rhamnose binding SUEL lectin domain
Orthology groupMCL25152 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213306-TA
ATGCAGCGAGCGGCTTGTGACGATGAGATGGTATCGCTGTCCTGTCCTCCAGGGACTTTAATAAGCATTCAAATTGCTCAATACGGAAAAGTTGTGCCTGGAAGCCATGCTTGTATAGCAGATGTAAATCAACAAATGGATGATGCAGAAGAAATATGTCTGTGGCCAAATGAAATGCAGTATTCACTTCTACGAAGGGTTGTAGAGGCCTGTCAGAAGAAACCTCAATGTAAGTTCAGTACGAAGTTAAAGCCAGAGAAGGTCGATCCCTGCCCCTTAGCTCGCAAGTTCGTCGAAGTGGCTTACAAATGTCGTCCCCATGAATTTCGAAGTCGAACTGGCTGTGAGGATGATGTGATCAAGCTGAGCTGTAACCAGCAATCACGGGTAGCGATATACGACGCCCAATACGGTCGTTTGGCGTATGAAACCGTTTCTTGCCCCAAGCCACAGGGTGTTTCGGATGAATCGAACGCTGTTTGCTCAGCTCCCTATGCTGTTGAAACTGTTATGCAGATCTGTCATGGAAAGAGGTATTGCCAGGTTGTAGCAAATAACAAGACGTTCGGATCGAATTGCAACCCAAATTTCAAGAGTTATTTCAAAGTCGTTTACGCATGTGTTCCATTGGGAGTCCTAACAGAGCGCTACGAAAGTGCTACGGAGAACGATAGAGTCATGAATACACATACGAATAACGCGGAAGGCTACTTTGATGAATCAGATACTGGTGAAAAATGGAAAGAACCGAATGGAGGTATTCCAATCATCAATCCAGTTTTTCCAGGGGAAGTTGATATGATTAACCCGAACCGAAATAAAATTGACGACACCAAAATTACCTTCAACTCTCATAGAAACAACGACAAAGACTCATTTTATACACCGAACACGAAATTTTTGATCTATGTTAGTATTGGTATAATAGTTGTCATTATATTGATAATTGTTTTAATATTAACACGATATTATAAAAAACGAGATCAGACGGACAGATCAAAAAATGGAGATATGTTCACAACAGAAACACCGAACATATTTAGCGATAATGTATCAGATTTGGACGTCGACGTTGACGTTAGTCAATTTTCTGGAACTTTCTACGATCCAGCACATCCAGATATGATTTTATACAAGGACGGACCGAATAAAGCGACACTTAGAGCCATGAAGCCATTGTCCACAGTGTATCCTTGTGTAGGCACGAGCATGTACGGGAATGTAGACTATGTGCCGCCACAATCTAGTGACACTAGTTTCAGCAAGGATCCAAAACAAGAATTAGTGATGAGTCCAAAGAGTTTAGGTTATGCTAACAACCAGTACTTTTATGGGTGA

Protein sequence:

>DPOGS213306-PA
MQRAACDDEMVSLSCPPGTLISIQIAQYGKVVPGSHACIADVNQQMDDAEEICLWPNEMQYSLLRRVVEACQKKPQCKFSTKLKPEKVDPCPLARKFVEVAYKCRPHEFRSRTGCEDDVIKLSCNQQSRVAIYDAQYGRLAYETVSCPKPQGVSDESNAVCSAPYAVETVMQICHGKRYCQVVANNKTFGSNCNPNFKSYFKVVYACVPLGVLTERYESATENDRVMNTHTNNAEGYFDESDTGEKWKEPNGGIPIINPVFPGEVDMINPNRNKIDDTKITFNSHRNNDKDSFYTPNTKFLIYVSIGIIVVIILIIVLILTRYYKKRDQTDRSKNGDMFTTETPNIFSDNVSDLDVDVDVSQFSGTFYDPAHPDMILYKDGPNKATLRAMKPLSTVYPCVGTSMYGNVDYVPPQSSDTSFSKDPKQELVMSPKSLGYANNQYFYG-