Monarch geneset OGS2.0

DPOGS213307
TranscriptDPOGS213307-TA1470 bp
ProteinDPOGS213307-PA489 aa
Genomic positionDPSCF300130 - 226418-252271
RNAseq coverage62x (Rank: top 68%)
Annotation
HeliconiusHMEL0094289e-5777.78% 
BombyxBGIBMGA005611-TA3e-10792.27% 
DrosophilaCG42402-PC2e-6640.28% 
EBI UniRef50UniRef50_D6W7U03e-8452.29%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W7U0_TRICA
NCBI RefSeqXP_974111.26e-8855.27%PREDICTED: similar to AGAP003572-PA [Tribolium castaneum]
NCBI nr blastpgi|3800116401e-9244.65%PREDICTED: uncharacterized protein LOC100869731 [Apis florea]
NCBI nr blastxgi|3800116405e-9344.86%PREDICTED: uncharacterized protein LOC100869731 [Apis florea]
Group
Gene OntologyGO:00055291e-16sugar binding
KEGG pathway 
InterPro domain[154-236] IPR0009221e-16D-galactoside/L-rhamnose binding SUEL lectin domain
Orthology groupMCL11645 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213307-TA
ATGGCTCGGACAACCAGTTTCTTTTTTGTGTTTTTGATATCCTCGATCCCTTTAAATAGAGCTGATAACTTAGAATTGCTGCTTGGGACTCTCCGCACGCACCAAAAGGCAGCATGTGACGAAGAAATGGTTACTCTTATTTGTCCTCGAGGAACAACAATTAGCATACAAGTAGCCCAGTATGGAGCATCGACTTCACAAAGCTCTTGTTCATCAGAATTAGCGGAATATCAACCTGTAGCTGTTGAAGTAGTAGGAGACACATCTTGCACGTGGCCGAGTGCATTGCAGACCGTCGTTGAAGCTTGTCAGAAGAAGCGACAGTGTAAATTTCATACGAGCCCCAAGGCCTTCGGTGTTGACCCCTGTCCAGGTTCAAGAAGATTCGTGGAGGTAGCCTACAAATGTCGACCATATGAATTCAGAAGTAAAGTGGGCTGTGAAAACGACGTGCTTCATTTGAGCTGTAACCCACACTCAAGGGTTGCGATATATTCGGCACAGTATGGGCGGACGGAGTACGACTCAATACAATGCCCACAGCCGCGGGGAATGAAAGAGGAAACATGCTTAGAGCCTTATGCTACTGAAACATCGATGAGGGAATGCCATGGAAAACGTCGATGTGTTCTTTCAGCAGACAACAAGATGTTTGGAAGGCCATGTCGAACGGGAAGCAGAACATACCTGAAAGTTGTTTATACTTGTGTGCCCCGAACCGTTTTAAAAGAGAGGTATGAAAGTGCTCCTGAAGAGGATGAAGTCGCCCACGATGTATCAGATCTGGAACACGATGATGTCGATGAGTCTAGCGATCGCTGGTGGGGAGAGTCAGTACCCCCAGCACCAGCAGTAGCGGCTGTCCCACCTCAACGTCCCACAGCGCATACTAACATTAGCAGAGACGGACCCACAACCACTCAGCCAAAACAGCATACATCAAATGATGAACAATTTGATATGATGTACGTATACGTGATTGCTGCTGCCACAGGAATATGTCTTATGTGTCTTATAATCGGAGTAATACGATGTGTGAAGCTCAGAAACAACACAGATCAAGCCAAAGGACCGGATGTCTCCGCCTCTACTGACATCCCTAACGGCTTTAATGACAGTATATCGGAGGTCGATAATGACATAAACATCACAAGCCTCTCGGGCCCAGTAGACACTGTAGACTCTAGTCTGAAGCAAGACATGCAAATTACGAACATGGCCAACATGAGTCCAAAAATCAATCGATACGTTGGCAGGCCAGTGCCTAATACATATCCCCACGTGAGTACTAATATGTATGGACAGGTTGCGGAATATCCAGTTGAAATGCCTCTTCGAACAATGCCTCATGGAACTTTAGGACGTAGTATGGCGGTAAAAACTTTGCCTAGAATCCAATTGCAAACAGAAACGGACCCTAACACTAGGAGTTTATATCGTTATTCAAATGCACAATACTATTTTGGGTGA

Protein sequence:

>DPOGS213307-PA
MARTTSFFFVFLISSIPLNRADNLELLLGTLRTHQKAACDEEMVTLICPRGTTISIQVAQYGASTSQSSCSSELAEYQPVAVEVVGDTSCTWPSALQTVVEACQKKRQCKFHTSPKAFGVDPCPGSRRFVEVAYKCRPYEFRSKVGCENDVLHLSCNPHSRVAIYSAQYGRTEYDSIQCPQPRGMKEETCLEPYATETSMRECHGKRRCVLSADNKMFGRPCRTGSRTYLKVVYTCVPRTVLKERYESAPEEDEVAHDVSDLEHDDVDESSDRWWGESVPPAPAVAAVPPQRPTAHTNISRDGPTTTQPKQHTSNDEQFDMMYVYVIAAATGICLMCLIIGVIRCVKLRNNTDQAKGPDVSASTDIPNGFNDSISEVDNDINITSLSGPVDTVDSSLKQDMQITNMANMSPKINRYVGRPVPNTYPHVSTNMYGQVAEYPVEMPLRTMPHGTLGRSMAVKTLPRIQLQTETDPNTRSLYRYSNAQYYFG-