Monarch geneset OGS2.0

DPOGS214545
TranscriptDPOGS214545-TA3156 bp
ProteinDPOGS214545-PA1051 aa
Genomic positionDPSCF300266 - 248074-255843
RNAseq coverage177x (Rank: top 50%)
Annotation
HeliconiusHMEL0161230.066.17% 
BombyxBGIBMGA003273-TA9e-9279.37% 
DrosophilaCG13676-PA2e-5248.31% 
EBI UniRef50UniRef50_B2DBM90.057.56%Similar to CG13676-PA n=2 Tax=Papilionoidea RepID=B2DBM9_9NEOP
NCBI RefSeqXP_002426626.12e-6359.18%hypothetical protein Phum_PHUM263920 [Pediculus humanus corporis]
NCBI nr blastpgi|1839792540.057.56%similar to CG13676-PA [Papilio xuthus]
NCBI nr blastxgi|1839792540.060.33%similar to CG13676-PA [Papilio xuthus]
Group
Gene OntologyGO:00080616.1e-10chitin binding
GO:00060306.1e-10chitin metabolic process
GO:00055766.1e-10extracellular region
KEGG pathway 
InterPro domain[110-171] IPR0025576.1e-10Chitin binding domain
Orthology groupMCL17559 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214545-TA
ATGAAGCGCTCTCGTCATAGATGGCTGGGGTTCGTGCTATTTTGCGTAACTTTGGCAGCGCTCACGGACTTGCCGACAACGACAGCGGTCCGTGCGACGTCACTCATCTTCTCAAAGACGACGACATCAACCAGCACTGCTGGCCCAGAAGAAGAACCCTCTGCACCTGATGATGAAGCACAGGTTGAACCGAGTGCGGAAGGGGAAGAAGGGAACGCTACTAGTAAATATACGGGAATTCCTCAAATAGATTACATACTTGATCCAAATCTACCACGCGAGCTGAATGGGTACAATTTATCCCAGTATCCCTTCTACGAAGCTGTGCCTCCTCCAGAAACCATGGATTTCAAATGCGATGGACTCCACGATGGTTTCTACGCCTCTATACCCCATAAGTGTCAGGTCTACCACCACTGCCTCTTCGGCACCAGATACGACTTCCTCTGCGCGAACTACACAGCCTTCGATCAAAAAACTTTCATCTGTCACTTCGTATCTGAAGTGGATTGTAAAAATTCAGCAAAATATTTTAGCAGGAACGAAGCTCTATACAAGGCGGCGTCCACTGACCCTCCGTCTACAACATCTACAACAACCACCACAACAACAACCACGCCGCGACCACCACGGCCAGGAAGACGCCGACCACATCCCAGATACGATTACTACGATGACGATTATTACTACCCAGCTAGAGATGACTATGATTACGAAGAGAGAGGTGGTAGACGGAATAGACCTCGTAGACCAGGCAAGCGCAGGCCTCAAGTTGACTACGATGACAGATACGAGACGAGATCGAGACCAAGGGATGATGTTGATGAGGATTACGAAGACAGAAGACCATATGAAAGACCGAGAGCTGCTAAAAGACCGGATTACAGAAGACCATACGACGATGAAGACAGAAGACCATATAAAGGTGGTAGAAGGCCAGGAAAACCAAGGGTTGATGAAGAGTCATATGGTCTCGAAGATGAAGAAAATGACAGGCCGAGAGATAGTAAAAGAGGTGAATTTAGACCCAGGGATAAGTTTCGACCTAGAGACGAAGAAAGATCTAGGGACGAAATCCGTCCACGAGATGAAGATGTTATTCGACCGCGAGATGAAAATAAATACAAAGATGAAGACAGTAATAGAGATGAATTGAGAACTCGAGATGATGTAGACTCTAGAGATGAAATAAGACCCAGAGATGAGATAAGGCCTCGAGATGAAATACGTCCAAGAGATGAAATAAGGCCTCGAGATGAAATACGACCAAGAGATGAAATAAGGCCTCGTGATGAAATACGACCAAGAGATGAAATAAGGCCTCGAGATGAAATACGCCCAAGAGATGAGATAAGGCCTCGTGATGAAATACGACCAAGAGATGAGATAAGGCCTCGTGATGAAATACGGCCCAGGGATAAGAAAAGACCCAGACCACTCAGAGACGAGGTCCCTGCTATAGAAAGTAACCCGAGAGAAAGGGTAAGGGATGACAGGTACCAATCAACTGAAGGTAGAAGATTATACGATAGACCATACAGGCCTAGGGAAGATCGTCCCCGGAATAAGCCTGACCAAGACGATGATGAAGATAGACCAGTAGAAGTCCGACAGAGTCCTTCAACTGCTGATTCTCAACCGTTAGTGAAACCCAACGGACGTGGTATTTTCAGTAAACCTAGAATGCCGCCCAAAATTAAAAGACCCGTTCCGATTAATGAAAAGGAAAAGTATGAATATGTTACAATAACAACTACAAAGGCTCCACCAAAACTGGCTGATGATGAATATTATGATGAATATGATGAAGAGGATAGCCGGCGGCCATTACCCTCTGCAAAAACAAGCACTATTCAACAAAAACCAAAACCTGAGCCAATTGAAAGAGAAAAATTTAAACCAACCAGATCTGCAGGTATAGAGAAATTTAAAAAACCAAAGGTTCCAATTGATTATGATGATGATGAATATTATTATGAGTCCTCAGCGCGACCTCTGAAGTATTCCAATATCCAGCGCGCTAAAGACGAACGCCCATCGAAAGCTAAAGACCCCGACTATTTCTATGATGACAAAGAGAAAGTAACAGAAAATGAAAAATTTAAGGAAAAAGAAGAGGAAATAACTTCCAAGATAAGAGATATTAAACCTAATGTAAAAGTTTTCAGGAGGCCTTTTTTACCGTCTAGGGGTGGAAGTCCTTATCTCCCAAGAGGGCTACAACCAGTAGCTGGTAAAGATATGAGACTACCGCACAACACCACCCCGAAACCTACTTCTACCTCTTCTACTACAACTACAACTACTACAACAACCACTACTACTACTACAACAACGCCTCCACCAACAACAACAACTACTACAGAACCACCAACAACAACCACAGAACGAATAACAACAACAGTCGCAACACAAAAACCCACATCAACCGCTGTGCCTGAAGAAGAATATGAATATTATGACGAAGAAGACATAGAATATGAAACAAAACACAAAAAAGAACCAACGACAACTATTCAGTCAGAAGTTGAAGAAATTAAAGCAATTCCTACCACCACGACTACACCAGCACCCACCACGTTAAAAACAACAACTCAAAGATACATAGAGTCACATAATGATAGAGCAAAAAATTTAGCAACGAAAGTATTGAGGAATTTTAATGAAAATTACGAAGCCATTAAAGAAAAGCTAGAATCAACTCTGTCACCAGGAGATTATCTGAAACCGTACATCCCTTATAGAGATATCAATGTCAACAAGCCAGCTCCGAGGCCTGAAATATCAAATGGTTACACAGCCACAGGAAAACCATTGAAAATACCAGACATCAAACAGAATCTGGCTGATTCAATTGAAAACGAATACGACATTAGGTTAAACGAGGCGATCTCGCCCGTAAGAACTCCTAGCGGCTTCATCGTACCAAACGACAGAGATTATTCCTTCTCGAGGTACAGAAACAATATTCAGTCTGAGCCTCAGTACTCCGCCTCAGAAATAGGGGCTTACCAAGTTAGGAAACGACCTCAAATCTCTGGTGTCACGATCCGAACTCCCGCTTCGTATTTTTTACCGCAAAGGGTCATTTACGACGACGGTTCAGTGAGACAGACTCGCCAACTAATATATAGACAAATAGGAGATGTATATTAA

Protein sequence:

>DPOGS214545-PA
MKRSRHRWLGFVLFCVTLAALTDLPTTTAVRATSLIFSKTTTSTSTAGPEEEPSAPDDEAQVEPSAEGEEGNATSKYTGIPQIDYILDPNLPRELNGYNLSQYPFYEAVPPPETMDFKCDGLHDGFYASIPHKCQVYHHCLFGTRYDFLCANYTAFDQKTFICHFVSEVDCKNSAKYFSRNEALYKAASTDPPSTTSTTTTTTTTTPRPPRPGRRRPHPRYDYYDDDYYYPARDDYDYEERGGRRNRPRRPGKRRPQVDYDDRYETRSRPRDDVDEDYEDRRPYERPRAAKRPDYRRPYDDEDRRPYKGGRRPGKPRVDEESYGLEDEENDRPRDSKRGEFRPRDKFRPRDEERSRDEIRPRDEDVIRPRDENKYKDEDSNRDELRTRDDVDSRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDEIRPRDKKRPRPLRDEVPAIESNPRERVRDDRYQSTEGRRLYDRPYRPREDRPRNKPDQDDDEDRPVEVRQSPSTADSQPLVKPNGRGIFSKPRMPPKIKRPVPINEKEKYEYVTITTTKAPPKLADDEYYDEYDEEDSRRPLPSAKTSTIQQKPKPEPIEREKFKPTRSAGIEKFKKPKVPIDYDDDEYYYESSARPLKYSNIQRAKDERPSKAKDPDYFYDDKEKVTENEKFKEKEEEITSKIRDIKPNVKVFRRPFLPSRGGSPYLPRGLQPVAGKDMRLPHNTTPKPTSTSSTTTTTTTTTTTTTTTTPPPTTTTTTEPPTTTTERITTTVATQKPTSTAVPEEEYEYYDEEDIEYETKHKKEPTTTIQSEVEEIKAIPTTTTTPAPTTLKTTTQRYIESHNDRAKNLATKVLRNFNENYEAIKEKLESTLSPGDYLKPYIPYRDINVNKPAPRPEISNGYTATGKPLKIPDIKQNLADSIENEYDIRLNEAISPVRTPSGFIVPNDRDYSFSRYRNNIQSEPQYSASEIGAYQVRKRPQISGVTIRTPASYFLPQRVIYDDGSVRQTRQLIYRQIGDVY-