Monarch geneset OGS2.0

DPOGS214546
TranscriptDPOGS214546-TA3012 bp
ProteinDPOGS214546-PA1003 aa
Genomic positionDPSCF300266 - 235834-241435
RNAseq coverage83x (Rank: top 64%)
Annotation
HeliconiusHMEL0161230.062.41% 
BombyxBGIBMGA003273-TA3e-8377.71% 
DrosophilaCG13676-PA1e-5248.31% 
EBI UniRef50UniRef50_B2DBM90.054.98%Similar to CG13676-PA n=2 Tax=Papilionoidea RepID=B2DBM9_9NEOP
NCBI RefSeqXP_002426626.13e-6359.18%hypothetical protein Phum_PHUM263920 [Pediculus humanus corporis]
NCBI nr blastpgi|1839792540.054.98%similar to CG13676-PA [Papilio xuthus]
NCBI nr blastxgi|1839792540.057.62%similar to CG13676-PA [Papilio xuthus]
Group
Gene OntologyGO:00080616.1e-10chitin binding
GO:00060306.1e-10chitin metabolic process
GO:00055766.1e-10extracellular region
KEGG pathway 
InterPro domain[100-161] IPR0025576.1e-10Chitin binding domain
Orthology groupMCL17559 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214546-TA
ATGGGTTCTATGTGCGTAACTTTGGCAGCGCTCACGGACTTGCCGACAACGACAGCGGTCCGTGCGACGTCACTCATCTTCTCAAAGACGACGACATCAACCAGCACTGCTGGCCCAGAAGAAGAACCCTCTGCACCTGATGATGAAGCACAGGTTGAACCGAGTGCGGAAGGGGAAGAAGGGAACGCTACTAGTAAATATACGGGAATTCCTCAAATAGATTACATACTTGATCCAAATCTACCACGCGAGCTGAATGGGTACAATTTATCCCAGTATCCCTTCTACGAAGCTGTGCCTCCTCCAGAAACCATGGATTTCAAATGCGATGGACTCCACGATGGTTTCTACGCCTCTATACCCCATAAGTGTCAGGTCTACCACCACTGCCTCTTCGGCACCAGATACGACTTCCTCTGCGCGAACTACACAGCCTTCGATCAAAAAACTTTCATCTGTCACTTCGTATCTGAAGTGGATTGTAAAAATTCAGCAAAATATTTTAGCAGGAACGAAGCTCTATACAAGGCGGCGTCCACTGACCCTCCGTCTACAACATCTACAACAACCACCACAACAACAACCACGCCGCGACCACCACGGCCAGGAAGACGCCGACCACATCCCAGATACGATTACTACGATGACGATTATTACTACCCAGCTAGAGATGACTATGATTACGAAGAGAGAGGTGGTAGACGGAATAGACCTCGTAGACCAGGCAAGCGCAGGCCTCAAGTTGACTACGATGACAGATACGAGACGAGATCGAGACCAAGGGATGATGTTGATGAGGATTACGAAGACAGAAGACCATATGAAAGACCGAGAGCTGCTAAAAGACCGGATTACAGAAGACCATACGACGATGAAGACAGAAGACCATATAAAGGTGGTAGAAGGCCAGGAAAACCAAGGGTTGATGAAGAGTCATATGGTCTCGAAGATGAAGAAAATGACAGGCCGAGAGATAGTAAAAGAGGTGAATTTAGACCCAGGGATAAGTTTAGACCTAGAGACGAAGAAAGATCTAGGGACGAAATCCGTCCACGAGATGAATTGAATTCTAGGGATGAAATAAGATTAAGAGACGAGGGAAAATCAAAAGATGACACGGATTCTAGAGATGTTATTCGACCGCGAGATGAAAATAAATACAAAGATGAAGACAGCCTCGAGATGAAATACGGACCAAGAGATGAAATAAGGCCTCGAGATGAAATACGCCCAAGAGATGAGATAAGGCCTCGTGATGAAATACGGCCCAGGGATAAGAAAAGACCCAGACCACTCAGAGACGAGGTCCCTGCTATAGAAAGTAACCCGAGAGAAAGGGTAAGGGATGACAGGTACCAATCAACTGAAGGTAGAAGATTATACGATAGACCATACAGGCCTAGGGAAGATCGTCCCCGGAATAAGCCTGACCAAGACGATGATGAAGATAGACCAGTAGAAGTCCGACAGAGTCCTTCAACTGCTGATTCTCAACCGTTGGTGAAACCCAACGGACGTGGTATTTTCAGTAAACCAAGAATGCCGCCCAAAATTAAAAGACCCGTTCCGATTAATGAAAAGGAAAAGTATGAATATGTTACAATAACAACTACAAAGGCTCCACCAAAACTGGCTGATGATGAATATTATGATGAATATGATGAAGAAGATAGCCGGCGGCCATTACCCTCTGCAAAAACAAGCACTATTCAACAAAAACCAAAACCTGAGCCAATTGAAAGAGAAAAATTTAAACCAACCAGATCTGCAAGTATAGAGAAATTTAAAAAACCAAAGGTTCCAATTGATTATGATGATGAAGAATATTATTATGAGTCCTCAGCACGACCTCTGAAGTATTCCAATATCCAGCGTGCTAAAGACGAACGCCCATCAAAAGCTAAAGACCCCGACTATTTCTATGATGACAAAGAGAAAGTAACAGAAAATGAAAAATTTAAGGAAAAAGAAGAGGAAATAACTTCCAAGATAAGAGATATTAAACCTAATGTAAAAGTTTTCAGGAGGCCTTTTTTACCGTCTAGGGGTGGAAGTCCTTATCTCCCAAGAGGGCTACAACCAGTAGCTGGTAAAGATATGAGACTACCGCACAACACCACCCCGAAACCTACTTCTACCTCTTCTACTACAACTACAACTACTACAACAACCACTACTACTACTACAACAACGCCTCCACCAACAACAACAACTACTACAGAACCACCAACAACAACCACAGAACGAATAACAACAACAGTCGCAACACAAAAACCCACATCAACCGCTGTGCCTGAAGAAGAATATGAATATTATGACGAAGAAGACATAGAATATGAAACAAAACACAAAAAAGAACCAACGACAACTATTCAGTCAGAAGTTGAAGAAATTAAAGCAATTCCTACCACCACGACTACACCAGCACCCACCACGTTAAAAACAACAACTCAAAGATACATAGAGTCACATAATGATAGAGCAAAAAATTTAGCAACGAAAGTATTGAGGAATTTTAATGAAAATTACGAAGCCATTAAAGAAAAGCTAGAATCAACTCTGTCACCAGGAGATTATCTGAAACCGTACATCCCTTATAGAGATATCAATGTCAACAAGCCAGCTCCGAGGCCTGAAATATCAAACGGTTACACGGCCACAGGGAAACCATTGAAAATACCAGACATCAAACAGAATCTGGCTGATTCAATTGAAAACGAATACGACATTAGGTTAAACGAGGCGATCTCGCCCGTAAGAACTCCGAGCGGCTTCATCGTACCCAACGACAGAGATTATTCCTTCTCGAGGTACAGAAACAATATTCAGTCTGAGCCTCAGTACTCCGCCTCAGAAATAGGGGCTTACCAAGTTAGGAAACGACCTCAAATCTCTGGTGTCACGATCCGAACTCCCGCTTCGTATTTTTTACCGCAAAGGGTCATTTACGACGACGGTTCAGTGAGACAGACTCGCCAACTAATATATAGACAAATAGGAGATGTATATTAA

Protein sequence:

>DPOGS214546-PA
MGSMCVTLAALTDLPTTTAVRATSLIFSKTTTSTSTAGPEEEPSAPDDEAQVEPSAEGEEGNATSKYTGIPQIDYILDPNLPRELNGYNLSQYPFYEAVPPPETMDFKCDGLHDGFYASIPHKCQVYHHCLFGTRYDFLCANYTAFDQKTFICHFVSEVDCKNSAKYFSRNEALYKAASTDPPSTTSTTTTTTTTTPRPPRPGRRRPHPRYDYYDDDYYYPARDDYDYEERGGRRNRPRRPGKRRPQVDYDDRYETRSRPRDDVDEDYEDRRPYERPRAAKRPDYRRPYDDEDRRPYKGGRRPGKPRVDEESYGLEDEENDRPRDSKRGEFRPRDKFRPRDEERSRDEIRPRDELNSRDEIRLRDEGKSKDDTDSRDVIRPRDENKYKDEDSLEMKYGPRDEIRPRDEIRPRDEIRPRDEIRPRDKKRPRPLRDEVPAIESNPRERVRDDRYQSTEGRRLYDRPYRPREDRPRNKPDQDDDEDRPVEVRQSPSTADSQPLVKPNGRGIFSKPRMPPKIKRPVPINEKEKYEYVTITTTKAPPKLADDEYYDEYDEEDSRRPLPSAKTSTIQQKPKPEPIEREKFKPTRSASIEKFKKPKVPIDYDDEEYYYESSARPLKYSNIQRAKDERPSKAKDPDYFYDDKEKVTENEKFKEKEEEITSKIRDIKPNVKVFRRPFLPSRGGSPYLPRGLQPVAGKDMRLPHNTTPKPTSTSSTTTTTTTTTTTTTTTTPPPTTTTTTEPPTTTTERITTTVATQKPTSTAVPEEEYEYYDEEDIEYETKHKKEPTTTIQSEVEEIKAIPTTTTTPAPTTLKTTTQRYIESHNDRAKNLATKVLRNFNENYEAIKEKLESTLSPGDYLKPYIPYRDINVNKPAPRPEISNGYTATGKPLKIPDIKQNLADSIENEYDIRLNEAISPVRTPSGFIVPNDRDYSFSRYRNNIQSEPQYSASEIGAYQVRKRPQISGVTIRTPASYFLPQRVIYDDGSVRQTRQLIYRQIGDVY-