Monarch geneset OGS2.0

DPOGS201723
TranscriptDPOGS201723-TA2178 bp
ProteinDPOGS201723-PA725 aa
Genomic positionDPSCF300269 - 31870-58764
RNAseq coverage242x (Rank: top 43%)
Annotation
HeliconiusHMEL0093593e-1429.69% 
BombyxBGIBMGA009809-TA1e-0950.00% 
DrosophilaTequila-PA1e-0630.56% 
EBI UniRef50UniRef50_Q8ISS23e-1135.48%Peritrophic matrix insect intestinal mucin (Fragment) n=1 Tax=Plutella xylostella RepID=Q8ISS2_PLUXY
NCBI RefSeqNP_001161929.12e-0631.58%peritrophic matrix protein 14 [Tribolium castaneum]
NCBI nr blastpgi|246379721e-1035.48%peritrophic matrix insect intestinal mucin [Plutella xylostella]
NCBI nr blastxgi|1157250926e-7542.40%PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
Group
Gene OntologyGO:00080612.5e-09chitin binding
GO:00060302.5e-09chitin metabolic process
GO:00055762.5e-09extracellular region
KEGG pathway 
InterPro domain[192-279] IPR0025572.5e-09Chitin binding domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201723-TA
ATGAAACCTTTAGAGAGCCTCATGACTAAACTTTATTCTGATTTGATTGTTAAAGAAGATGATAAAACAGCGATCGATAGAGAAACTTTGAGAAATCAAGATTTAGATAATATTATGTACGTAGTGTCGGAATGGTTAGAAAAAATACCAGTGTTCGCTAACTTTACTGCTTGTGATAGGAGGAAACAAGAAGAAATGGCTCACAGATTAGCAAAAAACCTGAGTAAATTAAAAAATCAAAAAGATTTTGTTCCAAAGGCTAAGTTAGAAATAATGGAGTACCTCAATTCTTCACCTATTTGGCAACCTACTGAATTATCGGCGAAAAATCAGTTCTTTGAAACCCTTGTTTTCGATCTAATCCAGAGGTTAAATAATATGTTTGGTCAAGAAGTAGAACAGAATATAGCTAAAATATTGTCACGTTTACCCTACAAAGAAGGTGTTGATATTAAAAAAATACAGGAAGATTTTATAAGTACAGTCAAACCACAAATTTTACAAATACCAAAAGGTCATCCCAATAATGAAGAGGGCATCGTAGGCTGTTCCTACGGTCGGCCCCAGAGCAATGACCTCCAAGTCCCGGGTGCTACTTGCCACCCAGGATCCAAAGACCAGCTTGTACCCCATGACAGTGATTGCACGAAGTTCTATTCATGTCTGAGCGGCCGGCGATCTCTAGAACCATTGCAGTGCCCTCTGGGGACGGAGTTTTCGTCCAAATACCAGGCGAGTTATAAAATAGCTAATGTCCTGAGAGATCCCATCGTCTGTGTACCCGCCGCTCTATCTGATTGTCAACGGTCAGGAGATATTCAATATCTACCTACAAGTTTACCAGACGATGTCCTCCGAAACGGATGTCCCAAGGATTTTAGCAGACAGTTGCTTATGCCTCATAAATTAGACTGCAATAAGTATTACTACTGTGACCGTGGCGAACTAAAGTTATCCGCCTGTCCGGACCTGAAGTTATTCAACTTCGAAAAACAAATCCTTAAATATAACTTCAGGTATGCGTTTTTCCTGAGGAAGCTGGATGTGATGGAAGTAATGGCAACAACGGAGGTGGCGGTGGCAACGGAAGTGGCGATGGCAACGGAGGTGGCGGATGCAACGGAGGTGGAGCATTCAGCGGAGGTGGCGGATGTAACGGAGGTGGAGGAGGCAACGGAGGTGGAGGCGGAGGCGGAGGCGGAGGCGGTGGAGGCGGTGGAGGCGATGGAGGTGGAGGTGGAGGAGGAGGAGGTGGAGGTGGTGAAGATGGTGGAGACTTCGACAAGCTGCCCAACGGCTGCCCATCAGACTGGAACATTAATTGGCACCTTCCTCATGAATCTAACTGCAGCAAATTTTATCAATGTGTCTTTGGAAAGAAAGTTTGTGATTGGCCTCAAAACGCGGGTTGCTCTAATAATGGTGGAGGCGGTGGAGGTGGAGGAGGTGGAGGTGGAGGTGGTGGAGGCGGAGGTGGAGGCGGAGGCGGAGGAGGAGGAGGTGGAGGAGGCGGTGGATGCGATGGAGGAGATGGAGGCGATGGAGGAGATGGAGGAGATGGCGGCGGAGGTGGAGGTGGCGGTGGTGGAGGCGGCGGAGGCGGAGGAGGTGGAGGAGGCGGCGGAGGCGGAGGAGGCGGAGGAGGTGGAGGAGGTGGTGAAGGTGGTGGAGACTTTGACAAGCTGCCCAACGGCTGCCCATCAGACTGGAACATTAATTGGCACCTTCCTCATGAATCTAACTGCAGCAAATTTTATCAATGTGTCTTTGGAAAGAAAGTTTGTGATTGGCCTCAAAACGCGGGTTGCTCTAATAATGGTGGAGGCGGTGGAGGTGGAGGAGGTGGAGGTGGAGGTGGTGGAGGCGGAGGTGGAGGCGGAGGCGGAGGAGGAGGAGGTGGAGGAGGCGGTGGATGCGATGGAGGAGATGGAGGCGATGGAGGAGATGGAGGAGATGGCGGCGGAGGTGGAGGTGGCGGTGGTGGAGGCGGCGGAGGCGGAGGAGGTGGAGGAGGCGGCGGAGGCGGAGGAGGCGGAGGAGGTGGAGGAGGTGGTGAAGGTGGTGGAGACTTTGACAAGCTGCCCAACGGCTGCCCATCAGACTGGAACATTAATTGGCACCTTCCTCATGAATCTAACTGCAGCAAATTTTATCAATGTGTCTTTGGAAAGAAAGTAA

Protein sequence:

>DPOGS201723-PA
MKPLESLMTKLYSDLIVKEDDKTAIDRETLRNQDLDNIMYVVSEWLEKIPVFANFTACDRRKQEEMAHRLAKNLSKLKNQKDFVPKAKLEIMEYLNSSPIWQPTELSAKNQFFETLVFDLIQRLNNMFGQEVEQNIAKILSRLPYKEGVDIKKIQEDFISTVKPQILQIPKGHPNNEEGIVGCSYGRPQSNDLQVPGATCHPGSKDQLVPHDSDCTKFYSCLSGRRSLEPLQCPLGTEFSSKYQASYKIANVLRDPIVCVPAALSDCQRSGDIQYLPTSLPDDVLRNGCPKDFSRQLLMPHKLDCNKYYYCDRGELKLSACPDLKLFNFEKQILKYNFRYAFFLRKLDVMEVMATTEVAVATEVAMATEVADATEVEHSAEVADVTEVEEATEVEAEAEAEAVEAVEAMEVEVEEEEVEVVKMVETSTSCPTAAHQTGTLIGTFLMNLTAANFINVSLERKFVIGLKTRVALIMVEAVEVEEVEVEVVEAEVEAEAEEEEVEEAVDAMEEMEAMEEMEEMAAEVEVAVVEAAEAEEVEEAAEAEEAEEVEEVVKVVETLTSCPTAAHQTGTLIGTFLMNLTAANFINVSLERKFVIGLKTRVALIMVEAVEVEEVEVEVVEAEVEAEAEEEEVEEAVDAMEEMEAMEEMEEMAAEVEVAVVEAAEAEEVEEAAEAEEAEEVEEVVKVVETLTSCPTAAHQTGTLIGTFLMNLTAANFINVSLERK-