Monarch geneset OGS2.0

DPOGS214868
TranscriptDPOGS214868-TA1863 bp
ProteinDPOGS214868-PA620 aa
Genomic positionDPSCF300091 + 241056-244723
RNAseq coverage450x (Rank: top 27%)
Annotation
HeliconiusHMEL0070200.061.10% 
BombyxBGIBMGA010077-TA0.057.93% 
Drosophilamtg-PD4e-7239.79% 
EBI UniRef50UniRef50_B0WJ011e-7242.17%Chitin binding protein n=2 Tax=Culicidae RepID=B0WJ01_CULQU
NCBI RefSeqXP_397001.29e-7466.01%PREDICTED: similar to CG7549-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3800287405e-7365.52%PREDICTED: uncharacterized protein LOC100869445 [Apis florea]
NCBI nr blastxgi|1571250728e-7741.59%hypothetical protein AaeL_AAEL010064 [Aedes aegypti]
Group
Gene OntologyGO:00080611.1e-06chitin binding
GO:00060301.1e-06chitin metabolic process
GO:00055761.1e-06extracellular region
KEGG pathway 
InterPro domain[294-375] IPR0025571.1e-06Chitin binding domain
Orthology groupMCL17844 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214868-TA
ATGATGAGGCCGGTGGCACCCGTCACAAAACCGAACCAGGCGTCGATGGACAAGTTGTCATCCTTACACAGACGTGTTTCAACCATGGATCCTATCGTGGTTCCCAGCCCCGAGACGCCCGTCTTCAAACCGCAACCCGCGGTAACCATGGCTAAACCCATGCCCATGGGCATACCCGCCTTCAAACCATTACCACCTAAGGAAGAAACCCTGAAGGTGCTCCCAATAGCCGCCAAGAAGACGTACTCCCCATGGCCCGTCCCGCAATCCAAACCAGGAACTGTCTACCAAAGTGAGATATCACCAATACAGTACCATACCATGAAAATCACCAAACCACCATACAAGTCATCCAACATCAACCCATTCGTTCCCATACCGTCACAGACATCTATCATACCATCGAATCACTTGCCGATCCAAACCAATCCCCTTTATAACTTTGTAGACCCAGTACCTCACCAGGTCCACGACATAGCGAACAGCTATCCAGCTTACAGCACCAAGAAGATAGACGATTACAACAGCATCAGCGGCTACAGTGACGACACCGCCTTGAAGTTCAATAAAGAAGATATAGACGCTAAGATAGCCGAGATAGCCAAGGCTGGGAACATCTCCACTGAAGCCGTCGAGGCGGCTATAGCGCTCAGACAACAGCAGCTGTTGAGCAAATACGCTAACATACCAGCTCCGACAACCACTACAACAACGACCACAACAACAGAGACTGTTGTGTATCAAGAACCCGAGCCGGAAATCATAGTGGCGCCTGTGCCGCAGAAACCGAAGAGAAACCCAACAACAGGCAAAGTGATGAACGCCCCTCGCGAGTATTACCCTGTGGGATACGAAAAGAACTTCGATGATCACTTCCAGTCTAAAGTGGATCTTCCAGACACCAGCTTCCACTGTGGGGATCAGAAATACTTTCCCGGACTGTACGGGGACGAGACGCTAGGATGTATGGTGTTCCACGTGTGTGCTCTAACAGACGACGGGCTGGTGATGAAGTCGTTCCTGTGCCCGGAGTCCACGCTCTTCGACCAGACCATCCTCAAGTGCAACTGGTGGTTCTACGTCGACTGTAAACACACCACCAGCCTCTACGACACAAACATACCCGTCTCCAAGAGCTACCAGCTCATGAAGGCCCTCACCTTCTTCACCTCTTACAAAAAGGAAATGGAAAATGGTCAGGCCATGAACCCTGAAGATATAGGCGGAGTTAAAGACGCCATAACGATACTACAGAACCAAGATGCCAAGACGGCCGAGAGTGAACGCGTACAAGTAGTGACGTCACATCCGCTCATAGACGAGAGAAGCAGGAGAGACAACTATACAGCACCGGTCTACAGAGGAACCAGAGAATATAACGAGACAGAGAAAGAGACAGAGAAAGACTTCAAATATGTTAGAGTTAGAGCAGACATTAGTTACAGCAACAAGACTAGCAGCACGGAAGAGAAGAATAGACGGAGGCTGGCTCACAGAAACAGAGGCGGGGCTGTTACTACTACTACAACTACAACAACTACTGCAACTACAAAGAAACATGCTGAGAGTCCACTTGTAGAGATCATGAAGCAAGAAATACAACCCATAGAGGTAGAGACCAAGAGAGAGACTCTCAGAGACCAGCTAGACGATACAGAACAAGACAAGACAGACACCCGACCTGTTAGAAGGTTCTACAGATCGAGTGGAGAGAGAGGAGATACGAGGGAGATAAGAGGGACAGAGATAGAGATAGTCAACAACGAGAGGGTGCAGATTGTGAGACCGGCTAACACCGAGTCTAGACTAGACGACGCGGTGCTAGGCAGGTCTAGACACAAGGACAGAGATCAATATACATAA

Protein sequence:

>DPOGS214868-PA
MMRPVAPVTKPNQASMDKLSSLHRRVSTMDPIVVPSPETPVFKPQPAVTMAKPMPMGIPAFKPLPPKEETLKVLPIAAKKTYSPWPVPQSKPGTVYQSEISPIQYHTMKITKPPYKSSNINPFVPIPSQTSIIPSNHLPIQTNPLYNFVDPVPHQVHDIANSYPAYSTKKIDDYNSISGYSDDTALKFNKEDIDAKIAEIAKAGNISTEAVEAAIALRQQQLLSKYANIPAPTTTTTTTTTTETVVYQEPEPEIIVAPVPQKPKRNPTTGKVMNAPREYYPVGYEKNFDDHFQSKVDLPDTSFHCGDQKYFPGLYGDETLGCMVFHVCALTDDGLVMKSFLCPESTLFDQTILKCNWWFYVDCKHTTSLYDTNIPVSKSYQLMKALTFFTSYKKEMENGQAMNPEDIGGVKDAITILQNQDAKTAESERVQVVTSHPLIDERSRRDNYTAPVYRGTREYNETEKETEKDFKYVRVRADISYSNKTSSTEEKNRRRLAHRNRGGAVTTTTTTTTTATTKKHAESPLVEIMKQEIQPIEVETKRETLRDQLDDTEQDKTDTRPVRRFYRSSGERGDTREIRGTEIEIVNNERVQIVRPANTESRLDDAVLGRSRHKDRDQYT-