Monarch geneset OGS2.0

DPOGS201322
Transcript: DPOGS201322-TA | 1686 bp
Protein: DPOGS201322-PA | 561 aa
Genomic position: DPSCF300176 + 409974-411997
RNAseq coverage: 45x (Rank: top 71%)
Annotation
Heliconius: HMEL017242 | 0.0 | 72.52%
Bombyx: BGIBMGA003115-TA | 0.0 | 67.08%
Drosophila: CG6947-PA | 2e-18 | 21.38%
EBI UniRef50: UniRef50_E0VVF1 | 5e-76 | 31.29% | Chitin binding peritrophin-A, putative n=1 Tax=Pediculus humanus corporis RepID=E0VVF1_PEDHC
NCBI RefSeq: XP_002430095.1 | 1e-76 | 31.29% | chitin binding peritrophin-A, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242019291 | 2e-75 | 31.29% | chitin binding peritrophin-A, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|242019291 | 4e-87 | 29.43% | chitin binding peritrophin-A, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0008061 | 7.9e-10 | chitin binding
Gene Ontology: GO:0006030 | 7.9e-10 | chitin metabolic process
Gene Ontology: GO:0005576 | 7.9e-10 | extracellular region
KEGG pathway:
InterPro domain: [23-96] IPR002557 | 7.9e-10 | Chitin binding domain
Orthology group: MCL20581 | Insect specific

Nucleotide sequence:

>DPOGS201322-TA
ATGTCTGCGCGTGCGCCCGCCAGCACCAGTGTGTTCGTGATCACCACGACCGCGCTCGCCGCCGCCGCATTACTACCACACGGAAATGACATCGTGTGTATTAAGGATGGTCTCAGAGCAGACTACTCGACGGAGTGCCAGGAGTATGTGCGCTGTTCGGACGGAGTAGTGAAACAGCGTTACGCTTGCGGACCGGGACGTCTGTTTAGTGAAATAGCCGGGGCTTGCGTGTTGCGTCGACGATACTCCTGTACCCGTAAGGTATGCGCTCCCGGAGACACGTTCGCCTATGCTACACCGGCAACTGCCTGCCGTCACTATTACCGCTGTGAGAACGGAACGGCCATCGATCACGCCTGCCCACCGGGATCCTGGTTCGACTTAGCGAGGCAGGCCTGTTCCCGGGGGGCCGGGACTTGCTACGAACCCGTCTGTGCTGGATTACCGGATGGCGTGTTCCCGGACACCTCTCACGAATGTCGAAGAAAACTGCGATGCCAGGGTGCGGAACTGCGTGCCGTCAGCTCGTGTGTGGGACCGTGTTCAGACACCTGTCCTCCGCCACGATCTGCAGCTGTTCCTCTACCGGCAGGAGACGCAGACTTCTGTTCAGATGAAACTTGTTCTTCATTTTGCCAACAAAAAGCGAACGGAGCTTACGCAGATCGATCGACCGGATGCCGTGAGTACTTCGTGTGTGAATCGCGGCGAGTCATACGGCGAGGAGTCTGCGAACCTGGGCTATTGTTTTCCGGAAGCGGATGTGAACCAGCAGCTCAAAGCTATTGCCCTCCACCTGCCCGAAGTCCTTGCTTTAATAGACCGAACGGCCTATACAGGGACTGGATTGATTGTTCGTCTTGGTACGAATGCTACCGTGAAAGAGTAACCGCTCGAGGCACGTGCGAAGCCAACTTTGTGTTTGACGGTTTCGGCTGCGTTCCGAAAGGGGATTTTTTCTGTGAAGGACCCGCAATGGCTTCAGAATGCGAGGGTATGCCTAGTGGGACATATCAGGACCTGGGATCGAACTGTACCAAATACTACCACTGCGAGGGTTCGTTACGTACGATTCTATCCTGTCCGGAAGGCCAGATTTTTGATGGAGCAAGATGCTCCTCAGAATCTCAGTCTTTGTGCCCGAGTCTAGAACCAGATTCCTGCTACGGCCGCTCAGACGGCCGTTACCGTTCCTCCGACACGGGTTGTCGCGGTTTCTATTCATGTATCAATGGACAGAAGGCGGTGTACGCCTGTCCAGTGGGCAAGGCGTTCGACGGTGACACGTGCGTCTCGTTTCACCCTTCAGTGTGTCCACGCGACGACTACTCATGTTCCACTCTCTCGGATGGATATCACGCAGAACTGGAGTCTAACTGCCGCAGATACTTCTACTGCGAGGGCGGCGACAGATTAGCGACAAGGTCCTGCCTCGGCGGGAAGATATTCGACGGTCACACGTGCGTCGAACCCTCACAGCACACCTGCGGGGCTCCGAGGAGGAGTACCTATGAGAACGGCGGCAAGACCTGCGAATCCGAAGGTTTCTTCGTGCAGTTGGGCACTGAGTGTAAAAAGTACTATTTCTGTTTGACCGGCATAAGGACGAGCCTGTCTTGTTCGGCGAGACAGCTGTTCAATGGGCAAGTCTGCGTCCCCGAAGAACAGTACACCTGTCCAGGCTGA

Protein sequence:

>DPOGS201322-PA
MSARAPASTSVFVITTTALAAAALLPHGNDIVCIKDGLRADYSTECQEYVRCSDGVVKQRYACGPGRLFSEIAGACVLRRRYSCTRKVCAPGDTFAYATPATACRHYYRCENGTAIDHACPPGSWFDLARQACSRGAGTCYEPVCAGLPDGVFPDTSHECRRKLRCQGAELRAVSSCVGPCSDTCPPPRSAAVPLPAGDADFCSDETCSSFCQQKANGAYADRSTGCREYFVCESRRVIRRGVCEPGLLFSGSGCEPAAQSYCPPPARSPCFNRPNGLYRDWIDCSSWYECYRERVTARGTCEANFVFDGFGCVPKGDFFCEGPAMASECEGMPSGTYQDLGSNCTKYYHCEGSLRTILSCPEGQIFDGARCSSESQSLCPSLEPDSCYGRSDGRYRSSDTGCRGFYSCINGQKAVYACPVGKAFDGDTCVSFHPSVCPRDDYSCSTLSDGYHAELESNCRRYFYCEGGDRLATRSCLGGKIFDGHTCVEPSQHTCGAPRRSTYENGGKTCESEGFFVQLGTECKKYYFCLTGIRTSLSCSARQLFNGQVCVPEEQYTCPG-
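
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the transcript is coding sequence only (start codon through stop codon) and is translated with the standard genetic code. The names `translate` and `expected_protein_length` are hypothetical helpers introduced for illustration; the stop codon is rendered as `-` to match the convention used in the protein sequence above.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Assumes the input is in frame and contains only A, C, G, T.
    Stop codons are written as `stop_symbol`.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


def expected_protein_length(transcript_bp):
    """Residue count for a CDS-only transcript that ends in one stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


# First five codons and last three codons of DPOGS201322-TA.
print(translate("ATGTCTGCGCGTGCG"))   # MSARA
print(translate("CCAGGCTGA"))         # PG-
print(expected_protein_length(1686))  # 561
```

Under these assumptions, 1686 bp corresponds to 562 codons, that is 561 residues plus the stop codon, which agrees with the 561 aa listed for DPOGS201322-PA.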