Monarch geneset OGS2.0

DPOGS204347
TranscriptDPOGS204347-TA2973 bp
ProteinDPOGS204347-PA990 aa
Genomic positionDPSCF300142 + 266668-269737
RNAseq coverage267x (Rank: top 40%)
Annotation
HeliconiusHMEL0070100.039.63% 
BombyxBGIBMGA007250-TA9e-14737.66% 
DrosophilaMur89F-PA2e-3034.76% 
EBI UniRef50UniRef50_B4N9S31e-5140.25%GK11459 n=2 Tax=Eukaryota RepID=B4N9S3_DROWI
NCBI RefSeqXP_002069652.13e-5240.25%GK11459 [Drosophila willistoni]
NCBI nr blastpgi|1954439545e-5140.25%GK11459 [Drosophila willistoni]
NCBI nr blastxgi|1954439542e-10227.57%GK11459 [Drosophila willistoni]
Group
Gene OntologyGO:00080617.4e-13chitin binding
GO:00060307.4e-13chitin metabolic process
GO:00055767.4e-13extracellular region
KEGG pathway 
InterPro domain[360-419] IPR0025577.4e-13Chitin binding domain
Orthology groupMCL16483 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204347-TA
ATGTGCCACAACGAAGGATTCCAGGCCGATCCAATGGATTGTTCTGTTTTTTACCGATGTGTTCAATCATCTCGTGGAAAATTTAACGTTTTTAGGTTCCAATGCGGTCCAGGCACCGTGTATGATCCCGAGACAGAAATTTGTAATCATGCTTCTAATACAAAGAGATCTGAATGTGGTGCTGCAATTTCTTCAAACAATCTCGAAATTAACGAAAATGGTCTTGACATGCAAGAACTGCCATCACCTATTACGACATCACCGTTAAATATTCAAATGAATGGTTCAGTGACCTCAATATCAATTTCAGAGAAAGATCTACATATATTGAATAGTACAACAGCCAGTTTAACAACCACTGCAGCAGTCAATCAACAATTCTCAGAAGCATTTGATATTTCAGGGATGTACCCATGGGATAATAAAAACAGCGATTTCACTCCAAATCCAATATTATCAAAACCAGCACTGATAGAAAAAAATAATCCGTGCACTTTTGATGGATTTGTAGGTGATAGTTATGATTGTAAGAAATTCTACAGGTGTGTCAACAATTTTCGAGGAGGTTTTACACGTTATGAATTTGTATGTAGTGATTCAACGATATGGGACGATGATAAACAAAGTTGTAATTACGCATGGGCTGTTCGAAGTAGAAGGTGTGGTCGCGATGCTACTCGTAATAACGTCTTAAATGCCATGACTACCCATTACGAAAACGAGAATAATCTCGTGGATAAAGATCCGAATTTAAATCATTTCGGTGAAAAAATTCAAAATAGCTATATATCTAACGATAGTCCTACAAGTATAAATCAAAATAAAGATCATACACGTATCACAAGTAGTAGTTTCAGTATGGTGACAATACCTTATGACAACGCAAATATTGTGACCGATAAGAATGAAGACCCTGCTCATTTCATTGACAAACAAAACAATAGCTCTTTATCTGGAACAAGTGACAGTCAAAATCAGGAAATTAAAATAGAATTCACTATGTCTTCAAATCCATCTTCATCAACAGAACACACATTGTTAAGCAGCAGTGTAAATGACTCCTGGAAATTATTGAATAATGTATGTACACAAAGTGGTTTTATTGGTGATCCGAATGATTGTAAAAAATTTTATCGATGTGTTGATAACGGTCAAGGCAGTTATACGAAATACGAATTTTCATGTAGTCAAGGGACTCTTTGGGATAGTGAGATAGAAGCTTGTAATCATGCATGGGCTGTGAAAGGCTGTGGCAATACAGCTTTCACTCAAAGCTTAACGACAGAAACTAGCGTCTCAAATGTAGTTAGTTCAGAGATAAACCCTTTGCCTGACGGAATTAATAATGTTGATGATGACTTCGGTTATCCAAATCTAGTGGATCAGGATCATATGAAAACAACTACTTCAAAAATTGAAACAACGTTTTCTTTGCCAGCAACAAACGAAGAAGCAAACATTTTGGATAAATCATGTTCAACAAGTGGTTATTTTGTAAATTCGCTAGATTGTTCGAAATTTTACAGGTGTGTTGAGAACGGAAAAGGGTCTTTTACTAGATTTGACTATAACTGCGGTGAAGGGACTGTCTGGGATGAAAGTATTGAGGCTTGTAACCATGCCTGGGCTGTAAAATCTTGTAGATCAAATTCAGAAAACTTTATTGATTCTGAACAAATGACAACCATTACTCTCGTTCAAACTTCAACACAAAAAAATAAAGATAACAATGATTACGATTCTATCTACGGTTCACAACAAAGTACAATGGGATCAAACATAGTCACAGAGCTTTCGACAACAACATTGAAACAAACACTGCTTTCTCAAAATGAATGTACCACAAATGGGTTTATAGGAGACAGTAGGGATTGTAAAAAGTTTTATAGATGCGTAGAAAATGGTGATGGTGGGTATACGAAATATGAATTTTCTTGTGGCGATGAGACTGTTTGGGATCCTGTTATCGAAGCTTGTAATCATAATTCAGGAGATAAAGATTGTACAAGATCGTCTAACAATAATTATAATACTGTAGAACCAATTAATAGTAATGAAGATGTAGGAAATCATTATGTTACATCAAGTAGTCAGGATCCAGAAAATCCAGCCCAAAGCCAAAGTACAACAAGCGTTGTTCCAAGTAACAATAATCTGTGTGAAACTGCTGGATTTATGGGAGATTCAAATGATTGTGAAAAATTTTATAGATGTGTTGAAAATGGTAAAGGAGGATATAATAGGATTGAATTTAAATGTGCTGAAGGGACAGTTTGGGATTCCAGTATTGAAGCGTGTAATCACAGATGGGCGGTAGAAAACTGCGGAAAAGATTCTGCTAATGAATTTATAGAAACTACTATTGATTCAATGAGTACTGTTACTGACAAAACGTCAATTTCTACAAAATCTCCTGAATATTCAGAAACTTTAGCTCAATATACATCTACAGAAAGGACTAATGTGTCCGAGGATTCATGTTCCTCAGAAGGATTTTTCGGATCAGTAAACGGGGAATGCAATAAATTTTACCGATGTGTTGATAACGGAAGAGGTGGTTATTATAAATATGAATTTACATGTGGTGATGGGACAGTTTGGGATGAAAATATTAAAGCTTGTAATCATGATACATATAATAAAACTTGTAGAATTTCTGACAGTAAACCTCAAACTGATACGACCATTTCAACTGACGGACCTAAATCGACAACACACGCTTCAATCACAAATGTTCAGGAACCTTCAAAACCAGATGATAAAGAATGTAAATCTGAAGGCTTTATTCCTAATCCTTTAGATTGTCACAAATTCTTCCGTTGTGTTGATAATGGTGAGGGTGGTTATACTAAATTTGAATTTTCATGCGGAGAAGGAACAGTTTGGATTCAAGAAATTCAAGCTTGTGATCACGATACAGGGGAAAATAGCTGTAACCAGCAGAACAACAACAACAACGTTATAACAAGATAG

Protein sequence:

>DPOGS204347-PA
MCHNEGFQADPMDCSVFYRCVQSSRGKFNVFRFQCGPGTVYDPETEICNHASNTKRSECGAAISSNNLEINENGLDMQELPSPITTSPLNIQMNGSVTSISISEKDLHILNSTTASLTTTAAVNQQFSEAFDISGMYPWDNKNSDFTPNPILSKPALIEKNNPCTFDGFVGDSYDCKKFYRCVNNFRGGFTRYEFVCSDSTIWDDDKQSCNYAWAVRSRRCGRDATRNNVLNAMTTHYENENNLVDKDPNLNHFGEKIQNSYISNDSPTSINQNKDHTRITSSSFSMVTIPYDNANIVTDKNEDPAHFIDKQNNSSLSGTSDSQNQEIKIEFTMSSNPSSSTEHTLLSSSVNDSWKLLNNVCTQSGFIGDPNDCKKFYRCVDNGQGSYTKYEFSCSQGTLWDSEIEACNHAWAVKGCGNTAFTQSLTTETSVSNVVSSEINPLPDGINNVDDDFGYPNLVDQDHMKTTTSKIETTFSLPATNEEANILDKSCSTSGYFVNSLDCSKFYRCVENGKGSFTRFDYNCGEGTVWDESIEACNHAWAVKSCRSNSENFIDSEQMTTITLVQTSTQKNKDNNDYDSIYGSQQSTMGSNIVTELSTTTLKQTLLSQNECTTNGFIGDSRDCKKFYRCVENGDGGYTKYEFSCGDETVWDPVIEACNHNSGDKDCTRSSNNNYNTVEPINSNEDVGNHYVTSSSQDPENPAQSQSTTSVVPSNNNLCETAGFMGDSNDCEKFYRCVENGKGGYNRIEFKCAEGTVWDSSIEACNHRWAVENCGKDSANEFIETTIDSMSTVTDKTSISTKSPEYSETLAQYTSTERTNVSEDSCSSEGFFGSVNGECNKFYRCVDNGRGGYYKYEFTCGDGTVWDENIKACNHDTYNKTCRISDSKPQTDTTISTDGPKSTTHASITNVQEPSKPDDKECKSEGFIPNPLDCHKFFRCVDNGEGGYTKFEFSCGEGTVWIQEIQACDHDTGENSCNQQNNNNNVITR-