Monarch geneset OGS2.0

DPOGS207992
TranscriptDPOGS207992-TA1773 bp
ProteinDPOGS207992-PA590 aa
Genomic positionDPSCF300090 + 804566-808111
RNAseq coverage39x (Rank: top 73%)
Annotation
HeliconiusHMEL0060205e-6666.67% 
BombyxBGIBMGA000347-TA3e-6074.81% 
Drosophilal(3)mbn-PB7e-2532.88% 
EBI UniRef50UniRef50_C0H6L36e-5874.81%Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6L3_BOMMO
NCBI RefSeqXP_970663.16e-3071.95%PREDICTED: similar to Protein lethal(3)malignant blood neoplasm 1 (Mbn(83)) [Tribolium castaneum]
NCBI nr blastpgi|2236711452e-5774.81%TPA: putative cuticle protein [Bombyx mori]
NCBI nr blastxgi|2236711451e-8537.86%TPA: putative cuticle protein [Bombyx mori]
Group
Gene OntologyGO:00423022.7e-07structural constituent of cuticle
KEGG pathway 
InterPro domain[30-76] IPR0006182.7e-07Insect cuticle protein
Orthology groupMCL17463 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207992-TA
ATGGCATATTTAGGAATATATTTCTTGATTATCTTCTACGTTTGCTTAACACAAGCAGCCGATAAATATACTGATGCGAACAGACCTTACGAGTTCGGGTTCACGATAGAAGGGGAGCAGCACAGACACGAAAAAAAAGATGAAAATGGGATAATCATGGGTGAATTTGGTTTCATCACTGCTGATGGAGTGTATCATGTAACAGTTTATGCTACCGACGAAAATGGCAATTTTAAAATTTTGTCTATGAAAAACATCAGAGTGAAACCATATCCTACGTCTTCAGGCGCGGAAACCAAAAAGATAGTCTCTGTAAATAATTCTGCTAAGCCGAAAGATTCAATCCCTGAATCATCTAAGCAAGAAAAAATCATTAAACTTGAAGCTCCATCACCTGCAAAATCTTGTTCTCATTGTAGTCTCCCTGCCACCACCACATTATCACCAATTTTATCACAGGAAATTCCATTTACTTCTAATAAGCATTCAGTAGAAAGCATAACACCATTTTATGCGAACCATGATCAAATGAAACAAATATCAATATCAGAAAATATAGGTCATACGCAGCAAAACTACAATTCAAGAACTACTTCTTCGTCATCTGAGAATCAACATAATATTGAACAAAAACAAATCAATCAAAATATCGGAAACCAAAACTTTATCTTGAATAAAAACGTTAATTATCAGGAACACAGCGAAAATTCAGGGTCACACCAGGCCAATTCCCAGGCAAATGTAAACAAATATTCACAAACGGGAATGTATTCTAACGGATTATCCCAAAACAGCGATCGCGGTAGAGAAGAATTATTAACTCAACAAAATATTCCTCCTAATACAGACCAACAGAAACTAAAACATGAAGAATTGCTTCCTAGTACTGGAAAAAATTACAATGTTGCTCCTGAGACTGAAAATTTATCGCCGCTATCTAATCACAATATTTCTCCAATTTCAAGACATTTAGAAGGCCCTATAAATAACGATAACTATTTACAGAAACAAAGTATATCGGAAAAAAGACTACCCAACCTTGAAGATTTCAGCTTTAATCAAAATAATATCCAAAATCAATCAAAAACACAACAAGAAAATCATAAAAACGGAATCCAAATTAACCGTGACGATTTACCAAGCTACCCTAGTAAGGATAACAGTGATCAGCAAACGATAAAACAAAAAGACGAACCTGTTGATTATGGTGTAGGTAAACTTCCTACCCAAATAAATACAAAAAGCGAAAGAAACCCAAAAAGTTTTTCTGACTTTATTCAGGAAGCAGTAGAACCTGAAAGAGTAGCAAATACAGGTCCTAACATATATAGACCATCATTACCAAACCTAAATGAATTTTCTGGTAAACCTGAATTATTAGCAGCACAAATTCAAAAAGTAGACAAAAATACAGATATTAATCACTTAAATCCTGGTGAAAGTGTCGGTCTTCCCAATGGAATAACCAAAGATGATATGTCTAATCTCCTTTACACCTTTAATTATACATTAGGATTTCATGGTCATCATGAGAAAGGCTATGCAAATGGTGTGAAACAGGGTTATTATTTTGTTACTGGTAGAAATGGTATACGGACAAGAGTTGACTATGTAGCTGATGAAACTGGTTTCCATCCAAAAATCACTCAAGAAGTTTTAGACATTCTTTCAGATGACGTACCAAAACCTGAAACAGAAAAAGATGAAAAATATGGACTCAAAGGATATGAATTCAAATGGCTTTACTTTCCAGTTGAAAAGCAATCTAAATAA

Protein sequence:

>DPOGS207992-PA
MAYLGIYFLIIFYVCLTQAADKYTDANRPYEFGFTIEGEQHRHEKKDENGIIMGEFGFITADGVYHVTVYATDENGNFKILSMKNIRVKPYPTSSGAETKKIVSVNNSAKPKDSIPESSKQEKIIKLEAPSPAKSCSHCSLPATTTLSPILSQEIPFTSNKHSVESITPFYANHDQMKQISISENIGHTQQNYNSRTTSSSSENQHNIEQKQINQNIGNQNFILNKNVNYQEHSENSGSHQANSQANVNKYSQTGMYSNGLSQNSDRGREELLTQQNIPPNTDQQKLKHEELLPSTGKNYNVAPETENLSPLSNHNISPISRHLEGPINNDNYLQKQSISEKRLPNLEDFSFNQNNIQNQSKTQQENHKNGIQINRDDLPSYPSKDNSDQQTIKQKDEPVDYGVGKLPTQINTKSERNPKSFSDFIQEAVEPERVANTGPNIYRPSLPNLNEFSGKPELLAAQIQKVDKNTDINHLNPGESVGLPNGITKDDMSNLLYTFNYTLGFHGHHEKGYANGVKQGYYFVTGRNGIRTRVDYVADETGFHPKITQEVLDILSDDVPKPETEKDEKYGLKGYEFKWLYFPVEKQSK-