Monarch geneset OGS2.0

DPOGS213501
TranscriptDPOGS213501-TA2145 bp
ProteinDPOGS213501-PA714 aa
Genomic positionDPSCF300033 - 1258468-1262283
RNAseq coverage35x (Rank: top 74%)
Annotation
HeliconiusHMEL0145272e-15047.12% 
BombyxBGIBMGA008136-TA8e-11740.00% 
DrosophilaCpr50Ca-PA1e-1553.03% 
EBI UniRef50UniRef50_C0H6Y52e-11440.00%Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6Y5_BOMMO
NCBI RefSeqNP_001166630.13e-11540.00%cuticular protein RR-2 motif 143 [Bombyx mori]
NCBI nr blastpgi|2905608716e-11440.00%cuticular protein RR-2 motif 143 [Bombyx mori]
NCBI nr blastxgi|2905608713e-12941.46%cuticular protein RR-2 motif 143 [Bombyx mori]
Group
Gene OntologyGO:00423022.1e-09structural constituent of cuticle
KEGG pathway 
InterPro domain[653-704] IPR0006182.1e-09Insect cuticle protein
Orthology groupMCL25431 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213501-TA
ATGGATCATCCAAAGCACTTGACATTAACGGTCTCCATATTGTTAGTTGCTTTTCAACTGGCATCATCCAATAAAAACGCCAGGCCGGTTTACGAGGTTATTAACGAACAGTCAAGAATTGATGATAGAGTCAGAATATCCAGCAGTCCAGGGTTACTTCCATATGGCAACCCTCTTGCGGGTAACTCCATACAAAATGCAAGATTACACGCCAGCAACCAGCAAGCCATGTTCAACGCTGAGAGGATACGCCAACATAAAGCCCAATTACAACGACAGGCCAATCTGCCAAGAGTCGTCGATAGCTATATGCAAGCGTACCACGAATCACAGGAGAGTCATCATCTAGCTCTCGAACGACAACAGGCGACAATTAATGATAATCCCACAACTAAGGCACCCGTGAGACCACAGCACAGACAAAAACAAAGAAATAGGAATAGAAATTATAGATCATATGGTCCAGTCGATTATCAACAATACTTGGAACCGCAGAAGTACAGAACTGTTTACGTGACTCCAACGACGGATAACGACCAGGCAGTGTCCATAAAAGATAACCTTAACGTACTAGGACATTTGAATAGCAAAACACCGACCAAACTTTACACGGAAGCGATCTCAAGCGATTCTAAATATGGGTATCCAAAACACTATACACCAACACAAAACTTGAAATCGATCCAAGATATTCAAGTTCTTAATTCATTACTGACCAAAAATCCAGTGGATCAATTGACAGAATTTAATGCACTCATAACATCTAGCGCTGAAAACGACAAAAAAGTACCAATCGATCTTTATTTTTATGTTAAAGATCCAAATAGCCAGTCATTATCTCAAGAACAATCAAAATACGCACCAAATTCTAATTCGTACATAATTCCATCAGATTTGAATATAAAAGATCACACGCCGATAACAGAAGATGTAGACGATATTGGTAATCCGATCGAAGACCAACAATATTACAGCCTTCAAACAATCCCAACGACTAAAGCAAGCGAAGCGACAACTACAAAAAACAATTACTACAAAGTAGAAGTGGCCAGTCAAACAATAACCGTACCGAAGCAAGAAGAATACAATCTGAATGATCCCACTTACCAGATATCGCATTACGGTCAAACTTATGGTAAAAACGACGCGAAAAGTGAACAATATTTGAACCTGCATTCACAACCGACCGGAGTCCAACATCTATCTGGCGACGGCACAGAAGTATCGGCTTACGGCGACGATGATGTAAGTATACCGAAAAAACTATCCATATCAATCCACATACGTCAGAAGCGATCCAAACTGGACACATCACCATTCCTTTTCACTGACGAAAAAACGCAATCCACAAAACGCACCGGCGATTTCAACGTTTTTCTATCCCCTAACTTTCCCGTCGGTGAGGCAGCCGACTATAATGATGATTACGATTACCAACAATTAAACTCGAAGCGATCCAAGCAAAAACTGGAATCGTTTTACGATGACGATTATTACGACGACTTTAGTTCAGACTATGATAGCCCTAGACAATATTTTCCCCCTCAATCCGAACCTAAGGGCTATCCTTCAAGAACGCGTAACTTTCGTCGCTCTCCAATTTCTCAGCTCTATGGGAATCCTCCCGCTACTTCTTATGGTGTTCCTATATACGGTACATCAGTGTTCAGCACAGCCCAAACTTTCATTCCTCCTAAATTAGAAGTACCAACATTAACAAACCAATTCCCGAATGTCATCGAACCCGTTTACATGCTCACGCAAACACAATTAAAAGAATTAGTAGGACATCACAATTTGAATATTGAACACCTCGATGTGTACCAACTTCTAAAAGAAAACAGAGCCAAGAAAACACCGTACTACCCAAGAAAATATCGTAAAAGAAATCCCTTTAGGCACCTCAGAAGTAATTTACATAAATTAAACAAATTTCACCTCAAATACGCTGCGAATTATGAATTTGGTTACCGTGTGAGAGACGCCAACACAGCTAATTACTACGGACATAGAGAGTCTAGGAACGGCTTGAAAACCAAAGGACAGTACCATGTGTTGTTGCCAGATGGTAGGATGCAGCAAGTAAACTACGTCGCTGGACCAGAAGGTTACCACGCTGATATAACGTACGACCAACCTCATTAA

Protein sequence:

>DPOGS213501-PA
MDHPKHLTLTVSILLVAFQLASSNKNARPVYEVINEQSRIDDRVRISSSPGLLPYGNPLAGNSIQNARLHASNQQAMFNAERIRQHKAQLQRQANLPRVVDSYMQAYHESQESHHLALERQQATINDNPTTKAPVRPQHRQKQRNRNRNYRSYGPVDYQQYLEPQKYRTVYVTPTTDNDQAVSIKDNLNVLGHLNSKTPTKLYTEAISSDSKYGYPKHYTPTQNLKSIQDIQVLNSLLTKNPVDQLTEFNALITSSAENDKKVPIDLYFYVKDPNSQSLSQEQSKYAPNSNSYIIPSDLNIKDHTPITEDVDDIGNPIEDQQYYSLQTIPTTKASEATTTKNNYYKVEVASQTITVPKQEEYNLNDPTYQISHYGQTYGKNDAKSEQYLNLHSQPTGVQHLSGDGTEVSAYGDDDVSIPKKLSISIHIRQKRSKLDTSPFLFTDEKTQSTKRTGDFNVFLSPNFPVGEAADYNDDYDYQQLNSKRSKQKLESFYDDDYYDDFSSDYDSPRQYFPPQSEPKGYPSRTRNFRRSPISQLYGNPPATSYGVPIYGTSVFSTAQTFIPPKLEVPTLTNQFPNVIEPVYMLTQTQLKELVGHHNLNIEHLDVYQLLKENRAKKTPYYPRKYRKRNPFRHLRSNLHKLNKFHLKYAANYEFGYRVRDANTANYYGHRESRNGLKTKGQYHVLLPDGRMQQVNYVAGPEGYHADITYDQPH-