Monarch geneset OGS2.0

DPOGS209415
Transcript: DPOGS209415-TA, 3270 bp
Protein: DPOGS209415-PA, 1089 aa
Genomic position: DPSCF300346 + 279043-283417
RNAseq coverage: 641x (Rank: top 20%)
Annotation
Heliconius: HMEL021396    2e-56    77.86%
Bombyx: BGIBMGA012602-TA    1e-39    65.00%
Drosophila: Cpr65Eb-PA    7e-20    55.45%
EBI UniRef50: UniRef50_C0H6K3    3e-37    65.00%    Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6K3_BOMMO
NCBI RefSeq: NP_001166736.1    5e-38    65.00%    cuticular protein RR-1 motif 12 [Bombyx mori]
NCBI nr blastp: gi|290560808    9e-37    65.00%    cuticular protein RR-1 motif 12 precursor [Bombyx mori]
NCBI nr blastx: gi|290560808    2e-50    34.94%    cuticular protein RR-1 motif 12 precursor [Bombyx mori]
Group
Gene Ontology: GO:0042302    2.1e-11    structural constituent of cuticle
KEGG pathway:
InterPro domain: [102-151] IPR000618    2.1e-11    Insect cuticle protein
Orthology group:
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209415-TA
ATGGAATCCAGGATTCTAGTTCTATCAATAATAGCTTATGGCTATGCGGATAAATTAGATAAGGGATACCTGCCTCCTGTCAATGCAGCATCATCAGGTGGCAGCCCAGCAGAATTAATTGCTCCAGCTGATCAATCTGAGGTTTTCGGGCAAGGGTTGCCCGTTCCAGAAAGTCAACCAGGCTCTTATATCCAAGATATTGGACAGGAAGTTCTTCAAGCTTACAACCAGGAACGCCCTCAAGCAGCAGCTGATAGAAATGCGGAGATACTAAAGTTTAATAACGAAAATAACGGTGAATCCTATGCATATAATTATGAAACATCTAACGGGATTTCTGTGGAGGAATCAGGTGTTGCATCTAATGGAGTTAATGCTCAAGGTGGCTATGCCTATACTGGGGATGACGGTAAATCCTATTCAGTCACTTATACGGCAGATATAAATGGCTATCAACCTCAGGGTGAACATTTACCTACACCTCACCCTATCCCAGAAGAAATATTAAAGTCCATAGAAGAAAATGCTAGGGCTGCTGCTGCCGGTACACAAGAAGGAGCTTACAATCCTGAGGAGTATGATTCCAACGTTTATTATCAAACAAAGCCAGATCAAGAATCTGATGGTTCTTTAGATGTAATCGAAAGAAACAAAAATCAAGAAATAAGCTCCATATATAACAATCCCCTCGATTCTGTGGGTCAACAATATCAAAAAGCATCTAGTTTAGATGTAAATCCATCAGACCAAAAACGGAATAAGGAAAGTGGACAAGATATTAACCAGATGTACACTCCAAATCCTTATCAATCCCACCAGACCTCTGGTATTGGTATTCAAGGAAACAGTGGTTTTGAATCTAGTAGTCAGTCACCTATTCAAGGAATAGCAGGACAATTTTTACAAGGAGACGGTTATCAATATAATCAACCTAAATTTTCATTGCAGCCAGTTTTTCCAGGACAAGATCAATACAAACCGCAAGTAACTAGCGACAATGAAAATATATCATCATCGTTAAGACCAAGTTCATCAGGTCCCGCATTTAATAGAGATCAAAATTTGAAACCTTCATTTGGAAGCCTTCCCTCAGCTGAACAAACTCCGCAATATCAATCTGGCCAACAAATTCTTCCAGAATTTAGGCCCTCCTCTCATAGCGGACCAAATGCATCTCAAAGTATGAGGGGATCTGTCCCAAACCAAGACCAACAAATCCAAATCAAGGAATCATCTGGCGATTTGAACGAAAGCAAAGGAAATGGTTATCATTACAATCAACCAAAGCCCGTGTTCCAACCCGCTAATTCCGAACAAAGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCTTCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTTGGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCAGAATTTAGGCCCTCCTCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTCCCAAATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTGAACGAAAGCAAAGGAAATGGTTATCATTACAATCAACCAAAGCCCGTGTTCCAACCCGCTAATTCCGAACAAAGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCTTCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTTGGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCAGAATTTAGGCCCTCCTCTCATAGCGGACCAAATGCATCTCAAAATATGAGGGGATCTCTCCCAAACCAAGACCAACAAATTCAAATCAAGGAATCATCTGGCGATTTGAACGAAAGCAAAGGAAATGGTTATCATTACAATCA
ACCAAAGCCCGCTTTCCAACCCGCTAATTCCGAACAAAGCTCGTTAAACCAATATCGTCCAGAATTATCTGGCCAAAGTGAAAAGATATCATCTTCTTCAAGGCCAGGCTTATCAGGATCCATATATAATGGAGATCAGAATTTGAAACCTTCATTTGGAAGCCTTCCCCCAGCTGATCAATCTTCGCAGTATCAATCTGGCCAACAAATTCTTCCAGAATTTAGGCCCTCCTCACATAGCGGACCAAATGCATCTCAAAATATGAAGGGATCTCTCCCAAATCAAGACCAACAAATCCAAATTAAAGAATCCTCTGGCGATTTAAACGAAAGCAAAGGAAATGGTTATCATTACAATCAACCAAAGCCCGCGTTCCAACCCGCTAATTCCGGACAAAGCTCCTTTAACCAGTATCGTCCGGAATTATCTGGCCAAAGCGAAAAGATATCATCTTCAGCAAGACCAAGTATGTCAATTCCCATATTTAATAGAGATCAGAATTTGAAACCCTCATTTGGAAGTCGTCCCTCAGCTGATCAATCTCAAAAGTATCAATTCGGCAAACAAATTCCTTCAGTTTTTAAGCCTTCTTCTTATAGAACATCAAACGTTTCTCAAAATAAAGAAAGTCCTTTCCTCAATCGAGACCAACGAGTCCAAATTAAACGACCATCAAGTGGTTTAATACAAAACAAAGTAAACGGTTATCAATATAATCGACCAAAACCTGCCTTTCAGCCAACTATTTCGGGACAAAATAGACCTCGAGTTTCTAACGAAGGAAATAAGAAGCCACTATTAAGTCAAAACACTTTCACTTCAGTAGTTAGTGGAAATAACGGAAACATTAATCCTTCACCACAAAACGGACCAAATGGTTCTCAAAATAAAGGATCTTACCCTATTAAAGTTCTAAAGAACCCAGGCAGTCAGGCAGCTGGCTCTTCACAAGGCAACGGTTCACCTCGCTTCAGTATTTTGAATAAAAATAAACCTTATTCTGCTTTGCAAAAACCCGGACAAGGTTTCCAATCGTCTCCTAGTGAAGGACTTGGAAACAATAAATTTGGAAAAGGACCTATTTCTGCTTTAAAACAAGTCGAAGCGCCTTACCACTACAAAAGACCAAGTGTAAGTTTTACCACACAACGTCCAAACTCTTTTTCGCAAACAACACAGATAAGCAGAGGCAATCAGGATAAAAGTGAACAGTTTGCGGGGTCTCGTCCACCGCCGAGTTTCAGCGAGGAAGAAGGTTACAAATATTAG

Protein sequence:

>DPOGS209415-PA
MESRILVLSIIAYGYADKLDKGYLPPVNAASSGGSPAELIAPADQSEVFGQGLPVPESQPGSYIQDIGQEVLQAYNQERPQAAADRNAEILKFNNENNGESYAYNYETSNGISVEESGVASNGVNAQGGYAYTGDDGKSYSVTYTADINGYQPQGEHLPTPHPIPEEILKSIEENARAAAAGTQEGAYNPEEYDSNVYYQTKPDQESDGSLDVIERNKNQEISSIYNNPLDSVGQQYQKASSLDVNPSDQKRNKESGQDINQMYTPNPYQSHQTSGIGIQGNSGFESSSQSPIQGIAGQFLQGDGYQYNQPKFSLQPVFPGQDQYKPQVTSDNENISSSLRPSSSGPAFNRDQNLKPSFGSLPSAEQTPQYQSGQQILPEFRPSSHSGPNASQSMRGSVPNQDQQIQIKESSGDLNESKGNGYHYNQPKPVFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSFGSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNESKGNGYHYNQPKPVFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSFGSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMRGSLPNQDQQIQIKESSGDLNESKGNGYHYNQPKPAFQPANSEQSSLNQYRPELSGQSEKISSSSRPGLSGSIYNGDQNLKPSFGSLPPADQSSQYQSGQQILPEFRPSSHSGPNASQNMKGSLPNQDQQIQIKESSGDLNESKGNGYHYNQPKPAFQPANSGQSSFNQYRPELSGQSEKISSSARPSMSIPIFNRDQNLKPSFGSRPSADQSQKYQFGKQIPSVFKPSSYRTSNVSQNKESPFLNRDQRVQIKRPSSGLIQNKVNGYQYNRPKPAFQPTISGQNRPRVSNEGNKKPLLSQNTFTSVVSGNNGNINPSPQNGPNGSQNKGSYPIKVLKNPGSQAAGSSQGNGSPRFSILNKNKPYSALQKPGQGFQSSPSEGLGNNKFGKGPISALKQVEAPYHYKRPSVSFTTQRPNSFSQTTQISRGNQDKSEQFAGSRPPPSFSEEEGYKY-
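
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below assumes that the 3270 bp transcript is entirely coding sequence (it starts with ATG and ends with the stop codon TAG) and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def protein_length_from_cds(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the CDS is in frame; the final codon is not counted when it
    is a stop codon.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Transcript length listed for DPOGS209415-TA
    print(protein_length_from_cds(3270))
    # Genomic position listed on scaffold DPSCF300346
    print(span_length(279043, 283417))
```

Under these assumptions the transcript length is consistent with the 1089 aa protein (1090 codons including the stop), and the genomic interval covers 4375 bp, which is 1105 bp longer than the transcript.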