Monarch geneset OGS2.0

DPOGS213677
Transcript: DPOGS213677-TA (1218 bp)
Protein: DPOGS213677-PA (405 aa)
Genomic position: DPSCF300219 - 318826-320438
RNAseq coverage: 4x (Rank: top 89%)
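The lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state. It checks that a 405 aa protein plus one stop codon accounts for the 1218 bp transcript, and it computes how much of the genomic span is not represented in the transcript (presumably intronic sequence, though the record does not say so).

```python
# Values copied from the record above.
transcript_bp = 1218
protein_aa = 405
genomic_start, genomic_end = 318826, 320438

# Coding length implied by the protein: one codon per residue plus one stop codon.
expected_cds_bp = (protein_aa + 1) * 3

# Span on the scaffold, ASSUMING 1-based inclusive coordinates.
genomic_span_bp = genomic_end - genomic_start + 1

# Genomic bases not present in the transcript (presumably introns; an assumption).
non_transcript_bp = genomic_span_bp - transcript_bp

print(expected_cds_bp)    # 1218
print(genomic_span_bp)    # 1613
print(non_transcript_bp)  # 395
```

Under these assumptions the transcript length matches the coding length exactly, which suggests the transcript model contains no annotated untranslated regions.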
Annotation
Heliconius: HMEL016610  3e-62  55.67%
Bombyx: BGIBMGA010320-TA  2e-34  50.00%
Drosophila: resilin-PA  2e-27  50.00%
EBI UniRef50: UniRef50_C0H6Y2  4e-32  50.00%  Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6Y2_BOMMO
NCBI RefSeq: NP_001166631.1  8e-33  50.00%  cuticular protein RR-2 motif 140 [Bombyx mori]
NCBI nr blastp: gi|290563374  1e-31  50.00%  cuticular protein RR-2 motif 140 precursor [Bombyx mori]
NCBI nr blastx: gi|290563374  4e-73  42.31%  cuticular protein RR-2 motif 140 precursor [Bombyx mori]
Group
Gene Ontology: GO:0042302  3e-11  structural constituent of cuticle
KEGG pathway:
InterPro domain: [196-247] IPR000618  3e-11  Insect cuticle protein
Orthology group: MCL18335  Insect specific

Nucleotide sequence:

>DPOGS213677-TA
ATGATTTTAAGTCTCTGTTTAGCCGTTCTAGTTGTGACTTCAATAACATGTGAACCGCCAGTAAACAGTTATCTACCCCCTGGAGGCAGCAGTGGTTCTAGTGGAAGGCCTTCTACCCAATATGGTCCACCGGGTTCCAATCAAGGAGGATTTGGAGGTGTTGGTTCTGGTAGGGGCCGGTCAGGTGAAAACTATGATCAGTCTGGAAGATCTTTGGGATCTCCTTCTTCTGAATATGGCACTCCAGGAGCTCAGGGATTCCAAGATAGTTTACAAGAACAACAGGGACCAAATTCTCAATATAGAACTCCGGACCAAGGCAGCTCCGGAAATATAGGAGGACCTGCTAGTGGTCGTGGAACACCTCAAAGACCTGATTCTTCATATGGAACTCCAATACAAGGATTTCAATCCGGCAACCAAAACAATTTCCGTGGACAAGGCGCTTTTTCTGGACAAGGCTCTAGAAGACCTGGAGTATCTCCGTCTTCTTCTTATGGAACCCCTGACTTCGGTAATGGTAGAAATATTGAAAATGAAAGTTATCGTAGCTCAGGGGTAGATGAATCAAATGGGCCAGCTAAGTATGAATTTAGCTATGAAGTCGATGATGCGGAAACAGGTACTATGTTTGGTCATTCAGAACAGCGAGATGGTGACTTGACTACAGGACAGTACAATGTAGTTTTACCCGACGGTAGGAAGCAGGTGGTAGAATACGAGGCTGATCAAGACGGCTACAAGCCTCAAATAAGATACGAGGGTGCAGGACGAGGTGTTGGAGGGTCTGGATTTGGACAGGGAAGAAATGGTGGACAAGGATATCCGGGCGTTGGTGCAAATGCTGGTGCTTTTGCAGATGCTCAGTCTGGAGCGCAACAAGGTGGAAATTCGCAACCAGGGTTCGGTGAAGGTTCGAGAGGTGGCTATCCAAGTGGCAGACCAGGAAGCAACGGTGAAGATTTCTCAGGAAATCAAGGCTTTCCGGGTTCAGGAAGTGGGGGTGAATATCCACGTGGATTAGGAGGATCTAGTTTTCCGGGTCGATCTCAAAGTAACTTCCCCGGTCAACAAGGAAACCAAGGTTATTCGACCGGACCTCAAGGAGCTGGAGGTGCGACTCGTGGTGGTGGACGTGGATCCAATGCTGGTAGTGGTGGCAATGGAGAAGGATATCCCAGCGGCGGTCCAAATGGACGACGGGGCTCCGGATATTGA

Protein sequence:

>DPOGS213677-PA
MILSLCLAVLVVTSITCEPPVNSYLPPGGSSGSSGRPSTQYGPPGSNQGGFGGVGSGRGRSGENYDQSGRSLGSPSSEYGTPGAQGFQDSLQEQQGPNSQYRTPDQGSSGNIGGPASGRGTPQRPDSSYGTPIQGFQSGNQNNFRGQGAFSGQGSRRPGVSPSSSYGTPDFGNGRNIENESYRSSGVDESNGPAKYEFSYEVDDAETGTMFGHSEQRDGDLTTGQYNVVLPDGRKQVVEYEADQDGYKPQIRYEGAGRGVGGSGFGQGRNGGQGYPGVGANAGAFADAQSGAQQGGNSQPGFGEGSRGGYPSGRPGSNGEDFSGNQGFPGSGSGGEYPRGLGGSSFPGRSQSNFPGQQGNQGYSTGPQGAGGATRGGGRGSNAGSGGNGEGYPSGGPNGRRGSGY-
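The protein record should be the conceptual translation of the nucleotide record. The following is a minimal sketch of that check, assuming the standard genetic code and a reading frame that starts at the first base. The names `CODON_TABLE`, `translate`, and `stop_symbol` are hypothetical and not part of any database tooling; the default `-` stop symbol simply mirrors the trailing `-` in the protein sequence above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (an assumption:
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, writing stops as stop_symbol."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 18 and last 15 bases of DPOGS213677-TA, copied from the record above.
print(translate("ATGATTTTAAGTCTCTGT"))  # MILSLC
print(translate("GGCTCCGGATATTGA"))     # GSGY-
```

Passing the full DPOGS213677-TA sequence to `translate` and comparing the result with DPOGS213677-PA would extend the same check to the whole record.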