Monarch geneset OGS2.0

DPOGS207688
Transcript: DPOGS207688-TA, 1332 bp
Protein: DPOGS207688-PA, 443 aa
Genomic position: DPSCF300502 - 46717-50768
RNAseq coverage: 1x (Rank: top 94%)
Annotation
Heliconius: HMEL002616, 2e-15, 44.83%
Bombyx: BGIBMGA000166-TA, 9e-57, 45.77%
Drosophila: Cpr50Cb-PA, 2e-12, 54.84%
EBI UniRef50: UniRef50_C0H6X4, 2e-54, 45.77%, Putative cuticle protein n=3 Tax=Pancrustacea RepID=C0H6X4_BOMMO
NCBI RefSeq: NP_001166638.1, 3e-55, 45.77%, cuticular protein RR-2 motif 132 [Bombyx mori]
NCBI nr blastp: gi|290560877, 7e-54, 45.77%, cuticular protein RR-2 motif 132 precursor [Bombyx mori]
NCBI nr blastx: gi|290560877, 2e-67, 35.70%, cuticular protein RR-2 motif 132 precursor [Bombyx mori]
Group
Gene Ontology: GO:0042302, 1.2e-11, structural constituent of cuticle
KEGG pathway:
InterPro domain: [207-258] IPR000618, 1.2e-11, Insect cuticle protein
Orthology group: MCL23331 Lepidoptera specific

Nucleotide sequence:

>DPOGS207688-TA
ATGCTGGCACTACTTCTGTTATACATCTCAAGGGTCCAACCTCAGGAGATTACAACACCTGCCTCTCTGCTGGGGCCACAACACTTCAACTATCACGCATACCGGCTCACCCCAGAAGACATAAATCCCACCAAGAAAGGTCCCGTATTGTTTCCAAACGATTCCCCCCCTCCACCCCGTCCTCCCCTAGTTGTAACTTCTCGTCCTATTCTAGAATCTATAGCCAGAAGCGAGCTGAATCCTCTACCTCAAAATAACTCGACACCAGAAACCAAACAAACTGATAAGGCTTTTGGCGAGTCAGCTTCCGATTCGAACATATCAAACAATCCGTATCAGAATTTTGTGGATGTGGATTTAATTGGATTGAATCCAGAAACTCAGGCCAGCTCCTACGCTCTGCCATACCCGGATGTAGTGACAAACAGATTTTATGATTATCCTGATATTAATAAAGGATACTATATTAAACAGAGTACATTACCGCCGATTTACAAGGCAATATCGGACAACGCGAACCGTGAACTCGCAAAGTTACGACAGAAACCGAGCCGATATAACACACAGGAAATTGATTATGACAATTATGATGATAAACAAAGCAAAGAAGAGAATAATTACGCGTTCTCGTACAGAGTGAACGATCACATAACCGGTGATGATTTTTCACACTCACAAAGATCTAGTGGGTCCGCGACCAGCGGCGAATACAGAGTTCGGCTCCCCGATGGTAGAATGCAGATAGTGTCTTACACAGCCGACGAAAACGGATACAAAGCTGACGTGAGATACGAGGATGAAGGACAAACACACAAACAGGAAAACAATTTGAATTTTATACAAAATCAAAAACAAATCGGTTACTCATCAAAAAATTACTACGATACAAGTACAAATTACTACAAACACAATCCGAAAATCCAACAAAATCCTTATTTATCCGATACAAATGACTATTCAGATATCTTCCAATACGAGTCCAATAATCCACACCACAGCAAATTTTCTATTTACGATCCCAAATTTTCGACAATCCGTCCCATAACTACGAATGAATTAAGCGATTCAGTCAAATACGAATCCAGTACGAAATCTGTTATTGTTTCAAATGGAAATATCTATACAACTATAAAACCATCTCTTCAATCTCTTGTCATCTCTCCATCTCCAATTCCAACTCTATCTACAATTCAAACCCAATCTACAATTCCAACTCTCTCTCCCTCCTCGTATCTCGCATCAACTATAGCCAACTTAAGGAACAGAATCGCATCCAAGCCAATATTATCCAAAAGCTTTATAGATAGAATCAACAGATATATAACTTTTTGA

Protein sequence:

>DPOGS207688-PA
MLALLLLYISRVQPQEITTPASLLGPQHFNYHAYRLTPEDINPTKKGPVLFPNDSPPPPRPPLVVTSRPILESIARSELNPLPQNNSTPETKQTDKAFGESASDSNISNNPYQNFVDVDLIGLNPETQASSYALPYPDVVTNRFYDYPDINKGYYIKQSTLPPIYKAISDNANRELAKLRQKPSRYNTQEIDYDNYDDKQSKEENNYAFSYRVNDHITGDDFSHSQRSSGSATSGEYRVRLPDGRMQIVSYTADENGYKADVRYEDEGQTHKQENNLNFIQNQKQIGYSSKNYYDTSTNYYKHNPKIQQNPYLSDTNDYSDIFQYESNNPHHSKFSIYDPKFSTIRPITTNELSDSVKYESSTKSVIVSNGNIYTTIKPSLQSLVISPSPIPTLSTIQTQSTIPTLSPSSYLASTIANLRNRIASKPILSKSFIDRINRYITF-