Monarch geneset OGS2.0

DPOGS207687
TranscriptDPOGS207687-TA1239 bp
ProteinDPOGS207687-PA412 aa
Genomic positionDPSCF300502 - 55779-59765
RNAseq coverage0x (Rank: top 97%)
Annotation
HeliconiusHMEL0026163e-1544.83% 
BombyxBGIBMGA000166-TA4e-6137.67% 
DrosophilaCpr50Cb-PA2e-1254.84% 
EBI UniRef50UniRef50_C0H6X48e-5937.67%Putative cuticle protein n=3 Tax=Pancrustacea RepID=C0H6X4_BOMMO
NCBI RefSeqNP_001166638.11e-5937.67%cuticular protein RR-2 motif 132 [Bombyx mori]
NCBI nr blastpgi|2905608773e-5837.67%cuticular protein RR-2 motif 132 precursor [Bombyx mori]
NCBI nr blastxgi|2905608772e-6537.28%cuticular protein RR-2 motif 132 precursor [Bombyx mori]
Group
Gene OntologyGO:00423021.1e-11structural constituent of cuticle
KEGG pathway 
InterPro domain[176-227] IPR0006181.1e-11Insect cuticle protein
Orthology groupMCL23331 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207687-TA
ATGCTGGCACTACTTCTGTTATACTTCTCAAGGGTTCAACCTCAGGAGATTACAACACCTGCCTCTCTGCTGGGGCCACAACACTTCAACTATCACGCATACCGGCTCACTCCAGAAGACATTAATCCCACCAAGAAAGGTCCCGTATTGTTTCCAAACGATTCCCCCCCTCCACCCCGTCCTCCCCTAGTTGTAACTTCTCGTCCTATTCTAGAATCTATAGCCAGAAGCGAGCTGAATCCTCTACCTCAAAATAACTCGACACCAGAAACCAAACAAACTGATAAGGCTTTTGGCGAGTCAGCTTCCGATTCGAACATATCAAACAATCCGTATCAGAATTTTGTGGATGTGGATTTAATTGGATTGAATCCAGAAACTCAGGCCAGCTCCTACGCTCTGCCATACCCGGATGTAGTGACAAACAGATTTTATGATTATCCTGATATTAATAAAGGATACTATATTAAACAGGAAATTGATTATGACAATTATGATGATAAACAAAGCAAAGAAGAGAATAATTACGCGTTCTCGTACAGAGTGAACGATCACATAACCGGTGATGATTTTTCACACTCACAAAGATCTAGTGGGTCCGCGACCAGCGGCGAATACAGAGTTCGGCTCCCCGATGGTAGAATGCAGATAGTGTCTTACACAGCCGACGAAAACGGATACAAAGCTGACGTGAGATACGAGGATGAAGGACAAACACACAAACAGGAAAACAATTTGAATTTTATACAAAATCAAAAACAAATCGGTTACTCATCAAAAAATTACTACGATACAAGTACAAATTACTACAAACACAATCCGAAAATCAAACAAAATCCTTATTTATCCGATACATATGACTACTCAGATGACTTCCAATACGAGTCCAATAATCCACACCACAGCAAATTTTCTATTTACGATCCCAAATTTTCGACAATCCGTCCCATAACTACGAATGAATTAAGCGATTCAGTCAAATACGAATCCAGTACGAAATCTGTTATTGTTTCAAATGGAAATATCTATACAACTATAAAACCATCTCTTCAATCTCTTGTCATCTCTCCATCTCCAATTCCAACTCTATCTACAATTCAAACCCAATCTACAATTCCAACTCTCTCTCCCTCCTCGTATCTCGCATCAACTATAGCCAACTTAAGGAACAGAATCGCATCCAAGCCAATATTATCCAAAAGCTTTATAGATAGAATCAACAGATATATAACTTTTTGA

Protein sequence:

>DPOGS207687-PA
MLALLLLYFSRVQPQEITTPASLLGPQHFNYHAYRLTPEDINPTKKGPVLFPNDSPPPPRPPLVVTSRPILESIARSELNPLPQNNSTPETKQTDKAFGESASDSNISNNPYQNFVDVDLIGLNPETQASSYALPYPDVVTNRFYDYPDINKGYYIKQEIDYDNYDDKQSKEENNYAFSYRVNDHITGDDFSHSQRSSGSATSGEYRVRLPDGRMQIVSYTADENGYKADVRYEDEGQTHKQENNLNFIQNQKQIGYSSKNYYDTSTNYYKHNPKIKQNPYLSDTYDYSDDFQYESNNPHHSKFSIYDPKFSTIRPITTNELSDSVKYESSTKSVIVSNGNIYTTIKPSLQSLVISPSPIPTLSTIQTQSTIPTLSPSSYLASTIANLRNRIASKPILSKSFIDRINRYITF-