Monarch geneset OGS2.0

DPOGS201644
Transcript: DPOGS201644-TA (2043 bp)
Protein: DPOGS201644-PA (680 aa)
Genomic position: DPSCF300254 - 7727-14476
RNAseq coverage: 31x (Rank: top 75%)
Annotation
Heliconius       HMEL014548         1e-99  53.17%
Bombyx           BGIBMGA008126-TA   4e-57  41.06%
Drosophila       Cpr56F-PA          6e-09  56.25%
EBI UniRef50     UniRef50_C0H6Y7    8e-55  41.06%  Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6Y7_BOMMO
NCBI RefSeq      XP_002428637.1     5e-14  29.08%  Pro-resilin precursor, putative [Pediculus humanus corporis]
NCBI nr blastp   gi|223671392       3e-54  41.06%  TPA: putative cuticle protein [Bombyx mori]
NCBI nr blastx   gi|223671392       4e-67  41.77%  TPA: putative cuticle protein [Bombyx mori]
Group
KEGG pathway 
Orthology group: MCL25427 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201644-TA
ATGCTCGAATTGTGGAAAAACGTTAAGGAGCGTGTGTCTGTATCACTCAAGTGGAAGCGCTCGGGAAGATATCTGACAGAGGAACAGGTGAACTTAGACTTGACGAACAGCACTGACTCTAACATCATGAAGGTCTTCGTATTCATGAGTCTTCTGTCTTTCACTCTGGCTGAGCCACCAGTCGGAGACGGTTACCCTTCATCACGATCAGCTCACCACCACGGCCATGGACACCAAGGATCGCTGACTCAAGAGTATGGACTTCCAGAATTTGGAGCACGGAACCAACAGGAGTACGCTGTAACAGCTGCAAGGTCTCAACCTTCACAAACATACGGAACCCCGACGAGGTCACCTATCTCTCGGGAATATGGAACACCCAACCTTCGGTCAGGACTCTCCCAGGAGTACGGCCCTCCCAGTTTCAGATCCGCTCCATCATTACAATATGGAGTCCCTCATTCCCGCGGCCTGTCTCCACAGTACGGTCCTCCGACCAGTCATTCCTCTCCATCCAGCCAAGATTCATATAAACAAGACGTCGTTATCCAAAGTGCTCGTTTGTCTCAAGACTTCGCGCCACAAGTCAAATCTGCTTCTCTCCCAAGCTTCAATGCCAGTGCTGATTTCTCCTCAGACTCGTATTCAGTTCCACCTCAAGGATCCTACGATTACAACCAGCTCGAACAACGCACTCCGTCCCAAGACTATGGAGTCCCATCTCAGAGAAGCAATGAACAGTCTCCCGCTAACAAATACGGTCCTCCCGGTTTGAGGAGTGCTCAGTCGTTTGGTCAAATAAGAAAGCAAGCGTCTTCGGCTGGACTTAATGCCAAGACTTCATCTGTGACAAGAAGCTTCCAATCATCTTTCGGTTCCAAAAACTCTGCGTCACCCACCTCTGGAACTCCCCGGTCTTTGTCCGCTGTATATGGAGCTCCGAGTTCAAGGAACTTTAATTCTTATTCCCAAAGTCCAAAGACCGTTTCTCAGACTTATCTGCCCAGTTCCCGTACTGTGTCTCAGTCCTATGGGGTTCCGAGCGAGCGTGGAGTCTCAACTGAATACGGTGTGCCAGAAAATGATTTTAACTTAAAGAATAGTGCTGAAATACCACAGTATGATCTTGCAAGATCTCCGTCTTCTCTATACGGAGCGCCTGAAGCCCGCATGCCCTCAGAGCAGTACGGAACACCACAGCAGTATAGTTCCCAGGACTCTCAAGGATACAACTATGACAGAAATGCTCTCGACGAACTTCTGAACCAAGAGCCAGCTAACTATGACTTCGGCTACAAAGTGAACGACCTTGAGAGCGGCAGCGACTTCGGTCACTCGGAGACGAGACAGGAGAACAGGGCCGAGGGCTCGTACTTCGTGGTGCTGCCTGATGGAACCAAACAGGATTGTTTCTATCCAAACATCGTGCAGGCAAAGCCGGATGAGGCCATTCTACCAAGAGCCTTCTCCGTGCCTACACAAACTCAGACCGTGTTCGTGTTGTTCCTCGTCGGTGCCGCGGCGTTTGTTTCAGCGGAGTCTCCAATATCGAAGAGCTATCTTCCTCCGCCACCTCCGAGTGACGTCATCACCTTCTCTAGAGCAGCTTCATCGGGCCAGGACTTCCGAAGAATTCAGGAGCTAGGAGACATGAGCTCGGGTCTAACACACGGAGACAACAACTTTGAAAGGATCAGTTCTAGGGGTCAGGGTGTCGGAGACGACGCTACGGGAAGGGGTCCCTCACAGGAACCAGCGGATCCGGCCAAGTATATGTTCGGTTACTCGGTTAACGAGCTGGAGGGAGCAGACTTCGGTCACAGAGAGGAGAGTTTCGAGGAGAGGAGCCAGGGTCAGTACCGCGTGATGCTTCCAGTCGGCAGAGGACAGGCAGTGGACGACGGAGCTGAAGGTCGCGGGTTCCAACCCCAGGTGGCCTACAGGAGCTCGGGCGCCTCGGCTGGGGGCTCTGGCTACGACTCCTCACACAAGGAATACAA
CCAGGGCGGCCGAGAGGGGCGAGGGGGGCGCTCAGGATACTGA

Protein sequence:

>DPOGS201644-PA
MLELWKNVKERVSVSLKWKRSGRYLTEEQVNLDLTNSTDSNIMKVFVFMSLLSFTLAEPPVGDGYPSSRSAHHHGHGHQGSLTQEYGLPEFGARNQQEYAVTAARSQPSQTYGTPTRSPISREYGTPNLRSGLSQEYGPPSFRSAPSLQYGVPHSRGLSPQYGPPTSHSSPSSQDSYKQDVVIQSARLSQDFAPQVKSASLPSFNASADFSSDSYSVPPQGSYDYNQLEQRTPSQDYGVPSQRSNEQSPANKYGPPGLRSAQSFGQIRKQASSAGLNAKTSSVTRSFQSSFGSKNSASPTSGTPRSLSAVYGAPSSRNFNSYSQSPKTVSQTYLPSSRTVSQSYGVPSERGVSTEYGVPENDFNLKNSAEIPQYDLARSPSSLYGAPEARMPSEQYGTPQQYSSQDSQGYNYDRNALDELLNQEPANYDFGYKVNDLESGSDFGHSETRQENRAEGSYFVVLPDGTKQDCFYPNIVQAKPDEAILPRAFSVPTQTQTVFVLFLVGAAAFVSAESPISKSYLPPPPPSDVITFSRAASSGQDFRRIQELGDMSSGLTHGDNNFERISSRGQGVGDDATGRGPSQEPADPAKYMFGYSVNELEGADFGHREESFEERSQGQYRVMLPVGRGQAVDDGAEGRGFQPQVAYRSSGASAGGSGYDSSHKEYNQGGREGRGGRSGY-
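
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic: a 680 aa protein needs 680 codons plus one stop codon, which is 3 x 681 = 2043 bp. The sketch below illustrates that check. It assumes the transcript record holds only the coding sequence (start codon through stop codon, no UTRs); the function name `coding_length_bp` is hypothetical and not part of any database tooling.

```python
def coding_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp for a protein of the given length.

    Assumes one codon (3 bp) per amino acid, plus one stop codon if requested.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Lengths as listed in the record above.
transcript_bp = 2043  # DPOGS201644-TA
protein_aa = 680      # DPOGS201644-PA

expected_bp = coding_length_bp(protein_aa)
print(expected_bp, expected_bp == transcript_bp)
```

Under that assumption the two listed lengths agree, which is also consistent with the nucleotide sequence starting with ATG and ending with the stop codon TGA.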