Monarch geneset OGS2.0

DPOGS207975
TranscriptDPOGS207975-TA1635 bp
ProteinDPOGS207975-PA544 aa
Genomic positionDPSCF300090 + 612317-626079
RNAseq coverage40x (Rank: top 73%)
Annotation
HeliconiusHMEL0070033e-5477.70% 
BombyxBGIBMGA000325-TA4e-3480.49% 
DrosophilaCpr49Aa-PB2e-2566.00% 
EBI UniRef50UniRef50_Q9BPR18e-4661.11%Cuticle protein n=1 Tax=Bombyx mori RepID=Q9BPR1_BOMMO
NCBI RefSeqNP_001036869.12e-4661.11%cuticular protein RR-1 motif 45 [Bombyx mori]
NCBI nr blastpgi|1129836773e-4561.11%cuticular protein RR-1 motif 45 precursor [Bombyx mori]
NCBI nr blastxgi|1129836774e-5076.86%cuticular protein RR-1 motif 45 precursor [Bombyx mori]
Group
Gene OntologyGO:00423023.6e-17structural constituent of cuticle
KEGG pathway 
InterPro domain[448-504] IPR0006183.6e-17Insect cuticle protein
Orthology groupMCL10817 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207975-TA
ATGAAGCGTGTTGATACGATGCGTCGTTTCATCGGATATTTGGCATTACTGGGCATCGGTCTGTGTGTTCCTGATGGACCATCGTCAAATTCTCTTCCCCTTAGAAATCCACTTTTAAACTATAGCCCCTACTATAATCCCGGAAGTGCTACAGCACCCATTCTATCATATTCTAACACTCATGGTGTTGATGGCAGCTATTCTTACAGTTTTACGACTGGCGACGGCAAACAAGCTCAAGAAAATGGATATCTAAAAGATGCTTATATTGATAACATTGGTCAACCCCAGGGAACCCAGGTCAAAGAAGGCAGTTATTCTTATGTATCTCCCGAAGGAACACCCATACAAATTGATTATGTTGCTGACGAAAATGGGTTCAGACACGGTGGTGTTCATTTTACAGCGAACGGTAAAGGAGCAATACCAGCTTCATTGCTATTAGCCCTTGCCGCAGCAGCAGTTGCAGCAGATGTCAGTGACTTGCAACCGTCGTACCTTCAACAACAATATCATACAACGGAACCTATCCCCATAGTACGCCAGGAACAGATAATCAATCCGGATGGATCATACAAATGGAATTCCTCATTTCTATTTGAAATAAATCTGCAGATGCGTCGTTTCATCGGATATTTGGCATTACTGGGCATCGGTCTGTGTGTTCCTGATGGACCATCGTCAAATTCTCTTCCCCTTAGAAATCCACTTTTAAACTATAGCCCCTACTATAATCCCGGAAGTGCTACAGCACCCATTCTATCATATTCTAACACTCATGGTGTTGATGGCAGCTATTCTTACAGTTTTACGACTGGCGACGGCAAACAAGCTCAAGAAAATGGATATCTAAAAGATGCTTATATTGATAACATTGGTCAACCCCAGGGAACCCAGGTCAAAGAAGGTAGTTATTCTTATGTATCTCCCGAAGGAACACCCATACAAATTGATTATGTTGCTGACGAAAATGGGTTTAGACACGGTGGTGTTCATTTTACAGCGAACGGTAAAGGAGCAATACCAGCTTCAATATTCAACCCCAGATTTAATTATAATAACCCTTACAGCAACAGAAATTATCCTTTTGGACAGTATTCCCCAATAAAACCTTATGACCCCAGATATCCTAACAACCCCCGCTATAATATTTATAATCCTTACAGACCTGTCCTTGATGACGTAAAGAAAGAAGTAAAAACGTTGCTATTAGCCCTTACGGCAGCAGCAGTTGCAGCAGATGTCAGTGACTTGCAACCGTCGTACCTTCAACAACAATATCATACAACGGAACCTATCCCCATAGTACGCCAGGAACAGATAATCAATCCGGATGGATCATACAAATGGAATTATGAAACTGGCAACGGTATCTCTGCTGAGGAGTCTGGGTACATAAAGAATCTCGGTATCCCTGAACAAGAGACCCAATCCGTTCAAGGACAGTACAAGTATACGGCACCTGATGGTCAGATCATAGAACTGCAGTACGTAGCTGATGAAAACGGTTTCCAGCCACAAGGAGCTCATCTCCCAACTCCTCCGTCGATTCCTGTGGATATTCAAAAAGCTTTGGACTATCTCGCAACCTTGCCCCCTCAAAACCAAGAACCTGTCAAAAACAGACCATTCTGA

Protein sequence:

>DPOGS207975-PA
MKRVDTMRRFIGYLALLGIGLCVPDGPSSNSLPLRNPLLNYSPYYNPGSATAPILSYSNTHGVDGSYSYSFTTGDGKQAQENGYLKDAYIDNIGQPQGTQVKEGSYSYVSPEGTPIQIDYVADENGFRHGGVHFTANGKGAIPASLLLALAAAAVAADVSDLQPSYLQQQYHTTEPIPIVRQEQIINPDGSYKWNSSFLFEINLQMRRFIGYLALLGIGLCVPDGPSSNSLPLRNPLLNYSPYYNPGSATAPILSYSNTHGVDGSYSYSFTTGDGKQAQENGYLKDAYIDNIGQPQGTQVKEGSYSYVSPEGTPIQIDYVADENGFRHGGVHFTANGKGAIPASIFNPRFNYNNPYSNRNYPFGQYSPIKPYDPRYPNNPRYNIYNPYRPVLDDVKKEVKTLLLALTAAAVAADVSDLQPSYLQQQYHTTEPIPIVRQEQIINPDGSYKWNYETGNGISAEESGYIKNLGIPEQETQSVQGQYKYTAPDGQIIELQYVADENGFQPQGAHLPTPPSIPVDIQKALDYLATLPPQNQEPVKNRPF-