Monarch geneset OGS2.0

DPOGS215913
TranscriptDPOGS215913-TA1332 bp
ProteinDPOGS215913-PA443 aa
Genomic positionDPSCF300029 + 479371-485686
RNAseq coverage93x (Rank: top 62%)
Annotation
HeliconiusHMEL0053992e-13058.57% 
BombyxBGIBMGA000285-TA3e-10651.06% 
DrosophilaCpr76Bd-PB8e-3677.22% 
EBI UniRef50UniRef50_C0H6S16e-10451.06%Putative cuticle protein n=2 Tax=Endopterygota RepID=C0H6S1_BOMMO
NCBI RefSeqNP_001166681.11e-10451.06%cuticular protein RR-2 motif 79 [Bombyx mori]
NCBI nr blastpgi|2905607522e-10351.06%cuticular protein RR-2 motif 79 precursor [Bombyx mori]
NCBI nr blastxgi|2905607524e-11151.39%cuticular protein RR-2 motif 79 precursor [Bombyx mori]
Group
Gene OntologyGO:00423022.5e-16structural constituent of cuticle
KEGG pathway 
InterPro domain[379-431] IPR0006182.5e-16Insect cuticle protein
Orthology groupMCL26833 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215913-TA
ATGCCTCTGGGTTGTATAAAATATGTCTTAACCGGTATAATACTGCTTGTGACTATTGCATTCCTTTCAAATGCATTCGCTTATCACGATCCCGATCTTAATTACCATTTGAGTCAGGTGCAAAATATTCAAGGATGTGGTCACACTGGCTATAACTATGCAGCTCCCGCAATACAGCTATCATTACCGGAAGCAAAAGCCACAGCTGCCCCAAGACTTCTTCAAGCGGAAGCGCTTTCTGCTTCTAAAACTCCTCAAGTCAGCTATCAGTCGCAGCCAGCTATCACATACCAAACACCAATAGCTGCAACGTATTCTGGTTACTCAGCCCAAAGGGAACAGCATGGGTATGCTACTTCTGCTGGCTTATCAGCATCTGCTACCAGAACTGTAATTCCACAAGCAACTTACGCCCAAGCTCCAATAATAGCTAAAATAACTGCAGCACCGTTGCAAGCAAAATTCACGATATCCCCACCTAGAACCACATATGTGTCCCAAAATTTACTATCGCAACAATCTACTTATAACTCCGGATCATCTGCAAGGGCATCATTAAATTCATTCAGTCACGGAGCTGGTCCAGTAGTGTCTCAAGTATATGCAGCGCCCACGGCTGGTTACACAACTTCACCGGCACTTAAAGTGCAATCGCAACAAACAGTTCTTCCAGCTTATCAGTATCAAACTGCAGCGCCTTTATCAGTGGCGCAAGTTTACAAAGCACCAGTTGGGACTCAATACTCAAGTTCTATTGTGTCTCATGATTCAGCTCCAAGTTTAACACAGTATTCTGCGCCGACTTATAGGGCTTCTAATTTAGCTCAGCAAACGGCATCGGTACAATACTCTTCTCCAGCTCATTACGTAACACAATCGGCAGTTCAATATACATCACCAGTCGCTCAATACTCAGCTCCATCTGTGACTCATTATTCTGCACCAGTCGTGACTCAATACTCAAGAGTACAGCCAGCTGTACAGGCCTCGCAATATATAGCACCGTCAGTATCCCATCAAACAGTTTCCACTCCAGAAGTAGCTATAGCAGATATAGCTCAGGTCGTCAAGAACTCACAGAGGGCTAAAAATGTGCACACGGAATTCTTGGAGAACTACGACGCACATCCTCGTTATGCTTTTGAATACGGTGTAAACGATCCCCACACCGGAGACATCAAACAGCAAAAAGAAGAACGCGACGGAGATGTTGTTAAAGGTCAATACTCTTTGGTGGAACCGGACGGTTCGGTGCGAACTGTCAACTACGTCGCTGACTGGGAGACCGGTTTCCACGCTAACGTTCACAATAGCAAAGACAAGCAACACTAA

Protein sequence:

>DPOGS215913-PA
MPLGCIKYVLTGIILLVTIAFLSNAFAYHDPDLNYHLSQVQNIQGCGHTGYNYAAPAIQLSLPEAKATAAPRLLQAEALSASKTPQVSYQSQPAITYQTPIAATYSGYSAQREQHGYATSAGLSASATRTVIPQATYAQAPIIAKITAAPLQAKFTISPPRTTYVSQNLLSQQSTYNSGSSARASLNSFSHGAGPVVSQVYAAPTAGYTTSPALKVQSQQTVLPAYQYQTAAPLSVAQVYKAPVGTQYSSSIVSHDSAPSLTQYSAPTYRASNLAQQTASVQYSSPAHYVTQSAVQYTSPVAQYSAPSVTHYSAPVVTQYSRVQPAVQASQYIAPSVSHQTVSTPEVAIADIAQVVKNSQRAKNVHTEFLENYDAHPRYAFEYGVNDPHTGDIKQQKEERDGDVVKGQYSLVEPDGSVRTVNYVADWETGFHANVHNSKDKQH-