Monarch geneset OGS2.0

DPOGS215902
TranscriptDPOGS215902-TA1431 bp
ProteinDPOGS215902-PA476 aa
Genomic positionDPSCF300029 + 395806-400143
RNAseq coverage176x (Rank: top 50%)
Annotation
HeliconiusHMEL0086443e-8444.26% 
BombyxBGIBMGA000270-TA2e-4539.93% 
DrosophilaCG34461-PA4e-2357.29% 
EBI UniRef50UniRef50_Q9BPR53e-5154.55%Cuticle protein n=2 Tax=Bombyx mori RepID=Q9BPR5_BOMMO
NCBI RefSeqNP_001036864.15e-5254.55%cuticular protein RR-2 motif 93 [Bombyx mori]
NCBI nr blastpgi|1129837761e-5054.55%cuticular protein RR-2 motif 93 precursor [Bombyx mori]
NCBI nr blastxgi|1129837762e-5954.55%cuticular protein RR-2 motif 93 precursor [Bombyx mori]
Group
Gene OntologyGO:00423022.6e-14structural constituent of cuticle
KEGG pathway 
InterPro domain[410-462] IPR0006182.6e-14Insect cuticle protein
Orthology groupMCL26831 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215902-TA
ATGGTTTTAAACGTATTAGAAGTGGCATTATTTTGTTGCATCATTGAAATATGTTCAGCAATCCGAATAGACCCGAATGAAGGAGCAGGAAGAGAATTAATGAATGAAAGAAGATGGCAAATGGTGACCCTATATCCGTCGCATAATCAACAATTATTCAAATATAATACCAATACATATCCTAAATATAAGTTCGAATATTCTGTGTCTGATAAAAAAACTGGTGACCACAAACATCACCATGAATTCCGAGATGGTCACAGAGTGCAAGGAGGCTATAGTCTTATTGAACCTGACGGTTCTTTGAGGACAGTTGAGTATAATGCTGACGACTACAATGGTTTCAATGCTGTTGTAAGCAGAGGTTTCCATCGCCATGGTGGACATGCATTCTCTACGTTTGATCATACAAGACATTTTCATCCCATAAGAAGTGAAATGAAAATAAAACACTTCCTCCCAAGTAGCAATTATCTTTTTGATAAACCACAAAACAAAGGTAATATTGAAGTAAAACCAGTAAGTGAGAAAAATCCGAACCAAAATTTACCGGAAAAAAAAGCTTTAGATAAGGAAGTTGAGGAGGAATTAGAAGATCTAAGTAAAATGACGACTTTTGATCCAATTGAAAACGAGAATTCAATTACCACCGCAGCCTCTGCAATGGAAATACCAACAACGGAATCTGACAAAACAATTGAATCTACTTCGACGGTCAAGAATGACGATATTGATTCTGTTAATCATATGATCACAGAGATGCCGGTTGTTGAAGTTTCATCCACTTCTAATGAAAAAGCCGCAGTAGAAACAGAAAAATTAAAAACTCAAGAAGATTCCGAAGCAGCATCGTCACGTTTTTATTCTAAGTTCTACTACATCATCGTCATCTACGCTATGGTGGCAGCAAGCAATGCAGGTCTCCTGCATCATGCACCAGCAGTGTCTCATAATTATATTAACCATGAAGTCCAGGGACATTACGCTGTCCATGCAGCTCCCTCATATCAAGGGTCCCAGATTTCTCCAATTCACACATCAATTGTCCACCCAGCTCCAGCTGTTCATGCAGCGCCCGTTGCTTATACAGCCTCTGTACACGCTGCCCCAGCAGCTTATGCCTCACCTGCTCACTCAGCCCCTGTCTCCCATGCTGCTCCAGTTGCTCATGGGGCATCCAGTGCTCACGAAGATGACCACTACGTAGGAGAATTCGCTCATCCCAAATACGGCTACTCTTACTCCGTCGAGGATCCCCATACTGGTGACCACAAGTCCCAGCACGAGACTCGTGATGGCGATGTCGTAAAGGGCGAGTACTCTCTTCTTCAACCTGACGGTTCCTTCCGAAAAGTCACCTACACCGCTGACCACCACAATGGATTCAACGCTGTGGTTCACAACACTCCACCCGTCATCCATCATCATTGA

Protein sequence:

>DPOGS215902-PA
MVLNVLEVALFCCIIEICSAIRIDPNEGAGRELMNERRWQMVTLYPSHNQQLFKYNTNTYPKYKFEYSVSDKKTGDHKHHHEFRDGHRVQGGYSLIEPDGSLRTVEYNADDYNGFNAVVSRGFHRHGGHAFSTFDHTRHFHPIRSEMKIKHFLPSSNYLFDKPQNKGNIEVKPVSEKNPNQNLPEKKALDKEVEEELEDLSKMTTFDPIENENSITTAASAMEIPTTESDKTIESTSTVKNDDIDSVNHMITEMPVVEVSSTSNEKAAVETEKLKTQEDSEAASSRFYSKFYYIIVIYAMVAASNAGLLHHAPAVSHNYINHEVQGHYAVHAAPSYQGSQISPIHTSIVHPAPAVHAAPVAYTASVHAAPAAYASPAHSAPVSHAAPVAHGASSAHEDDHYVGEFAHPKYGYSYSVEDPHTGDHKSQHETRDGDVVKGEYSLLQPDGSFRKVTYTADHHNGFNAVVHNTPPVIHHH-