Monarch geneset OGS2.0

DPOGS210120
Transcript: DPOGS210120-TA | 1491 bp
Protein: DPOGS210120-PA | 496 aa
Genomic position: DPSCF300017 + 1394939-1398318
RNAseq coverage: 1x (Rank: top 94%)
Annotation
Heliconius: HMEL006466 | 0.0 | 80.25%
Bombyx: BGIBMGA000226-TA | 8e-163 | 74.75%
Drosophila: Cpr73D-PB | 2e-66 | 46.81%
EBI UniRef50: UniRef50_C0H6N9 | 1e-160 | 74.75% | Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6N9_BOMMO
NCBI RefSeq: NP_001166709.1 | 3e-161 | 74.75% | cuticular protein RR-1 motif 47 [Bombyx mori]
NCBI nr blastp: gi|290563239 | 5e-160 | 74.75% | cuticular protein RR-1 motif 47 precursor [Bombyx mori]
NCBI nr blastx: gi|290563239 | 4e-169 | 74.50% | cuticular protein RR-1 motif 47 precursor [Bombyx mori]
Group
Gene Ontology: GO:0042302 | 5.3e-08 | structural constituent of cuticle
KEGG pathway:
InterPro domain: [226-264] IPR000618 | 5.3e-08 | Insect cuticle protein
Orthology group: MCL17151 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210120-TA
ATGTTTTGGACATACGGTGCGGTCGTCCTGAGTGTGTGTGTGTCCGTCCGGACTCAAGTTATTCCGGGCAAGTCCAGGACTAACGATGGTGACTTTAATAATGTTAACGCTGATGGATCTTTCGATTTCGGGTACGCGAACAAAGATCGTGGTGGCAGCTACCACTTGGCGCAAGGTAGTTCGAAAGGACTAGTGGGAGGACGGTTCGGTGCCAGAGAGCCTGGTACTGACGAAGTCAAAGAAACTATTTATACTGCTGGTCCTAGAGGATTTCGCGCTAAGGGCCCTAACGTTCACCGAAAGATAGATCTGGATCAGCGGCCGCGAGGTCCCATTGGAAATAAAGACGATCCCTATTTCGATCCTAACGAAGATCCCAGTTACGCGTACAAAATAGAGACGAGAACTTACTCCAAGAATGAGAATGCTGACAGCAGAGGCGATGTCAAAGGTCATTACTCCTTCGTGGATGATATCGGAGAACGGCACGACGTGTCCTATATAGCTGGGCGCGATACAGGTTTTCATGTATCCTCAGCCAACCCTGACGTGCCCAGTCTTATTGGGTCACCTTTTCACCGAGCGCCTCTGGTTAGAGGGGAGAGCAAATCTCGAGGACGTACTGCGGTACAGAGAGGATTAGATGGTTCATATAGATTCATTTCTGCCGGACCTGACCAACGGCGAACGGAAAGTAGTGACTCACACGGTAACGTTAGAGGATCCTACACATTTTTAGATGACAAAGGTGTACAAAGGACAGTACATTATATAGCGGGGCCAGGTATTGGATATCGGATTGTGAAGAACAGCAACGATCCCTTCATTCCTTCCTATTTTCCTACTATACCTAGTCCTTATGATCCGGCATTTAACGCAGGAGGTAGCGCTGGGGCACCGGCTTTCGCTCCCAGCGACGAAGGTAGCGATGATGTCTTCAAAGGACCCGATGGCACTGCAGCGTCAGGGCACGTTAAGCCACCTCCTTTCCCTCCCTCCGAATCAGAGAGACCTAGTAACACTGGTAGCATATCGACAGTAATTGAAACTCCTGATGATTCAGATAACAACGGTTACGACACAGGTCCAAGTTTTAATCAGGAACCTGATAATACAGATCTGGGTTACGTCGACGAGGACGCTTCTAGTTTTCAAAATCAAAAGCCAACTCAAGCACCAGGCAAACCGTGGCGCCCAGAGAACAAACCGTATAGACCGTCGAAGAAACCGTTTAGACCGATAAAACCTTATCAAGAAACGAATAATAATAACGACAATTTCGCTGGTGATAAAAGTAAACCAGAATTTGCTGTAGGATTCAATATACATCACACGAAACCTGGCACAACGATCATTAGGAATATAGGTGAAGAATACTTCGGCATACCTCCTGGTGTGTCCGTCCGCGCTCATGTACAGAGCATCGATCTTTATCCCTTCGGTTCCAAACCAATTTCACCATCAGAGGCTCTGGAAAATGACCAAACATAG

Protein sequence:

>DPOGS210120-PA
MFWTYGAVVLSVCVSVRTQVIPGKSRTNDGDFNNVNADGSFDFGYANKDRGGSYHLAQGSSKGLVGGRFGAREPGTDEVKETIYTAGPRGFRAKGPNVHRKIDLDQRPRGPIGNKDDPYFDPNEDPSYAYKIETRTYSKNENADSRGDVKGHYSFVDDIGERHDVSYIAGRDTGFHVSSANPDVPSLIGSPFHRAPLVRGESKSRGRTAVQRGLDGSYRFISAGPDQRRTESSDSHGNVRGSYTFLDDKGVQRTVHYIAGPGIGYRIVKNSNDPFIPSYFPTIPSPYDPAFNAGGSAGAPAFAPSDEGSDDVFKGPDGTAASGHVKPPPFPPSESERPSNTGSISTVIETPDDSDNNGYDTGPSFNQEPDNTDLGYVDEDASSFQNQKPTQAPGKPWRPENKPYRPSKKPFRPIKPYQETNNNNDNFAGDKSKPEFAVGFNIHHTKPGTTIIRNIGEEYFGIPPGVSVRAHVQSIDLYPFGSKPISPSEALENDQT-