Monarch geneset OGS2.0

DPOGS209417
Transcript: DPOGS209417-TA, 2241 bp
Protein: DPOGS209417-PA, 746 aa
Genomic position: DPSCF300346 + 295762-298229
RNAseq coverage: 4x (Rank: top 89%)
Annotation
Heliconius: HMEL021396, 2e-30, 44.44%
Bombyx: BGIBMGA012604-TA, 2e-35, 54.92%
Drosophila: Cpr67Fb-PA, 2e-18, 48.94%
EBI UniRef50: UniRef50_C0H6K5, 5e-33, 54.92%, Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6K5_BOMMO
NCBI RefSeq: NP_001166734.1, 9e-34, 54.92%, cuticular protein RR-1 motif 14 [Bombyx mori]
NCBI nr blastp: gi|290560806, 2e-32, 54.92%, cuticular protein RR-1 motif 14 [Bombyx mori]
NCBI nr blastx: gi|290560806, 1e-41, 25.44%, cuticular protein RR-1 motif 14 [Bombyx mori]
Group:
Gene Ontology: GO:0042302, 1.4e-14, structural constituent of cuticle
KEGG pathway:
InterPro domain: [287-336] IPR000618, 1.4e-14, Insect cuticle protein
Orthology group:

Nucleotide sequence:

>DPOGS209417-TA
ATGAAAATAATATGGGTACTTTTGGGGCTCATAGGCTCAGCCCTAGCGGAGAAATTAGATCGTTCATATCTTCCACCACCTGGATCAAAATTTTCTGGAGGCAGTCCAGGGGCAATCGATGTTCCACTAGAGTTCCCTAAGGAAACGGTTCTTCCTAACCCTGGCAGCAATAATTTGGGAAAACCAGAAATAGCAATTGGTATCAACCGCATTAGCCCAATAGCCTTCAATCAACAATATGGCAGTACATCAGAACCATACCAAAAAAATGAATATAACAGTCCTGCAACTGAATCGTATGCATTATCAGAAATTTCAACAAAAACACCTGATGTGGTTTCTAACGATTTCGTTCAAACTCCTAACTTAGGTGATTTGAAAATCAATTATGAAAACTTGTACCAGACTTCATCAACCACAAAGCCTGGCAACCTTTTTAGTGTGATAGAAGAAACAAAATATATTAATAATGGTGGTGATCTTACAGATGACAACAATTACGAAACAGATAATAAAATAAGCGATGAAAGTAAAAAAATATACGGTGGTGATATTGACAGTATTCCAGAAATTAATTCATACTTTGCGAAACCTTCGTCACCTGCTGCATTTCATTTTAGTTTGTCAAAATTATATGAAACTGGTGCCAATGTTTCGTCTACACCGACTTACGACAGTTCATCGAATTTATCTGAAAGTTTATATAGCACAAAACCATCGCAATACGGTCTGAAACCTGATTCAAAGACAACATTCAACATTCCTTACACTTATTCACCACGTACAGAAAGAATACAGGCGCAGAGAGATAGGGAAGCAATCATCTTAAATTATGACAGTGAAATTACTCCAGATGGTTATGCTTATAGCTTTGATACGTCAAATGGAATCCATGTAGATGAAAAGGCAACTGCACTAAATGGAGTCCGGGCCACTGGATCATATTCATACATTGGAGATGATGGCAAACTCTATAATGTCAGTTATACAGCTGATGAAAATGGATTCAGGCCTATTGGGGATCATTTGCCATCTCCTCCGCCAATTCCTGATGCCATTATGAAAGTCATAGAACAGGCTACAAAAGATAGAGATTTAGGAATTTATGATGATGGAACTTATGATGACAGTAAATATGGCTATAGAAATTATGAACCAAAAAACAAAATAGGATTTTTAAATAAATATAAAACAGGTGATCCCAAATACATCAAAAAGCAAACAGACCAGAAAGAAATCAAAAACAAGAAAATTCTTTCAAATAAAAAAGAAATGGGGGACAAATTGATTACAAAAGATTCGATATTTACACCTCCGGTCACGACTTTATTACCTTACGAGGAGGATAGATCAAAGAGTAATTATGATGAATTTAAAAATGAAAATATTAATGATTCAGAAAATGAAGGTAATATATTGAGTAATGAAAGTGATGGAACTGGGTACGAATACTCAAAACCTCTCGACGACTTCTCTTTGGTAACAGACAAAGAGTTTGTCAATAGAATTTCAGAGACTCCAAATCTGAATACCGTTCGAATTATTGAAGAGAAGACAAACGGTAATATTCGAGGAAAACCGTTCATGACACCGTACGTTTATGAAAATGATAACACATTTCTCGACTATGACAACAGTGAATCAGTATTGGAAACACTAGGACAATACCAAGTTCAAAACAAAAATATACCGAGTGTTCAGAATACAAACACTGTCCTTCCAACTTTTTCAATTTCAGAATTAAATTCTACCAATAATATAAGTGGGGATACTGGGATAACTAACATACTACCCCAAGATCATCAAGGATACTTCTATCCCACCACAGGATCTAATTTCAACAGTGATGCTTATAATCCCATTGCAATTTCTTCCGAGAAAAACATATCTGAAAATCAACCAACAATTCCAAGAGAATTTCCTTCAAGGTTAAATTTGCAAGCTACAAAAGTAGAAACTGTTTCCAGCAATCCATCCCCATATAGAGACTTCGGTTTATT
CAACGATGAATCTAAAATTAGAAATGATTCTACAGATGTGGGAGATAAACGAATGTTTATACAACAAGAAACAACCCAACCCAGTGATCAAAACAGTTATGGTGAATACATTTCAGTACCTACGAATGAATTAAATGACACAGAATTCACAATTAACAGAGAAAATGCCATCAAAGGCGAGGATTTCAGTGGTCCGAAACAGAGACAAAAATATGACCCTCTTACTGGATACTACTATTGA

Protein sequence:

>DPOGS209417-PA
MKIIWVLLGLIGSALAEKLDRSYLPPPGSKFSGGSPGAIDVPLEFPKETVLPNPGSNNLGKPEIAIGINRISPIAFNQQYGSTSEPYQKNEYNSPATESYALSEISTKTPDVVSNDFVQTPNLGDLKINYENLYQTSSTTKPGNLFSVIEETKYINNGGDLTDDNNYETDNKISDESKKIYGGDIDSIPEINSYFAKPSSPAAFHFSLSKLYETGANVSSTPTYDSSSNLSESLYSTKPSQYGLKPDSKTTFNIPYTYSPRTERIQAQRDREAIILNYDSEITPDGYAYSFDTSNGIHVDEKATALNGVRATGSYSYIGDDGKLYNVSYTADENGFRPIGDHLPSPPPIPDAIMKVIEQATKDRDLGIYDDGTYDDSKYGYRNYEPKNKIGFLNKYKTGDPKYIKKQTDQKEIKNKKILSNKKEMGDKLITKDSIFTPPVTTLLPYEEDRSKSNYDEFKNENINDSENEGNILSNESDGTGYEYSKPLDDFSLVTDKEFVNRISETPNLNTVRIIEEKTNGNIRGKPFMTPYVYENDNTFLDYDNSESVLETLGQYQVQNKNIPSVQNTNTVLPTFSISELNSTNNISGDTGITNILPQDHQGYFYPTTGSNFNSDAYNPIAISSEKNISENQPTIPREFPSRLNLQATKVETVSSNPSPYRDFGLFNDESKIRNDSTDVGDKRMFIQQETTQPSDQNSYGEYISVPTNELNDTEFTINRENAIKGEDFSGPKQRQKYDPLTGYYY-
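
The lengths in this record agree with each other: 2241 bp is 747 codons, that is 746 residues plus one stop codon, which the protein sequence shows as a trailing "-". Below is a minimal Python sketch of that check. It assumes the standard genetic code (the record does not state which translation table was used), and the names translate and expected_protein_length are illustrative, not part of the database.

```python
# Standard genetic code (NCBI translation table 1). This is an assumption:
# the record itself does not name a translation table.
# Codons are enumerated in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as stop_symbol to match the record's
    convention of ending the protein with "-".
    Codons containing anything other than A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count for a transcript that is pure CDS ending in one stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1  # the final codon is the stop, not a residue


if __name__ == "__main__":
    # First 15 nt and last 12 nt of DPOGS209417-TA, copied from the record.
    print(translate("ATGAAAATAATATGG"))   # MKIIW
    print(translate("TACTACTATTGA"))      # YYY-
    print(expected_protein_length(2241))  # 746
```

To check the whole record, join the nucleotide sequence lines into one string before passing it to translate, then compare the result with the protein sequence.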