Monarch geneset OGS2.0

DPOGS206100
TranscriptDPOGS206100-TA642 bp
ProteinDPOGS206100-PA213 aa
Genomic positionDPSCF300028 + 199450-201295
RNAseq coverage6904x (Rank: top 2%)
Annotation
HeliconiusHMEL0057732e-6487.01% 
BombyxBGIBMGA006826-TA4e-5885.81% 
DrosophilaTwdlE-PA1e-4778.91% 
EBI UniRef50UniRef50_Q8SZP22e-4578.91%RE71854p n=22 Tax=Neoptera RepID=Q8SZP2_DROME
NCBI RefSeqNP_001166627.11e-6985.33%cuticular protein tweedle motif 4 [Bombyx mori]
NCBI nr blastpgi|2905608692e-6885.33%cuticular protein tweedle motif 4 precursor [Bombyx mori]
NCBI nr blastxgi|2905608698e-9085.33%cuticular protein tweedle motif 4 precursor [Bombyx mori]
Group
KEGG pathway 
InterPro domain[85-184] IPR0041451.8e-42Domain of unknown function DUF243
Orthology groupMCL14918 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206100-TA
ATGAGAACCATCGAAAGTGCCCCCTCCCCGGACATGGAACATACATCTCATATAAACGGTGGTGCATCAGGTATCTGCACATTACACCTTCTCAGACTCTGTGACAGGATGTGTGACAAGATGGTATTGAGGTCATTAATTTTCGCTGCGGTGACGGTCTGCGTGCTTGCCCGGCCGGAACCACCGTCATCGTACGGGCCACCTTCAACTTCCTATGGAGTCCCAGCTCCTCAGTATGGGCCACCTCAGCAGCCGATTGTTCACAAGCATGTGTACGTACATGTGCCACCCCCTGACAACGAACGCCCACCACCAGCGAAACAAATTTACGTGCCACCACCGCAGAAGCATTATAAGATAGTGTTCATCAAGGCACCAGCTCCTCCGGCACCAACCGCGCCAGTCATCCCTGTGCAACCCCAGAACGAGGAAAAGACCCTCGTATACGTGCTGGTTAAGAAACCAGAAGATCAACCTGACATTGTCATTCCTACTCCAGCACCCACTCAACCATCCAAGCCGGAGGTCTACTTCATCAGATACAAGACACAGAAACAAGAAGGATACCCTGACGCCTCACCACCAGCACCTTCATACGGTCCACCGGCATCTGAACCGTCATCCGAATACGGCGCCCCTTAA

Protein sequence:

>DPOGS206100-PA
MRTIESAPSPDMEHTSHINGGASGICTLHLLRLCDRMCDKMVLRSLIFAAVTVCVLARPEPPSSYGPPSTSYGVPAPQYGPPQQPIVHKHVYVHVPPPDNERPPPAKQIYVPPPQKHYKIVFIKAPAPPAPTAPVIPVQPQNEEKTLVYVLVKKPEDQPDIVIPTPAPTQPSKPEVYFIRYKTQKQEGYPDASPPAPSYGPPASEPSSEYGAP-