Monarch geneset OGS2.0

DPOGS210676
TranscriptDPOGS210676-TA1623 bp
ProteinDPOGS210676-PA540 aa
Genomic positionDPSCF300013 - 1112637-1124939
RNAseq coverage93x (Rank: top 62%)
Annotation
HeliconiusHMEL0224141e-11671.88% 
BombyxBGIBMGA006288-TA6e-7648.55% 
Drosophilam-PA1e-15092.02% 
EBI UniRef50UniRef50_E2AB445e-17962.21%Cuticlin-1 n=9 Tax=Endopterygota RepID=E2AB44_CAMFO
NCBI RefSeqXP_392051.11e-17760.51%PREDICTED: similar to miniature CG9369-PA [Apis mellifera]
NCBI nr blastpgi|3407253569e-17963.36%PREDICTED: hypothetical protein LOC100648459 [Bombus terrestris]
NCBI nr blastxgi|3800162456e-17963.07%PREDICTED: uncharacterized protein LOC100864111 [Apis florea]
Group
KEGG pathway 
InterPro domain[41-281] IPR0015075.2e-33Zona pellucida sperm-binding protein
Orthology groupMCL16402 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210676-TA
ATGGATCGGTCTCACGAGTCCTTCCAAATTTGTATGAAAAATACAAGAGCAAAAGCCGGTGAACTATGGCCTATGGAGCGACCGGAAGGTATGCCGGTCATACAATCTCTTGAAGTGATGTGCGGCAAAGACCATATGGATGTTCACCTCACATTCTCACACCCCTTCGAAGGAATCGTCTCCTCTAAAGGTCAACACGGCGATCCTCGTTGTGTGTACGTCCCGCCATCAACCGGACAGACGTATTTCTCGTTTCGAATCGCATACGCCAAGTGTGGTACAAAACCAGATCTCCACGGGCAGTTCTATGAGAACACGGTGGTAGTTCAGTACGACAAGGACTTGCTAGAAGTTTGGGATGAAGCGAAGCGTCTCCGGTGTGAGTGGTTTAACGATTACGAGAAAACAGCATCAAAACCACCAATGGTTATCGCAGACTTGGACGTAGTTCAGTTAGATTTCAGAGGTGACAACGTTGACTGCTGGATGGAAATCCAAGCGGGTAAAGGACCCTGGGCTGCCCCTGTCTCCGGAATTGTACCTCTCGGGTCAACTCTGACATTAGTTGTAGCTATTAACGATTATCGAGGTGAATTCGATATGCGTGTTAAATCTTGTGCAGCATCTGACGGTTCCGGCCATGTTATACAGCTGTCTGATGAGTACGGTTGCGTTCTGCGCCCTAAAATGATTTCCAGATTTCTCAAAGCGAAAGCCGTGGATGAGCGAGCAACAGTGATCACGTACGCATTCTTTCACGCGTTCAAATTCCCCGATGCGCTGAGCGTTCACATACGATGCAAGGTGGAAATATGCAGGCATGGTTGTCTAGATCACTGTCAGCTGGCTGGCATAGCGCCTCAGAGACAGGGGGCTTTAGATAGAAAGGATGATAGAACGCAGCAGGGCCATAATGATATCGGGGACTCGCTAGAAGAATCAGACCTCAGTCTGGCAGAGGATCACCAACGACTGCCGCTGGAAGAGGCCTTTGGTGATGAAGTATCTGGAGACGCGGTGCCCTCCCCAGCACCTCAACGAGCGCCCGCACCTGTCGTCGAAGCGGACTCCGATCATATGTCAGCTGTTGAAGAACTTGTAGAAGCACTCGCACGACCTTTCACCGATCGCCCATCCGATCTCGGTGAACGAGAGCGTTTCCCTCACGGACCACGAAATCTTGGCTCCAACGTCAAACTAGAACCGGGTGATGGCGTGATATCCGCAAAACATCCCGGACCAGCCGTAGCTGGTCCTAGGTCCTTGCGCAAACGAAGAACAGTGAGGGAAGCCAGATCCGCGGATGTTGGTGTATCAGGAATATACGATGTCGTCTCGGAGGCAGATCTCGCCTTTGGACCTACCAGGGAACCCGTTACTGTATTCAGTGGAAGAATACGCGAAGAGGTTGTATACGGAATTTGCATGGGCGCTGTGAGCTTCGGCGCTTTATTCGCTTCAGCGGCTGCTCGGCGACGCCCGCGTCGCGAACCAGCGGCTATATTGTTGCACAAACTCCGCTCGAGGACTCAACCTCCAGTGACTCCCGCTATCGCCTCTAACTCGCTCGCAGCCTGGATGGCTTTAAGACTTCTCGGATCACCACATTCGCGTGAAGGGTGA

Protein sequence:

>DPOGS210676-PA
MDRSHESFQICMKNTRAKAGELWPMERPEGMPVIQSLEVMCGKDHMDVHLTFSHPFEGIVSSKGQHGDPRCVYVPPSTGQTYFSFRIAYAKCGTKPDLHGQFYENTVVVQYDKDLLEVWDEAKRLRCEWFNDYEKTASKPPMVIADLDVVQLDFRGDNVDCWMEIQAGKGPWAAPVSGIVPLGSTLTLVVAINDYRGEFDMRVKSCAASDGSGHVIQLSDEYGCVLRPKMISRFLKAKAVDERATVITYAFFHAFKFPDALSVHIRCKVEICRHGCLDHCQLAGIAPQRQGALDRKDDRTQQGHNDIGDSLEESDLSLAEDHQRLPLEEAFGDEVSGDAVPSPAPQRAPAPVVEADSDHMSAVEELVEALARPFTDRPSDLGERERFPHGPRNLGSNVKLEPGDGVISAKHPGPAVAGPRSLRKRRTVREARSADVGVSGIYDVVSEADLAFGPTREPVTVFSGRIREEVVYGICMGAVSFGALFASAAARRRPRREPAAILLHKLRSRTQPPVTPAIASNSLAAWMALRLLGSPHSREG-