Monarch geneset OGS2.0

DPOGS210569
Transcript        DPOGS210569-TA  1848 bp
Protein           DPOGS210569-PA  615 aa
Genomic position  DPSCF300408 - 45947-62368
RNAseq coverage   709x (Rank: top 18%)
Annotation
Heliconius      HMEL005798        2e-173  86.51%
Bombyx          BGIBMGA009681-TA  3e-105  76.03%
Drosophila      pio-PB            2e-74   43.27%
EBI UniRef50    UniRef50_E0VID3   1e-78   47.60%  Cutilin-1, putative n=10 Tax=Pancrustacea RepID=E0VID3_PEDHC
NCBI RefSeq     XP_002425877.1    2e-79   47.60%  cutilin-1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242010238      4e-78   47.60%  cutilin-1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|242010238      1e-77   46.53%  cutilin-1 precursor, putative [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain  [113-371] IPR001507  4.7e-42  Zona pellucida sperm-binding protein
Orthology group  MCL15704  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210569-TA
ATGTGGAGTATTCTTCTCATAGCCGCTCTATTCGCGACTGCGCGCTCCAAGCAAATACCACCACACACCGAACTGGCCACGACGACTGTCGCTACAGCAACCGAGCTATCAGAATGGCCGGCGGCTCTCGCGCCTCTGGACACCTTGACGGTCACTTGCGAGAAAGAGAACATGCATGTCACCATCTCGCTCGCACACACAAGACATCCCAACAGATCTATCATACTCTTGAAAACCGAAGTCTCAGTTGAAGAACTGGCCACGACGACTGTCGCTACAGCAACCGAGCTATCAGAATGGCCGGCGGCTCTCGCGCCTCTGGACACCTTGACGGTCACTTGCGAGAAAGAGAACATGCATGTCACCATCTCGCTCGCACACACACGACATCCCAACAGTATTTACGATACTTTTAACGGTATCGTGTACCCCGCGGGTCTTGGCAGTAACTCGACGTGTCTGAGAGAGTACGTCTCAGCCAGGGGCGACCTTCAGTACACACTCCCCCTCAAAGGATGTAACACAATGAGCACAGATAATGATGATGGTACTGTGGAGTATCACAACAACATCATAGTTCAGCCTCACCTTCGCTTGGTGACCGGCCAGGGTCGAGGGTACCACGTCAGGTGTAGGTATCGTCGGAGAGACCTCACCCTGTACCACCTACATAGACCTCATGCCGACAGATACATCGGCTCCAGCAACGATTATAACAGTGACGAGTACGACGAGGACTCGGGCCTGCTCCCATCAGTCACCATGAAGATATACAAAGGAGACCCCGAAGATAAAGAGGTGGCGTCTAATGTGAAGATCGGTGACACTCTGACGTTAGTGGTGTCTCTTGAGAAACAGCGGCAGTATGGTCTACTGGTGTCAGAGTGTACCGTCCGTGATGGACTGGGCTGGGCGGAACAGAGCCTCATAGCTGACGACGGGTGCCCTCTCGACGGGGAGATCATGGGTCTGTTCCAGTATTCGAGTGAGAAGCAGGAGGCGAAGGTCTCATTCCCCGCACACAAATTCCCTTACACTGCCAGCGTGTACTACACGTGTGAAGTGAAGCTTTGCGACCTCAACCATCCCACTGACTGTGAGCCGTGCAGTCACAAGCGGCGTGTCCGTCGCCAGTCTGACGAGTCCCCGGCCACCGTGGAGGTGTTTTCGGGCCTGTATGTCAACGAGGCCGACTCCCCCGACAATGACGTCACCAGCGAAAAGGTGGCGTCTAATGTGAAGATCGGTGACACTCTGACGTTAGTGGTGTCTCTTGAGAAACAGCGGCAGTATGGTCTACTGGTGTCAGAGTGTACCGTCCGTGATGGACTGGGCTGGGCGGAACAGAGCCTCATAGCTGACGACGGGTGCCCTCTCGACGGGGAGATCATGGGTCTGTTCCAGTATTCGAGTGAGAAGCAGGAGGCGAAGGTCTCATTCCCCGCACACAAATTCCCTTACACTGCCAGCGTGTACTACACGTGTGAAGTGAAGCTTTGCGACCTCAACCATCCCACTGACTGTGAGCCGTGCAGTCACAAGCGGCGTGTCCGTCGCCAGTCTGACGAGTCCCCGGCCACCGTGGAGGTGTTTTCGGGCCTGTATGTCAACGAGGCCGACTCCCCCGACAATGACGTCACCAGCGAAAAGAAAGAGGACGAGATATGCATATCACAGAAGAACTTCGCGATTGGTATCTGTATAGCGGGAGTCATCCTCATGATATGCGTGATCGCTGCCATAGCGTTCATACTCGCCAGACGAAGGAATCCCAAGACCTACTCCAGGACCGGCAGCTCACTATACAGCGGCCCTTACACAAACACTGGATATTCACACACTAGTTAA

Protein sequence:

>DPOGS210569-PA
MWSILLIAALFATARSKQIPPHTELATTTVATATELSEWPAALAPLDTLTVTCEKENMHVTISLAHTRHPNRSIILLKTEVSVEELATTTVATATELSEWPAALAPLDTLTVTCEKENMHVTISLAHTRHPNSIYDTFNGIVYPAGLGSNSTCLREYVSARGDLQYTLPLKGCNTMSTDNDDGTVEYHNNIIVQPHLRLVTGQGRGYHVRCRYRRRDLTLYHLHRPHADRYIGSSNDYNSDEYDEDSGLLPSVTMKIYKGDPEDKEVASNVKIGDTLTLVVSLEKQRQYGLLVSECTVRDGLGWAEQSLIADDGCPLDGEIMGLFQYSSEKQEAKVSFPAHKFPYTASVYYTCEVKLCDLNHPTDCEPCSHKRRVRRQSDESPATVEVFSGLYVNEADSPDNDVTSEKVASNVKIGDTLTLVVSLEKQRQYGLLVSECTVRDGLGWAEQSLIADDGCPLDGEIMGLFQYSSEKQEAKVSFPAHKFPYTASVYYTCEVKLCDLNHPTDCEPCSHKRRVRRQSDESPATVEVFSGLYVNEADSPDNDVTSEKKEDEICISQKNFAIGICIAGVILMICVIAAIAFILARRRNPKTYSRTGSSLYSGPYTNTGYSHTS-
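
The record lists a 1848 bp transcript and a 615 aa protein. If the transcript is coding sequence only (an assumption; the record does not say whether untranslated regions are included), then 1848 = 3 x (615 + 1), where the extra codon is the stop. The sketch below checks that arithmetic and translates the first ten codons of DPOGS210569-TA with the standard genetic code. The names `translate` and `lengths_consistent` are hypothetical helpers introduced for illustration and are not part of any database tooling. The sketch writes a stop codon as `*`, whereas the record above writes it as `-`.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (first position varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop.

    Assumes the transcript contains no untranslated regions.
    """
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 bases of DPOGS210569-TA, copied from the record above.
print(translate("ATGTGGAGTATTCTTCTCATAGCCGCTCTA"))
print(lengths_consistent(1848, 615))
```

The translated prefix can be compared against the first ten residues of DPOGS210569-PA (MWSILLIAAL); running `translate` on the full nucleotide sequence would extend the same check to the whole record.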