Monarch geneset OGS2.0

DPOGS212672
TranscriptDPOGS212672-TA1770 bp
ProteinDPOGS212672-PA589 aa
Genomic positionDPSCF300198 + 170028-173344
RNAseq coverage404x (Rank: top 30%)
Annotation
HeliconiusHMEL0075110.076.69% 
BombyxBGIBMGA004266-TA3e-16570.14% 
Drosophiladyl-PC8e-5637.84% 
EBI UniRef50UniRef50_D6WSX08e-13147.77%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WSX0_TRICA
NCBI RefSeqXP_968458.14e-13347.01%PREDICTED: similar to dusky-like CG15013-PA [Tribolium castaneum]
NCBI nr blastpgi|910861257e-13247.01%PREDICTED: similar to dusky-like CG15013-PA [Tribolium castaneum]
NCBI nr blastxgi|910861253e-13948.44%PREDICTED: similar to dusky-like CG15013-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[260-505] IPR0015071.1e-31Zona pellucida sperm-binding protein
Orthology groupMCL18288 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212672-TA
ATGGGTCGGTGGTGTGTGGTGGCACTGTGGGCGGTGATCGCCTGTGCCGCAGCTATAGACACTCCTTCTAGATTTCCCGATCTCCAGCCAGGCCAGCCAGATACCGATGAAATGGTCACCAATCTCCTTCAGTGGATTCAAGGATTCTACCAACAGAAAAATAGGAGAGCTCGACAATACTTCGGAGGAAAACCCGAAAAGCCGAGGCCATTTGAACCAGCTTCTTATCTCCCACCAGTGACAGGTTACCCCGCACCTGGCAGTCGGCCGACACCTTCATATAGTGAAGGCCCGAGTCAACCTAGTTCCCCATTCCTACCTAGCCAACCTGACCAACTACCGTTGGAACCTAGTCAATCTCCTTACCAGCCCAGCCAACCTGGCTATCAACCGAGTCAACCAAGTTATTCATCCTACCAGCCGAGTCAGCCCACTCAATCTTCATACGAACCCAGCCAACCAAACCGGCCCAGTGAGCCGTCACAACCCTCTTCTTACCAACCTAGTCAACCTACACAATCATCTTACGACCAAAATGAATCTTCAGAACCCAGTTACGAACCAAGTCAACCTGGTTATAAACCAAGTCAGTCGTCGCAACCTCAGCAATCAACTTCTCCTTCTTATCAACCTAGTCAACCTAATGAAGATTCAGGATTACCTGACCAGCCAGAAAAACCAGATCAGCCTGGAGATAATGATAATGGCCCACAGACTCCCATAAATCCAGAAGACGAAGACCGTCATCCACCTCATATTCACGATATCACCGTCGATTGTGGTAAACAAATGATGACCATTAACATTGAGTTCAACAAGGCCTACAACGGGATCATTTATTCTCAAGACCACTACAAAGACTCTGAGTGCATTTATGTTAAGGAGAATTCAAACCAAATCAAATATTCGTTTACAGTGAATCTAAATAAATGTGGAACAAGATTCTTTAGCGATTTCGAAAATGAAGGTCAAGCCTATCTTGAAAACGTTTTGGTTCTCCAAAACGAGCCAGGGATTCAAGAAGTCTGGGATCACATTCGGCGCGTAAGATGCCTGTGGGAAGGAAATTTAACTAAACAGCTAGTGTCATCTTTGAGCGTCGGTATGTTGAATCAGATAACAAGTAATTTCAGCGGTGATACTGCTATGGCTCGTTTGGATATTCAGACGGGCAGAGGACCTTTTGCCCCCGAAGCAAACGGGCTGATTAAGATCGGAGAAATTATGACTTTAGTAGTATCCGTAACTGGTGATGCAGGATTTGATATTTTAGTTAGGGAATGCATCGCTCGCGACTCAAGCAACACCAATATCGTCCCATTGACCGACTCAAACGGCTGCGTGCTAAAACCAAAACTATTCGGAGCATTCCAGAAAACAAGAGAGACAGGAAACACTGGGGCTTCGATTATCGCTTACGCATACTTCAATGCCTTTAAATTCCCAGACGAGATGGACCTGATCATACAATGTGATGTTGAACTGTGCAAAACGGACTGCGAAGTGTGCCCCAGTCCTGGCAGTACGGAGCCCAGGAGGAAGAGACGGGACGTCATTCACATAGGAAACAGAACTTACGAACCGGTTACTACTATTGAGAAGGGTTTGAGGGTTGTGTTCGCTGAAGATTTACCCAATGAAGCTGGATTATGCATTTCATCTACAGCAGCGCTGTGGGGAGGTGTTCTAGTTCTCGCTGGATCCCTATTAAGCAGTTTGCTAGTCACTCATTGTTGGCACAGATCGCGCGGTTCAGCGAAATTCGCGTAA

Protein sequence:

>DPOGS212672-PA
MGRWCVVALWAVIACAAAIDTPSRFPDLQPGQPDTDEMVTNLLQWIQGFYQQKNRRARQYFGGKPEKPRPFEPASYLPPVTGYPAPGSRPTPSYSEGPSQPSSPFLPSQPDQLPLEPSQSPYQPSQPGYQPSQPSYSSYQPSQPTQSSYEPSQPNRPSEPSQPSSYQPSQPTQSSYDQNESSEPSYEPSQPGYKPSQSSQPQQSTSPSYQPSQPNEDSGLPDQPEKPDQPGDNDNGPQTPINPEDEDRHPPHIHDITVDCGKQMMTINIEFNKAYNGIIYSQDHYKDSECIYVKENSNQIKYSFTVNLNKCGTRFFSDFENEGQAYLENVLVLQNEPGIQEVWDHIRRVRCLWEGNLTKQLVSSLSVGMLNQITSNFSGDTAMARLDIQTGRGPFAPEANGLIKIGEIMTLVVSVTGDAGFDILVRECIARDSSNTNIVPLTDSNGCVLKPKLFGAFQKTRETGNTGASIIAYAYFNAFKFPDEMDLIIQCDVELCKTDCEVCPSPGSTEPRRKRRDVIHIGNRTYEPVTTIEKGLRVVFAEDLPNEAGLCISSTAALWGGVLVLAGSLLSSLLVTHCWHRSRGSAKFA-