Monarch geneset OGS2.0

DPOGS206610
Transcript: DPOGS206610-TA (1974 bp)
Protein: DPOGS206610-PA (657 aa)
Genomic position: DPSCF300048 - 1201992-1219898
RNAseq coverage: 30x (Rank: top 76%)
Annotation
Heliconius       HMEL006573        3e-146   85.27%
Bombyx           BGIBMGA006288-TA  5e-33    29.07%
Drosophila       CG15020-PA        6e-171   68.60%
EBI UniRef50     UniRef50_F4W5J2   2e-174   60.31%   Putative uncharacterized protein n=6 Tax=Formicidae RepID=F4W5J2_ACREC
NCBI RefSeq      XP_001664068.1    0.0      68.96%   hypothetical protein AaeL_AAEL003724 [Aedes aegypti]
NCBI nr blastp   gi|157137901      0.0      68.96%   hypothetical protein AaeL_AAEL003724 [Aedes aegypti]
NCBI nr blastx   gi|157137901      5e-178   68.96%   hypothetical protein AaeL_AAEL003724 [Aedes aegypti]
Group
KEGG pathway:
InterPro domain: [299-551]  IPR001507  7.6e-33  Zona pellucida sperm-binding protein
Orthology group: MCL15466 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206610-TA
ATGGAAACCAGTGCAATGGATAAATTTCCCGAAGGAGAGTCTTGGAGACGACGTTGTGGATGTTTACGTACTAGTGACAAGCTGAAACATATCCTCAACTGTTTTCCGAGCCCACCTCCACCCACACCAGACAATAAAAATAATATCAATCATAATAATAATAATCTTGGAACCACCTGCATATCTGCATATTGTACGAACGTACATCAAACGAGAATGGGCATGCCAAATTCTTGCATGGACTCCCTACATGATTATGATGTTTCTGATGAAAAGGCTCTGAAAAAGTCGAGCGAGAAGTCATGGAGAGTGATTGGTGACGTAGTACAATTAATACAACTGTCGGTCACGGCCACGGCCCTGGCTCTCTACTATCTTATCTATTGCTACATGCAGCTCATTTATTACACTTTGCGAAGCGCCCTCTACTTTCATAATGCTGATGGCGGCCTTATCTTAAAAGGCTACCGATTGGAATCATTTCACAGACGAAGTATTACGGAGGCTGCGATCGACCCCACTTCTAATCGAAACGCCGGCGCGGGCGCATGTTGCACAGTACGACCAGCCTATTCGGTGCTGCCACATCCTCGGCTAGACTCAGCTGAGTCTACAGAACAGTCGACTGAGTCTGCTTTACCTGGAGACCTACACGAAGGTTCGGGAGAAGACCCCACTGGACCCCCCACGAATAAGAATGAGACCAGAGAAGCCCTTAAATCAGATGGCAGTGAGCAAGACCCAGTAGTGTTTCTATCTGATAAGAACAGTTACATCCCAGTTAGCATCAAGCCAAGATCACCAGTCGACTCTAGTTGGAGGGAGCCACCAGCTTGGGAAAGGGAGTACAGAAGACATCGCACTAACGGCAGCAGAGTACAATACATCGAAGCAGAATGCCAAGACGACTTCATGAAAATCAGAGTCGGTTTCAACGGCTCGTTCAGCGGTCTAGTTTATTCAGCTGGTAGCTACTCCTACGATCCTGACTGTATGTACATAAACGGTTCTGGTCGTGACTATTACGAGTTCTACATACAACTGAACCGATGCGGAACCCTGGGAAGAAACGGGCAACACGAGGACACCAGGAAACATCCAGCTAAGAATCTGATGTGGAACACCATAACAGTTCAGTATAACCCACTGATTGAAGAAGAGTTGGACGAACACTTCAAGGTCACCTGCGAATACGGATACGATTTTTGGAAAACTGTCACCTTCCCCTTCCTGGACGTTGAAGTGGCGACAGGCAATCCGGTGGTCTTCACTCTGCAGCCTCCAGAGTGTTATATGGAGATAAGATCTGGTTATGGGGCGACCGGGGCAAGGGTCACGGGACCAGTGAGGGTTGGAGATCCTCTGACTTTGCTCATTTACATGAGAAGCGCGTATGATGGTTTTGATATAGTCGTCAATGATTGTTTCGCACACAACGGGGCTGCGAAGAGAATCCAACTCATAGATGAATATGGATGTCCCGTGGATGATAAACTGATATCTCGCTTCCGCGGCTCCTGGTCCGAGTCCGGTGTCTTCGAGACTCAGGTTTACGCGTACATGAAGACTTTTCGCTTCACCGGATCGCCAGCCTTATATATTGAATGCGATGTCAGGATGTGCCATGGGAGATGTCCGTCACAACCCTGCCATTGGCGTAACATGAAGAGTGTAAAGAAACGTTCAGCTGAAGCGGAGACGGCGAGCGTAGCGCCGCGTCTATCTGAAAACATCTCTCTCTTTCAGTCGTTGAGAGTTCTACAGGAGGGGGAAGAGGACGAGGATATGGCACGAGCTGCAGCTGAAGGACAGACGTGTATGAAGACGTCAGCGCTCTCGGCCATGATAGTGTCTTGTAGTGTGCTCGTGGCAGCCCTGCTGGTAGCCCTCCTTGTTGCCGCTAGAGGATGGCGGCAAAGCGAAAAAAGGACGCCAATACACGCTTATGTACCACACAAGGGAAGAATAAAATAA

Protein sequence:

>DPOGS206610-PA
METSAMDKFPEGESWRRRCGCLRTSDKLKHILNCFPSPPPPTPDNKNNINHNNNNLGTTCISAYCTNVHQTRMGMPNSCMDSLHDYDVSDEKALKKSSEKSWRVIGDVVQLIQLSVTATALALYYLIYCYMQLIYYTLRSALYFHNADGGLILKGYRLESFHRRSITEAAIDPTSNRNAGAGACCTVRPAYSVLPHPRLDSAESTEQSTESALPGDLHEGSGEDPTGPPTNKNETREALKSDGSEQDPVVFLSDKNSYIPVSIKPRSPVDSSWREPPAWEREYRRHRTNGSRVQYIEAECQDDFMKIRVGFNGSFSGLVYSAGSYSYDPDCMYINGSGRDYYEFYIQLNRCGTLGRNGQHEDTRKHPAKNLMWNTITVQYNPLIEEELDEHFKVTCEYGYDFWKTVTFPFLDVEVATGNPVVFTLQPPECYMEIRSGYGATGARVTGPVRVGDPLTLLIYMRSAYDGFDIVVNDCFAHNGAAKRIQLIDEYGCPVDDKLISRFRGSWSESGVFETQVYAYMKTFRFTGSPALYIECDVRMCHGRCPSQPCHWRNMKSVKKRSAEAETASVAPRLSENISLFQSLRVLQEGEEDEDMARAAAEGQTCMKTSALSAMIVSCSVLVAALLVALLVAARGWRQSEKRTPIHAYVPHKGRIK-
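
Consistency check:

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes that the listed transcript is coding sequence only (it begins with ATG and ends with TAA) and that the trailing "-" in the protein sequence marks the stop codon. The names expected_cds_length and domain_fits are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1974          # DPOGS206610-TA
PROTEIN_AA = 657              # DPOGS206610-PA
DOMAIN_SPAN = (299, 551)      # InterPro IPR001507, 1-based residue positions


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues plus an optional stop codon."""
    codons = protein_aa + (1 if has_stop else 0)
    return 3 * codons


def domain_fits(span: tuple, protein_aa: int) -> bool:
    """True if the 1-based, inclusive domain span lies inside the protein."""
    start, end = span
    return 1 <= start <= end <= protein_aa


# Assumption: the transcript is CDS only, so its length should match exactly.
lengths_consistent = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP
domain_consistent = domain_fits(DOMAIN_SPAN, PROTEIN_AA)
domain_length = DOMAIN_SPAN[1] - DOMAIN_SPAN[0] + 1

print(lengths_consistent, domain_consistent, domain_length)
```

Under these assumptions, 657 residues plus one stop codon account for all 1974 bp, and the annotated InterPro domain falls within the protein.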