Monarch geneset OGS2.0

DPOGS211178
TranscriptDPOGS211178-TA1875 bp
ProteinDPOGS211178-PA624 aa
Genomic positionDPSCF300007 + 421698-428361
RNAseq coverage47x (Rank: top 71%)
Annotation
HeliconiusHMEL0124184e-16386.22% 
BombyxBGIBMGA003167-TA1e-15172.85% 
DrosophilaCG16798-PA2e-4839.51% 
EBI UniRef50UniRef50_D6WL589e-7559.18%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WL58_TRICA
NCBI RefSeqXP_972996.12e-7559.18%PREDICTED: similar to CG16798 CG16798-PA [Tribolium castaneum]
NCBI nr blastpgi|910829333e-7459.18%PREDICTED: similar to CG16798 CG16798-PA [Tribolium castaneum]
NCBI nr blastxgi|910829332e-7159.18%PREDICTED: similar to CG16798 CG16798-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[146-259] IPR0015072.4e-06Zona pellucida sperm-binding protein
Orthology groupMCL18425 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211178-TA
ATGACAGAGGTCATATTACCGATTAGGGAACAGGCCAAGGTTCTGCACCTTATAGGCGAGCCAGACTTTCGCGATGTCCGATTGCGTTGGGAGTACGGCGGGAATGAAGATGAAGATTTGCAGAAATTACTAGCCTTTCAAATACATTACTGTGAACTGCAAGCCTGGGGGCAATACCGATGTAGAACTAAGGTGGTAGATAATTTTGAAGAAGAGAGATCTTCTAGGATGACGATTGAGACGACCACCAACAAGGTCCCAGCCGGTAAACGCGGTCGGACCTACACCACATATATCTCTGGCCTCCGTATGGCCACCACGTATTCTTTCGAAGTACGTCCTGTCAAACGTGAAGCTCGTGATTTGGCTGACCCGCAATCTATTGGATCTAAAATCATCATTGTACCTACTAAAGGATTTTCAGCGCGAGCTACCCAGTGCTTGCCTCATGCTAGTGAAGTCGAGGTTTCCACGGGGCCCTTCTTTGGGGGTCGGATAGCTGTGGAGGCGGCTGACGGTGGACCGGAGAGATGCTCCCTTCAAGGGAACCCGAACAGCGCCCAAGACGCGTACATACTAAGGATTCATCATGAGGAATGCGGTTCGGAAGTCAACGAAACTACCGTCGCGACTTATGTCATAGTACAAGAGAACCTGCCGATTCTAACTCACAGTACCCGCCGTTTTCTGGTGTTATGCACCTACAAACCGGAGACATTGACGGTGAGGGCCGGCATCAACCTGCCAAAGACGAATCCAGGGGATGTTCTGTTGGAGACGAAACCACAAGGAATCGTTTTGCACCTTATAGGCGAGCCAGACTTTCGCGATGTCCGATTGCGTTGGGAGTACGGCGGCAATGAAGATGAAGATTTGCAGAAATTACTAGCCTTTCAAATACATTACTGTGAACTGCAAGCCTGGGGGCAATACCGATGTAGAACTAAGGTGGTAGATAATTTTGAAGAAGAGAGATCTTCTAGGATGACGATTGAGACGACGACCAACAAGGTCCCAGCCGGTAAACGCGGTCGGACCTACACCACATATATCTCTGGCCTCCGTATGGCCACCACGTATTCTTTCGAAGTACGTCCTGTCAAACGTGAAGCTCGTGATTTGGCTGACCCGCAATCTATTGGATCTAAAATCATCATTGTACCTACTAAAGGATCGCGAGCTACCCAGTGCTTGCCTCATGCTAGTGAAGTCGAGGTTTCCACGGGGCCCTTCTTTGGGGGTCGGATAGCTGTGGAGGCGGCTGACGGTGGACCGGAGAGATGCTCCCTTCAAGGGAACCCGAACAGCGCCCAAGACGCGTACATACTAAGGATTCATCATGAGGAATGCGGTTCGGAAGTCAACGAAACTACCGTCGCGACTTATGTCATAGTACAAGAGAACCTGCCGATTCTAACTCACAGTACCCGCCGTTTTCTGGTGTTATGCACCTACAAACCGGAGACATTGACGGTGAGGGCCGGCATCAACCTGCCAAAGACGAATCCAGGGGATGTTCTGTTGGAGACGAAACCACAAGGAATCGTGGAGCCTTACGATGACAATAACCTGCAGCCCGCAAGACTTGAAGCCAGGAGGGAAGAAACACAGCAGAGTATGTTCGGGGAAATTATGTTAGTGATGTTCTTGGTGGCGGCAGCGTTTGGAGGTGTCGCTTTTCTGATATGGAAGGTTGTGCCGCAGGCTGGCAAGGAAGACAGCATCTCCATATCAACATCCTCAACCTTGTCCCGCAGCGGTATATTCAGCAGGCGGAACATAGATCGATTCTCCGATAAGAGTTCCGTATACTCCATCACGTTGTCTGAAAAAGACGTTAAGAAAAGCGACGGAGACGATACCTCGGAGGCCTAG

Protein sequence:

>DPOGS211178-PA
MTEVILPIREQAKVLHLIGEPDFRDVRLRWEYGGNEDEDLQKLLAFQIHYCELQAWGQYRCRTKVVDNFEEERSSRMTIETTTNKVPAGKRGRTYTTYISGLRMATTYSFEVRPVKREARDLADPQSIGSKIIIVPTKGFSARATQCLPHASEVEVSTGPFFGGRIAVEAADGGPERCSLQGNPNSAQDAYILRIHHEECGSEVNETTVATYVIVQENLPILTHSTRRFLVLCTYKPETLTVRAGINLPKTNPGDVLLETKPQGIVLHLIGEPDFRDVRLRWEYGGNEDEDLQKLLAFQIHYCELQAWGQYRCRTKVVDNFEEERSSRMTIETTTNKVPAGKRGRTYTTYISGLRMATTYSFEVRPVKREARDLADPQSIGSKIIIVPTKGSRATQCLPHASEVEVSTGPFFGGRIAVEAADGGPERCSLQGNPNSAQDAYILRIHHEECGSEVNETTVATYVIVQENLPILTHSTRRFLVLCTYKPETLTVRAGINLPKTNPGDVLLETKPQGIVEPYDDNNLQPARLEARREETQQSMFGEIMLVMFLVAAAFGGVAFLIWKVVPQAGKEDSISISTSSTLSRSGIFSRRNIDRFSDKSSVYSITLSEKDVKKSDGDDTSEA-