Monarch geneset OGS2.0

DPOGS201720
TranscriptDPOGS201720-TA3417 bp
ProteinDPOGS201720-PA1019 aa
Genomic positionDPSCF300269 - 172488-191129
RNAseq coverage188x (Rank: top 48%)
Annotation
HeliconiusHMEL0158590.064.36% 
BombyxBGIBMGA014470-TA0.054.93% 
Drosophilapwn-PA3e-12144.28% 
EBI UniRef50UniRef50_F4X7213e-13636.12%63 kDa sperm flagellar membrane protein n=6 Tax=Formicidae RepID=F4X721_ACREC
NCBI RefSeqXP_969198.11e-13751.74%PREDICTED: similar to pawn CG11101-PA [Tribolium castaneum]
NCBI nr blastpgi|910868652e-13651.74%PREDICTED: similar to pawn CG11101-PA [Tribolium castaneum]
NCBI nr blastxgi|910868650.041.64%PREDICTED: similar to pawn CG11101-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055091.6e-05calcium ion binding
KEGG pathway 
InterPro domain[865-896] IPR0130919.7e-08EGF calcium-binding
Orthology groupMCL14916 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201720-TA
ATGGCCGAGAGAAGATGGCGGATGGGTCTCGGCGCGCTGTTCGCCATTTTAATACAAGTCATTAATGGTTACGAAGAGACCTTCACTATATCACAACTAACACCGAAGTCAATTCTAGCTGAGGGTCCTCACGACGTACTGCTGCAAACGAGTCATGTAGAAAACGTAGGATTTAAATTAGAATCAAGTGCCAAATATGACGACATCGATGTTGAACCTGATGATATATTAACTGATTTGGAGACCGAGAACCATATCATAGTTGAGAGATCAGTCGAACCGAGCAGAGGTTCCAGATCCATTAACGATCAGGATGTACCATCGTCGAATAACAACGAAATTGACAACGAATTAGAAATACACAACAAGAGGAGGTACTATGACATCATACCAACGAAAACTCTCGATTACGTTAGGGATGATGAGGGTATGCTAATGTCCTCCGGGTTCGGGGACAAGTTAGAATTTAAATTTCCGGGGGAAGGGAAGAGGGCTCCGAAGGGGCGTGCACTAGCTCCTACGGGGCCCAAAATCAGGCCTACTGCGGTGCAACCACAACTGTTCTCAACAGATACCCTTAAAGACCAGGGGAGACAGAACAGTGAAATACAGAATATAATTACCGGTATCGTAAAGTTGTTAAATGGGAACGTCAATGTGCAGGCCAACACGCAGTTACTGAACGGCCGTCCAAGACCTATGGCGTCCAGGATAAATAACCGTGGACCACCAGATTTCGACGAACCTCTGACACCACCGCCGCTCGCTTATCATCCAACAAAAATACCCCCACCTTACCCCTTTGATAGGCCTATCGGTGTTAATTTAGCTGATCGACTCCCACCGATCAGTAATCGTCCAGGGTTTCATAGACCCATGCCTCCGTGGCAGCGTAGACCTAGACCTCCGAATTCCAATCGGAGACCTAACCCAAACCTTCCCGTTTATAAGCCAACACTGATCCCTCCACCGGATATGACGTATATAAACGAAAAAGAAGACGAGAGTTACAACGAAAATGAATCTCATTTTGACATTCCAAACCACGAGGATAACATTCCGTTCATAACGCTACCGGATACAACTACTGAGATAAGAAATATGACAGAGGAAATTCCAACTACCACCGACACTACGACATCTACAACAGCGTCAACTACTTCTAGGACGACAACTACCACAAGTACTACGACAACTACAACAACTACAACAACTACGACAACACCGAAAACAACGACCGAGTTGACCACCGAGAAGACTACTACAGAAAAGGTCAAGATCGAAAAGCCTGTCAAAGCAGACAAAGAAAAAAAGAGAGACCAATTAGGTCCCGATAAGTTAAAAGACAAAATCAAAGTCACAGAAGATAAACCGACAGAGAGGCCCATCAAGAATGATGCCATCACACCGGTGGCTATAGAACCCTCAGTCAGCGAGAGTCCGACGCCAGTTACACAGTCCACTTCCTCTCCGACCCCTACCGAGGGGTTGATGACGCACGCGCCAACAGTCAGCGAAACCACGAAGATACAAAATATAGAAACCTCCTCCATCAACTCTCAGCCGTTGCCAACATCCCAAGGTATCCCGTACAAGCCGTATCCCCGGCCAGGGATAGTTCTAGACGACACAGATTTCAAGCCCCACAAGTCCAGACACAGGCCTGACGCGTCCGTCATCACAGCCGACAGACTGCCGGGGTACGGGGAGATATTTGACGTGACAGTCTCCGCTATACAGGGCCCAGGGGAAAAAGCTGGACCAGTGAACATACAGACGCATGTACACCACGGGTCACAGTACGCGGACGATATAATAGTGTCAGGGTCCGGGCAGCACAGCTTTGTCTCCATAGACGGCAAGAGGACCTACCTGAACCTCTTCGACACCGGCAGCATCACGCCCACCAGCGTGCAGCCGGCCCCACAGACCAGCCTCCCGAAGACCCACGTCCCGTCGTTGGGGACGGGGGTGGCGATACCAGCCGATGATGTACCGGCTCCGCCGGCACCTCCCCCGAGGAGGCGACCCCAGACACCCTATAGGAGACCCCAGCCGACTGTACGCATAGACACCTGCATCGTGGGCGACGACTCCACCTGCGACCAGAGCCAGAACGAGAGGTGCAGGACGGAGGCTGGTGTGTCCAGCTGCCAGTGCCGCGCGGGCACGGCTCGCCGCGTGCGCCACTCGCCCTGCCGCCGCGCCGTGTCGCTGTCGGTGTCGCTGAGGGTGGACCGCCTCTACGACCGCCAGATCTCGTGGGACGAGAAGCTGTCTGACAAGGAGTCGGAGCCGTACCAGCAGTTGAGCTACGAAGCCGTCAAAGCGATCGACTCCGCGTTCTCTATGACTCCGTTCTCCGATGACTTCGTGAGCGGCTCTGTGGACTCCATCGTGCGAGGTGGCCCGCAGCACCCGGGGGTGTACGTTAACTTCACCGTATTGCTGTCCGAGACCCCCGAGACCGTCCGTCCGGCCGTTGCGGGTGACATCCATCAGCACCTGGTGGGTGTCATCCGCCGCCGGTCCAACAACGTGGGTGCTTCCGCCCTGTGGGTGACGCCCGAGGGCAGCGTGATAAATGTCCGAGATGTAGACGAGTGCTCGTCCCCTGACCTCAACGACTGTCACACACTGGCGACCTGCACCAACACCTGGGGGGCTTTCAAATGCACGTGTCCCAACACGACCCTGGACCCCGCGCCGGTGGCCAGCCGGGCGGGCCGCGAGTGCCGCTCGTGCGCCGCCTCGCACTGCAGCGACCGGGGGCTCTGCCACTACAACAACGGACAGCCTTACTGCACGTGTTCATCTGGTTACTACGGCTCCACTTGTGAGATGGACGGCGAGGTCATAGGGGTCTCCGTGGGGGCTTCGCTGGCGGCCGCGCTCGTCATAGCCATCACACTGGCAGCCCTGCTCAGCTGGAGTACTCTTTGCTGGTGGACTTTCATAACCCGTCCCCAGCTGACCTCTACTCGTAAGGAATGTTCCTCACTAATGTCAAATCTATTCGTCACGGTCCCAGGTTATCGTGATGAGTCGCTATACCGGTAGCCGTGTCTCATATATATACCAGCTAGTTATGCAGAATGCTTTTGTTCCAGGATGGGCATGCATGGAGTTCACACGGGAACTCTCAACACCATGACGTCACGGGCTAACACAGCCTCACACATATACGGTTACACAAATCACCTGGCATCCGAGTCCAGCTCGGAGGCGTCCAGTCACGTGCAGGAGAGAGCCGACCTTCTGGTGCCCAGGCCCAAGTCACGAGCCAGGAGTATGCATAATCAGACGGGCATCTACTATGATGTGGAATATGAGAACGCTGAACCCATATATGGAACCAAAGGCATCCCGCTGTCCACCTACACCGTCAGCAGGGGACCGACCTTCTACAGACAATAA

Protein sequence:

>DPOGS201720-PA
MAERRWRMGLGALFAILIQVINGYEETFTISQLTPKSILAEGPHDVLLQTSHVENVGFKLESSAKYDDIDVEPDDILTDLETENHIIVERSVEPSRGSRSINDQDVPSSNNNEIDNELEIHNKRRYYDIIPTKTLDYVRDDEGMLMSSGFGDKLEFKFPGEGKRAPKGRALAPTGPKIRPTAVQPQLFSTDTLKDQGRQNSEIQNIITGIVKLLNGNVNVQANTQLLNGRPRPMASRINNRGPPDFDEPLTPPPLAYHPTKIPPPYPFDRPIGVNLADRLPPISNRPGFHRPMPPWQRRPRPPNSNRRPNPNLPVYKPTLIPPPDMTYINEKEDESYNENESHFDIPNHEDNIPFITLPDTTTEIRNMTEEIPTTTDTTTSTTASTTSRTTTTTSTTTTTTTTTTTTTPKTTTELTTEKTTTEKVKIEKPVKADKEKKRDQLGPDKLKDKIKVTEDKPTERPIKNDAITPVAIEPSVSESPTPVTQSTSSPTPTEGLMTHAPTVSETTKIQNIETSSINSQPLPTSQGIPYKPYPRPGIVLDDTDFKPHKSRHRPDASVITADRLPGYGEIFDVTVSAIQGPGEKAGPVNIQTHVHHGSQYADDIIVSGSGQHSFVSIDGKRTYLNLFDTGSITPTSVQPAPQTSLPKTHVPSLGTGVAIPADDVPAPPAPPPRRRPQTPYRRPQPTVRIDTCIVGDDSTCDQSQNERCRTEAGVSSCQCRAGTARRVRHSPCRRAVSLSVSLRVDRLYDRQISWDEKLSDKESEPYQQLSYEAVKAIDSAFSMTPFSDDFVSGSVDSIVRGGPQHPGVYVNFTVLLSETPETVRPAVAGDIHQHLVGVIRRRSNNVGASALWVTPEGSVINVRDVDECSSPDLNDCHTLATCTNTWGAFKCTCPNTTLDPAPVASRAGRECRSCAASHCSDRGLCHYNNGQPYCTCSSGYYGSTCEMDGEVIGVSVGASLAAALVIAITLAALLSWSTLCWWTFITRPQLTSTRKECSSLMSNLFVTVPGYRDESLYR-