Monarch geneset OGS2.0

DPOGS214038
Transcript: DPOGS214038-TA, 1449 bp
Protein: DPOGS214038-PA, 482 aa
Genomic position: DPSCF300238 + 225087-228660
RNAseq coverage: 113x (Rank: top 59%)
Annotation
Heliconius: HMEL005072, 1e-96, 89.30%
Bombyx: BGIBMGA008320-TA, 1e-180, 67.74%
Drosophila: cyr-PB, 3e-54, 33.57%
EBI UniRef50: UniRef50_D6WTK5, 3e-74, 39.03%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WTK5_TRICA
NCBI RefSeq: XP_975004.1, 2e-73, 39.30%, PREDICTED: similar to RE22259p [Tribolium castaneum]
NCBI nr blastp: gi|270010742, 9e-74, 39.03%, hypothetical protein TcasGA2_TC010196 [Tribolium castaneum]
NCBI nr blastx: gi|270010742, 1e-75, 39.66%, hypothetical protein TcasGA2_TC010196 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [27-316] IPR001507, 3.3e-25, Zona pellucida sperm-binding protein
Orthology group: MCL18869, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214038-TA
ATGTGGCGTGTGATCCTGCTGACGGTACTCCTGTGTGGAGTACAGAGTCATCCTAATAAGGTCCGCAATGTGACGACATCGTGTGACAAAGGCTCAATCACTATCAATATAGACATGGAGAAGCCATTCAAGGGTCTAGTTTTCAGTAAGGACTTCTCAAGAGAATGTCGGATTTTAGGTCAAATGCAGACGAATGTTTCATTGCATTTGCCTTCAAACACGTGCGGAGTAAGGACATCAATACAGAACAACACGACAAAGAGATACGATGAACTAAATCTGTACTACACGGTGGAGGTAGTAGTTCAAATGGATAAGATGCTTCAACAGTCATCAGACCAGGAGATAATTGTTAGGTGTAAATTGCAACCTCGTGCGGTGCGGATAAACAGCTCAGCGCTTGAAGGCGTCATTAAATCTAGACTGCGAGAAATTACCGGCCATGAAGGGAAACGGATGAGAACTGGACGGAATAGAAAGGGTTGGGATAGGATGGTTGAGGTGGAACAGCAAGAGCTGTTAGAAGCAGCTCGAGCCTGGATGCAGCTGGCGCCAGACACGGTGGAGGTCGGACAACCCACGGAGCTCATGATACAGACATGTGATGTGGGCGTCGGTCTCCGTGTAACAAACTGCATAGCGCACGATGGTTTAGGAGAGGCCTCGCAGAAGTTACTCGACGAAGCTGGCTGTCCCATTGACGAAACTATATTCAACTCCCCAACCGTGCATCAACACAGACGAGATGAGATAGACTTTACCGACAACGAACAGAGCCTAGAGAACTCGAGGGTAGCGAGTACTGACTCTGTCATCAAAAATATGATGACCTTCCAACACGCTGTGACCACGTTCGCAGCGTTCAAGTTTCCCGACAGGGCGAAATTACATCTCTCCTGCGGCATAGAACTCTGTAAGGGCGTCTGCCCCAAGGTCGATTGTAAGGCCCTTCAAAAACCCCAACAGACGAAGGACGGGCTGGTGAGGAAGGCCCGTCTGGATAAGGACGCTAAGGGTGTGGTGATAGAAAGACTAGAAGTCTACAATAGTATAGAAGTCCTGGCACCGAACATAGAACTAGAAGACGAGGCTTCTATAAGAGGTTCCAGAAGGGTAGAAGAAGAAGATGGGCTGAAAGGTTTTTCCCCCGGCGACAAAACAATATGCCTATCTCCTGGGAAAATGGCCTTAGCCTTTTGTATACTTGGCATCATTTTCCTATGTGCTATTGCTGTAGCTTTCGCGTCCTTAGTGCGAGCGAGACGGAGAACACCTCGAGAGCCGGTGAATACGTCTCTCTCTTTCTACACCGGTAGCAAGAGTCTCTTCTCTTCTAGTGGCAGCAGTAGTTCCGGTTTAAGTGGCAGCAAGCTTCTTCTAACCGATAGTCCATATTTAGACCACCATTCGTCTTCCAGTAACAATTGGCCATATGCTCGGGCATTTTAA

Protein sequence:

>DPOGS214038-PA
MWRVILLTVLLCGVQSHPNKVRNVTTSCDKGSITINIDMEKPFKGLVFSKDFSRECRILGQMQTNVSLHLPSNTCGVRTSIQNNTTKRYDELNLYYTVEVVVQMDKMLQQSSDQEIIVRCKLQPRAVRINSSALEGVIKSRLREITGHEGKRMRTGRNRKGWDRMVEVEQQELLEAARAWMQLAPDTVEVGQPTELMIQTCDVGVGLRVTNCIAHDGLGEASQKLLDEAGCPIDETIFNSPTVHQHRRDEIDFTDNEQSLENSRVASTDSVIKNMMTFQHAVTTFAAFKFPDRAKLHLSCGIELCKGVCPKVDCKALQKPQQTKDGLVRKARLDKDAKGVVIERLEVYNSIEVLAPNIELEDEASIRGSRRVEEEDGLKGFSPGDKTICLSPGKMALAFCILGIIFLCAIAVAFASLVRARRRTPREPVNTSLSFYTGSKSLFSSSGSSSSGLSGSKLLLTDSPYLDHHSSSSNNWPYARAF-