Monarch geneset OGS2.0

DPOGS207956
TranscriptDPOGS207956-TA1182 bp
ProteinDPOGS207956-PA393 aa
Genomic positionDPSCF300090 - 11216-26385
RNAseq coverage221x (Rank: top 45%)
Annotation
HeliconiusHMEL0045140.083.51% 
BombyxBGIBMGA000392-TA3e-9457.05% 
DrosophilaCG12814-PC1e-10749.25% 
EBI UniRef50UniRef50_Q0KI925e-9547.87%CG12814, isoform B n=21 Tax=Neoptera RepID=Q0KI92_DROME
NCBI RefSeqXP_971306.12e-12757.07%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|3287818471e-12756.27%PREDICTED: hypothetical protein LOC551765 [Apis mellifera]
NCBI nr blastxgi|3838515958e-12555.75%PREDICTED: uncharacterized protein LOC100883430 [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[35-252] IPR0015071.1e-20Zona pellucida sperm-binding protein
Orthology groupMCL16695 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207956-TA
ATGTGTCTGGTTCGAAGATGTGCCTTTGTCCTCACGCTGCTGGCGTCTGCGCTTGCGCAGAATACAGAGAGCAGTGGCGATTTTACACCCGTTGTCAGTGCAACCTGCAAGAATGGTGCTATGTCGATCAGGATTAACTTCAGTCAACCGTTCAATGGAGTTGCACATGCGAGAGAGTTCAGGACTCTGGCATGCATGGCCACAGGAAATGGATCTGAAAGTCTAACTTTCGACATCAACCTAACAGCTGCCCAGGGATCCCCAGATTACTGTGGAGTGTTCTGGAATAATCGAACTGATGAACGCTCGCTGCCTTTGGCCGTCCGAGTCCACAGGACGTTGGAGTTGGCTGACGACAAGTTTTACGTGATCACGTGTGGGAAGGCCGGCTTCAGAAACGCAAGAAACGAGACGTCGCTTGTGTCTTTGCGGATGCTGAACTCAGATGGACGTAAAGTTTTGAATGCTGCCTTCGGCTTACCTTACACACTGCGCGCGGAAATGAGCAGATCCGATGGAGCTCACGGCATCCGTCTCAGGAATTGCTTTGCATTCAATATGCGAAACAACACAGTGGACCTCCTGGATAAAAGAGGATGTCCCGTTAAAGAGCAGTCGCTGGCAGTGAAAATGGAAAACGGTGCCGCTGAGCTCGCGATAGCTTCCATGTTTAGATTTCCGGATTCCTCACAAGTTAATTTTCAATGCGAGATCGGCATTTGTAAAGGCAGTTGCACACCATCCGACTGCACGACAGAAGGCCGCACGGAGCCAGCAGACGAGGAAGGTGCCGTCACCGCCTCAACAGGAATATTTGTATTAGATCCAAACGACAGTGCCGTGGCAGCTATGGCGTGTACTGAGAGCGGCGTACGTCCTCTCTGGCTGTTGTATCTGGCGATCGCGTTGGGGGTCATGTTCCTCGTGATGTTACTCATTAACTGCTTCCTTTGCACTGCCATGACATGTTCCTGCGCTAGAACTGATGTTATAGAAAAGGATCCTTCAGTGGTCGAGGACTATGACCCTTATCGCAGCTGGCACGGGAGCCAATACGGAAGCCGGTACCCCTCATCCACTATACACTCCGCGAGGTCAGTATCAGACAACAGCGACCACTACGCCATAGTTCAGTCGAGGCCGGGGAGTCGACACTCCGGTATGCACCGAAATAGACACTAA

Protein sequence:

>DPOGS207956-PA
MCLVRRCAFVLTLLASALAQNTESSGDFTPVVSATCKNGAMSIRINFSQPFNGVAHAREFRTLACMATGNGSESLTFDINLTAAQGSPDYCGVFWNNRTDERSLPLAVRVHRTLELADDKFYVITCGKAGFRNARNETSLVSLRMLNSDGRKVLNAAFGLPYTLRAEMSRSDGAHGIRLRNCFAFNMRNNTVDLLDKRGCPVKEQSLAVKMENGAAELAIASMFRFPDSSQVNFQCEIGICKGSCTPSDCTTEGRTEPADEEGAVTASTGIFVLDPNDSAVAAMACTESGVRPLWLLYLAIALGVMFLVMLLINCFLCTAMTCSCARTDVIEKDPSVVEDYDPYRSWHGSQYGSRYPSSTIHSARSVSDNSDHYAIVQSRPGSRHSGMHRNRH-