Monarch geneset OGS2.0

DPOGS206978
TranscriptDPOGS206978-TA1515 bp
ProteinDPOGS206978-PA504 aa
Genomic positionDPSCF300001 + 374456-378139
RNAseq coverage132x (Rank: top 56%)
Annotation
HeliconiusHMEL0021170.068.71% 
BombyxBGIBMGA009947-TA2e-9873.97% 
Drosophilanesd-PA5e-4127.73% 
EBI UniRef50UniRef50_D6WKG02e-4327.66%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKG0_TRICA
NCBI RefSeqXP_001605544.13e-4526.76%PREDICTED: similar to LD13710p [Nasonia vitripennis]
NCBI nr blastpgi|3838645827e-4528.32%PREDICTED: SHC SH2 domain-binding protein 1 homolog B-like [Megachile rotundata]
NCBI nr blastxgi|3287868492e-4828.09%PREDICTED: SHC SH2 domain-binding protein 1 homolog B-like [Apis mellifera]
Group
KEGG pathway 
InterPro domain[321-487] IPR0110505.8e-09Pectin lyase fold/virulence factor
[363-467] IPR0123344.3e-06Pectin lyase fold
Orthology groupMCL15650 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206978-TA
ATGGTTGCTGTGTTTACGTTCACGAGAAGCGACAACGATATTCTTCAGGAATTACTGCGTGTGTTCAGTTCTGGGCGTTGCACAGCTAGAGAGCAATGGAGTATGCAGGCCGAACTATTAGTAGAGCCTGTTGGATGGGATGCTCTGTGGAAATTGTCAAAGGATTTCTGTAAAAAATTTGAAGTGAGATATCCCTGTGTTGCATACATCACAGTAATGTCAGTGAACTTTGAAGATCTATCTGCCAATGTGGATGTGCTGAGGGTTCAGCATGAGGCTGTGTCTCTACCAGAAAGTATTGTTAGTGTGCCATTGATAGAATTGTGGCCAACAATTAAACAGAGGGAGCAGTGCGTTAATGCAGCGAGCACAGCCGAGTTTATAGATCTTATGAGATTCTTTTACGATAACATATGGATGCCATGGGACGATCAGGATGATAAAGTCCTTCTACCAAACACCATAGAGGATCGTATGAATCTGTGGATGGAGTTGCACAATGGCACCATACCAAACTACATTGCGAGATCTATCACCTTGCTGAGAACTAGTGCTATCAATGCTCATCAGAAGTTACAGGAACTCGATTCATCTCTATGCGAAGGAGATTTTGCTGACGATGATGATTCCCTGTTACCACCAAACTACATATCGCTCTGTGCTGAATTAAATGCGAGATTAGACGGGCTGATGTCAAAATGGACATTATATGAAAATCCTCTCATCAGGGAGCAATATTTGGTTAAAACTATGAAAAAGTATCAAAGGAATAAGAGCAAAAAGAATGTGATAGCTCTGTGGCAGGGAGGTGATATAGCGGAATTCAAGAATATCACCAAATTTCTAGAAACGAGAGTCACGTATGATCACACACTGACTATAACAATGTCAGCTGAAGAAGCATTATCATTAGAACCAAACGAAGTAGTTGTGTGCAGCAAACAGTATGAGATACCTGAAATATCATTGTCGCAAATAACTTTGTGTAGTATAGGCGGAGCTACTTTAAAGGCATCAGATATGAGATCATGTTTATTAATGTTGAGTGATGTATGCCAGATACAAGACATGACGCTACACTGTTCTTCTGTTAATACGGTCATAGTAATGCTTTCTGGTACGCTACATGTCAAAAATTGTATGCTTCTAGACGATTCAAGTAATTTCCAGAGTGATTTTGCTCAAGGTATTGTGGCAATGTCTGGTGCGAAAGTTATATTAGAAGATTGTACATTTGAAAATTTCTACTCTGGTATTGTAATTCATAAAGGAGCTCAGATGGAACTTCGAAATTGTTTAATAAAGAAGTGCGGTGTTGGCATACAAATGTATTCCGGGGCTCAGGTTAAGTTGGACGGTGTCATAGTTGAGGAATGTACAGAACAGTGCATACGATGTGAAATGGAGAATGGAATTGTTAAGACTGAAATGGATGGCCTCGAAATGATAAATTGCAAGATTGGTTCCGGTGATCTACAGAAAGAAATCTATGTTACACAAGATGTCAATATGTAA

Protein sequence:

>DPOGS206978-PA
MVAVFTFTRSDNDILQELLRVFSSGRCTAREQWSMQAELLVEPVGWDALWKLSKDFCKKFEVRYPCVAYITVMSVNFEDLSANVDVLRVQHEAVSLPESIVSVPLIELWPTIKQREQCVNAASTAEFIDLMRFFYDNIWMPWDDQDDKVLLPNTIEDRMNLWMELHNGTIPNYIARSITLLRTSAINAHQKLQELDSSLCEGDFADDDDSLLPPNYISLCAELNARLDGLMSKWTLYENPLIREQYLVKTMKKYQRNKSKKNVIALWQGGDIAEFKNITKFLETRVTYDHTLTITMSAEEALSLEPNEVVVCSKQYEIPEISLSQITLCSIGGATLKASDMRSCLLMLSDVCQIQDMTLHCSSVNTVIVMLSGTLHVKNCMLLDDSSNFQSDFAQGIVAMSGAKVILEDCTFENFYSGIVIHKGAQMELRNCLIKKCGVGIQMYSGAQVKLDGVIVEECTEQCIRCEMENGIVKTEMDGLEMINCKIGSGDLQKEIYVTQDVNM-