Monarch geneset OGS2.0

DPOGS211101
Transcript: DPOGS211101-TA (1830 bp)
Protein: DPOGS211101-PA (609 aa)
Genomic position: DPSCF300007 - 949301-951564
RNAseq coverage: 28x (Rank: top 76%)
Annotation
Heliconius: HMEL012463, 1e-130, 66.57%
Bombyx: BGIBMGA006723-TA, 3e-14, 36.23%
Drosophila: CG7158-PA, 8e-11, 30.95%
EBI UniRef50: UniRef50_D6WLY8, 1e-71, 30.67%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WLY8_TRICA
NCBI RefSeq: XP_974718.1, 3e-72, 30.67%, PREDICTED: similar to radial spoke head 10 homolog B (Chlamydomonas)-like [Tribolium castaneum]
NCBI nr blastp: gi|91083311, 5e-71, 30.67%, PREDICTED: similar to radial spoke head 10 homolog B (Chlamydomonas)-like [Tribolium castaneum]
NCBI nr blastx: gi|196008331, 3e-59, 28.26%, hypothetical protein TRIADDRAFT_58077 [Trichoplax adhaerens]
Group
KEGG pathway: ath:AT1G60890, 9e-22
 K00889 (E2.7.1.68, PIP5K) maps->
    Phosphatidylinositol signaling system
    Inositol phosphate metabolism
    Endocytosis
    Regulation of actin cytoskeleton
    Fc gamma R-mediated phagocytosis
Orthology group: MCL17610 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211101-TA
ATGAAGTGTATGCATGGTGAGGGTCGCTATCAGTGGGCAGATGGAACTGTGTACCTAGGTCAGTTTAAGAATAACGAAATAACTGGGAAGGGCACTATTTATTGGAAAGACGACACATGGTACCAGGGCGATTTCTATGGCAACCTTCGCCACGGAAATGGACTATACGTTGATTCTAGACGTCAGCGCTCATATGCCGGAAAATGGCATTATGGCACAAAAGACGGACAAGGAGTTATTTATTATGATGGCAGTTTTAAAAATTCCTACGACGGAGAATGGGCTTTAAACGAACGTCATGGATATGGTTCCCGTGAATATTGCAAGGTTAGCGGTTATAAAGGAGAATGGAACAAGTTTATCAGGGAAGGAAAGGGTATGATGATATGGCCAAATCACGACTTCTACAGAGGTGAATGGAAAAATGGTGTCATGTCTGGTTACGGATTCTATATATGGGAAGCTTATTACAATAATTCCATGTCTTTACCCTCACTGTGTGCATATCGCGGGTTTTGGGAAAAGGGAAAACGAAATGGTTATGGTATATTAAATTTAGGTTTGGCACTGGGCTCTTATTACAAAGGGGAATTCAAAAATAACAAAAAGCACGGTGTTGGAAAATTTGTTACCAATAACGGGCAGATATTACAGCATAAAAAATTATTTATTGATGATAACATGGGTTCATTAAATCGAGATGATGATGAGAGTGATGGCGATGATAAGTGTGGACGCTTGGAAGAACCTTATTTGTTTGATATTTGTAACGATTCCGTTGGTCTGCTTTATCATGTTGAACGTGTTATAAAGAATATTGACAGAAAACAAGAAACTATTAATAGAATTGTATATGATTTTATAGAATTGAATAAAATCCATTCAGTGTCTCGAGGTCCTAAAGATGATATTATTGATGAGTTAAACGGAGACAGCTTTGCCGATTTAATTGATTTTGAAATTAGTTCTTTATATAAGTCATTGCGATGCTATGAAACTGATCTTAAGAATATTTATCATAAGTACGCAACCATTTGTAACACTGAAGAGATTAATTTTAAACCGATTCTTATACGCTTATACTTGTGGCAATTATACTACGATTGTAACTTGCATGATAAAGGGTTGACCTTAGTTGATATCGATAGGCTTTTTCACGAGAACCCCGAATGGTTGTCTCGTAAACCTCACAACCCTTTTGAAAAGATTTACTTTTGGCAATTCCAACACAGCCTAATTTCAGTAGCTAGTAAATTGTATGCCAAAAGACATTTACCAGGTAAGAAACCTGATACTATGCTGGCAAGTGCATTTAGGCTTTTCATGGAAAAGGATATTCTACCTGGCGCTGGTCGAAAAAGAGGGAGACTTGTAGGGGGATGTGGATCCTTTGTTCCTTTGAAAGACTTGTATCATTTATATCAAACTTTAGATGAACCATGCACTGTAAGGACTTTTCTGTGTGCCGCTCGACATGCACCGCACTACGCAGAACAACCAAGTCTTGTTGATTATGACTGCTCCAGTCTTGGTAGAAATGCGTATATTTTTGGTGACGAAATGTCATTTATTATGGAAGACCCCACGGAGATTCCGGAAACTAATGAAAAACCCACCCTAAAGCTATTTAACATCGGAAATTTATCAAGCAAAGCTATAATAAAGATCTTTTCTTTTATATTTCCACAAATATCTGAATTAAACAAAATAATGAATTTGGACGTTGAAATCACGTTTTTTGAATTTTTTCAAGCTCTCATAATGTGTGTTGAGGAAAGTTTACGCCTCTCAGAGCAGGAAGTGTATCGTAATACATCTGCGCTCTATTGA

Protein sequence:

>DPOGS211101-PA
MKCMHGEGRYQWADGTVYLGQFKNNEITGKGTIYWKDDTWYQGDFYGNLRHGNGLYVDSRRQRSYAGKWHYGTKDGQGVIYYDGSFKNSYDGEWALNERHGYGSREYCKVSGYKGEWNKFIREGKGMMIWPNHDFYRGEWKNGVMSGYGFYIWEAYYNNSMSLPSLCAYRGFWEKGKRNGYGILNLGLALGSYYKGEFKNNKKHGVGKFVTNNGQILQHKKLFIDDNMGSLNRDDDESDGDDKCGRLEEPYLFDICNDSVGLLYHVERVIKNIDRKQETINRIVYDFIELNKIHSVSRGPKDDIIDELNGDSFADLIDFEISSLYKSLRCYETDLKNIYHKYATICNTEEINFKPILIRLYLWQLYYDCNLHDKGLTLVDIDRLFHENPEWLSRKPHNPFEKIYFWQFQHSLISVASKLYAKRHLPGKKPDTMLASAFRLFMEKDILPGAGRKRGRLVGGCGSFVPLKDLYHLYQTLDEPCTVRTFLCAARHAPHYAEQPSLVDYDCSSLGRNAYIFGDEMSFIMEDPTEIPETNEKPTLKLFNIGNLSSKAIIKIFSFIFPQISELNKIMNLDVEITFFEFFQALIMCVEESLRLSEQEVYRNTSALY-
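
The transcript and protein records above can be cross-checked against each other: 1830 bp is 610 codons, that is 609 amino acids plus one stop codon, which the protein record writes as a trailing "-". The following is a minimal Python sketch of that check, assuming the standard genetic code applies to this transcript. The function name `translate` and the variable names are illustrative and not part of the database. Only the first and last codons of DPOGS211101-TA are used here to keep the example short.

```python
# Standard genetic code, built in TCAG codon order.
# Assumption: the standard table is the right one for this transcript.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" matches the protein
    record above). The sequence length must be a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 24 nt and last 15 nt of DPOGS211101-TA, copied from the record.
first_codons = "ATGAAGTGTATGCATGGTGAGGGT"
last_codons = "TCTGCGCTCTATTGA"

print(translate(first_codons))  # start of the protein record
print(translate(last_codons))   # end of the protein record, including stop

# Length consistency: codons in the transcript minus the stop codon.
transcript_bp = 1830
print(transcript_bp // 3 - 1)   # number of amino acids expected
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS211101-PA sequence would extend the same check to the whole record.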