Monarch geneset OGS2.0

DPOGS204671
TranscriptDPOGS204671-TA1149 bp
ProteinDPOGS204671-PA382 aa
Genomic positionDPSCF300170 - 175467-176704
RNAseq coverage29x (Rank: top 76%)
Annotation
HeliconiusHMEL0146374e-7638.68% 
BombyxBGIBMGA010137-TA2e-4730.37% 
DrosophilaCG13813-PA1e-1625.16% 
EBI UniRef50UniRef50_UPI00020633571e-1424.92%UPI0002063357 related cluster n=3 Tax=unknown RepID=UPI0002063357
NCBI RefSeqXP_002021031.18e-1523.85%GL25122 [Drosophila persimilis]
NCBI nr blastpgi|3287762164e-1424.92%PREDICTED: hypothetical protein LOC100578929 [Apis mellifera]
NCBI nr blastxgi|3071888153e-1625.48%hypothetical protein EAG_07624 [Camponotus floridanus]
Group
Gene OntologyGO:00167722e-13transferase activity, transferring phosphorus-containing groups
KEGG pathway 
InterPro domain[35-301] IPR0041192.9e-40Protein of unknown function DUF227
[120-303] IPR0158971.6e-25CHK kinase-like
[31-293] IPR0110092e-13Protein kinase-like domain
Orthology groupMCL31899 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204671-TA
ATGAATGTCTCAGAAGAATTTCTAAGTGAAGCGGTAAAGCACGTAGCTGATTATTTTTCCTTTGAAACTTGGAGTATGCAAAAGGAAGAGATGAAAGAAACAATTGAAAATTATTTTGGTATCCTCATACCTGTTACTGTGGTGGGAGTCATAAAAGGTGAAAAAATATACAAATCTGTTTATATAAAATTAGCACCGCATAATATTAATGAAAAACTAAAATCAGTCGTCCGAGATCACTATATAGTTGAAGCTTTCACATACACAACTCTCATACCGATGTTTAAGAAAGAATCGGATATTGATATTGTTGTACCCGATTTATATTATGCAAGCTTAGAACCAAATAAAGAGTTTTTAATTCTAGAAAATATGTCGTGCAATGAATATAGAAGATACGTGAACAAATTTCTCGATTCGGATCATATTTACTTAGCGCTGAAAGCATTAGCACGTTTTCACTCAATGTCTACTGTTTTGACTATAAAAGGAAAAGATCCTGCTCACCATTTAATGCATCCATACTCATGCGCCTATCCACCAGGTTATTATGAGTTTATTCAGCTGTCTATGAAAAATCACCTACATCTTTTCGAGAGCACCCCTTATTGGGAATATTTAAAGTCTCTTGTGACCAACCTGTCATCGCATGTTGATGACGCGTCATCAAAAATCAAACATATCGTTTATGGCCATGGTGATTATTGGAGAGAAAATTTTTTGTTCAAATATGAAAATAATAAACCGAATGAAATTTGTCTACTGGATTTTCAAAAATGTAGGAAAACGTCTCCTGCCTATGACTTTCTTATATTACTTCTGACAAATTTAAATTCTAAAGATCGACTTCAAAACTTCCAAAACTTTTTGGAGACCTACGTATCTACATTTCGCTTATGCGTTGATAAGCATACTTTAAAAGTAGATTATACATTAGATGACTTTAATGATGATATTAAAATTGTTGCTCCTATGTGTTTAGCTACTGCCACTATGAGTTTTTCACTGTGGTTAGGTCTAGAAAACGATAATTTTCAAAGCAAATACGTTTGTGATGACGATTCAAGACTGATAACTTTGAAATCCTTTAAAAATATAGTTTGTGACATGTTAAAGGATCTAATAAATCTTAAGTACATTAAAATGTAA

Protein sequence:

>DPOGS204671-PA
MNVSEEFLSEAVKHVADYFSFETWSMQKEEMKETIENYFGILIPVTVVGVIKGEKIYKSVYIKLAPHNINEKLKSVVRDHYIVEAFTYTTLIPMFKKESDIDIVVPDLYYASLEPNKEFLILENMSCNEYRRYVNKFLDSDHIYLALKALARFHSMSTVLTIKGKDPAHHLMHPYSCAYPPGYYEFIQLSMKNHLHLFESTPYWEYLKSLVTNLSSHVDDASSKIKHIVYGHGDYWRENFLFKYENNKPNEICLLDFQKCRKTSPAYDFLILLLTNLNSKDRLQNFQNFLETYVSTFRLCVDKHTLKVDYTLDDFNDDIKIVAPMCLATATMSFSLWLGLENDNFQSKYVCDDDSRLITLKSFKNIVCDMLKDLINLKYIKM-