Monarch geneset OGS2.0

DPOGS210530
Transcript:        DPOGS210530-TA  1170 bp
Protein:           DPOGS210530-PA  389 aa
Genomic position:  DPSCF300186 + 313227-316129
RNAseq coverage:   227x (Rank: top 44%)
Annotation
Heliconius:       HMEL011268        2e-104  47.74%
Bombyx:           BGIBMGA012630-TA  3e-54   35.31%
Drosophila:       CG13813-PA        6e-24   25.32%
EBI UniRef50:     UniRef50_Q0PCR8   4e-54   35.68%  Ecdysteroid 22-phosphate n=1 Tax=Bombyx mori RepID=Q0PCR8_BOMMO
NCBI RefSeq:      NP_001038956.1    8e-55   35.68%  ecdysteroid 22-kinase [Bombyx mori]
NCBI nr blastp:   gi|113865941      2e-53   35.68%  ecdysteroid 22-kinase [Bombyx mori]
NCBI nr blastx:   gi|113865941      5e-52   35.87%  ecdysteroid 22-kinase [Bombyx mori]
Group:
Gene Ontology:    GO:0016772        1.1e-07  transferase activity, transferring phosphorus-containing groups
KEGG pathway:
InterPro domain:  [35-309]   IPR004119  1.6e-52  Protein of unknown function DUF227
                  [121-308]  IPR015897  6.4e-33  CHK kinase-like
                  [101-308]  IPR011009  1.1e-07  Protein kinase-like domain
Orthology group:  MCL18551  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210530-TA
ATGGCCGACGCACAAGAAATATTGAGTAATTTTTTATTTACAGTTCTTGATAGCCGCGGCTACAAGACGAGAGACGTTAACATAAAAGAAATAACAACGAGGGGTGCTAATTATACATCGGCTCTGTTTGCGATACATGTAAAAACCTTAAATAAAGAATCAAAACTTTTCGCAAAAGTTGCAACGATGAGCAAAGAGATGAGAACAGCAGTTAAAGCAGACAAGATGTTTAGAACAGAGGAATTTGTTTACAATAAACTTATACCTGTTTATAATAATCTCCAGAAGGATCTAGATGAAGAACACAAGTTCGTATTTCCCGAATTTTACGGATTTAATAATGAGTTCGGTAAAGAAACAGTCATATTAGAAAATCTAGTAGAACATGGTTATAAGTCTTACAGTCGGTTTAAGTCCATGGATTGGGATCATGGCAGAGTTGGTGTGGAGGCTTTAGCAAGGTTCCACGCTTTGTCGTTTGCACTGAGTAAACGTGATCCAGAAGGTTTCCAAAAAATAGCTACGGACATGGCGTACACCTTTGATAAAGATACTTTAAGTCAAACAGCTATAGATCAATTCAAGGAAATGGTGGAGAAAGCGTTAAGTGTTCTGAAGGAGGAGGAGTATCGTGAGAGAGTCAGAAAATTTTTATTTGAAACGAACTTGTTGGATAAATACAATAAGCCGCTCAGCGCGCCAGTGTTGGCCCACGGTGATTACAGATTAAGCAACTTGTTATTTAAGGAAAAGGGTTACGATCTGAGTGCAGTGGTTGTTGATTACCAAACCCTCCATACCGGGTGTCCTGTGGCAGATCTCTTCTACTTCATATTCATGGGTTCAGATGAACACTTCCGCCGTCTCTATTATGACAAACTAGTAAATCATTACTATACAAGTCTACAAGAGGCGCTCCAGAGACTGGCTGTGGATCCTGTAGAGGTGTATCCGAGAGAGAGCTTTGAAAGTGATTTGAAAGAGTTACTGCCGTACGTACTAATATCAGCAGTGACAGCTCTGCCGCTGGTCGTGGTCGAGTCTGAATCTGCTCCGAGATGGGATTCTGATGAAGCTGACTATTCTACCCTCATTGTTGAACCGGGTTTGTTATACAAGCAGAGGTTTGGAGGCATTGTAGACGACTTGGTGAGATGGGATGCTATATAA

Protein sequence:

>DPOGS210530-PA
MADAQEILSNFLFTVLDSRGYKTRDVNIKEITTRGANYTSALFAIHVKTLNKESKLFAKVATMSKEMRTAVKADKMFRTEEFVYNKLIPVYNNLQKDLDEEHKFVFPEFYGFNNEFGKETVILENLVEHGYKSYSRFKSMDWDHGRVGVEALARFHALSFALSKRDPEGFQKIATDMAYTFDKDTLSQTAIDQFKEMVEKALSVLKEEEYRERVRKFLFETNLLDKYNKPLSAPVLAHGDYRLSNLLFKEKGYDLSAVVVDYQTLHTGCPVADLFYFIFMGSDEHFRRLYYDKLVNHYYTSLQEALQRLAVDPVEVYPRESFESDLKELLPYVLISAVTALPLVVVESESAPRWDSDEADYSTLIVEPGLLYKQRFGGIVDDLVRWDAI-