Monarch geneset OGS2.0

DPOGS210331
TranscriptDPOGS210331-TA1050 bp
ProteinDPOGS210331-PA349 aa
Genomic positionDPSCF300025 - 590437-591574
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0138457e-8847.20% 
BombyxBGIBMGA011953-TA4e-8442.09% 
DrosophilaCG13813-PA2e-2021.59% 
EBI UniRef50UniRef50_Q0PCR81e-3429.24%Ecdysteroid 22-phosphate n=1 Tax=Bombyx mori RepID=Q0PCR8_BOMMO
NCBI RefSeqNP_001038956.13e-3529.24%ecdysteroid 22-kinase [Bombyx mori]
NCBI nr blastpgi|1138659415e-3429.24%ecdysteroid 22-kinase [Bombyx mori]
NCBI nr blastxgi|1138659416e-3229.24%ecdysteroid 22-kinase [Bombyx mori]
Group
Gene OntologyGO:00167721.2e-08transferase activity, transferring phosphorus-containing groups
KEGG pathway 
InterPro domain[1-270] IPR0041193.2e-41Protein of unknown function DUF227
[85-271] IPR0158975.3e-16CHK kinase-like
[44-302] IPR0110091.2e-08Protein kinase-like domain
Orthology groupMCL19950 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210331-TA
ATGGGCAGTCTCTACGAAATAGACATCAAGGGTGTGGTTGAAGATGAAAAGAAAGAAACTAATATATTCGCCAAAGTTATATTAAATCCAGAAGTTACAACTACCGTTTGTATTGAGGATTTGTATTTGATGGAATTGTTTGTATACAATGAGCTGTCGAAAATATTCCACGATCTGCAAGAAGAGGCTAAAGTACCAATCACAGGACGATTCAACATCGCCAGGTCGTATCAACAGACTAATTCTAATATGATAGTATTAGAGAACTTGAACAGAAAAGGTTTCAAGGTGTACAATAGATTGGATGTGATGCCTTTGAAGTTTGCCGAATTATGTATTCAACAGCTGGCAAGATTCCACGGACTTTCCTTCGTCATAGAGAAAAAAATGCCAGAGTTCTTTAATAAGAGAATTAAAACACTAAAACATCCAGTACTTTACAACGATGAATTAGAAAAATTAGTGAAGCAAAACTCCGCTTACACAATCAATTTATTTGAGGGTGACGAAAAAATGAGAATTGAGAAATTCGCATGCTCTATATTACAAAAGTTTAAGAAATATAACTTGGACAGAAATTGTGTATGCTGCATGACGCACGGCGATTTCAGAATAAGCAATGTAATGGTACGGGAAAAAGACGGTGAGGCGGTTGAAGTAGTCCCAGTGGACTATCAGTTACTAGACTGCGGTTGTCCTATCAGAGACTTTCTATATTTTATATTCTCTGGCACTGACCAGAAGTTCAGGCGCCAACACATGAACCACCTTAAAGATCTCTATTACGACACTATGACAAATTTCCTAAACTATTTTGATATAGCAGCCGAAGATGTATTTCCGAGGAAGGAGTTCGAACAGATGTATAGGGAACGACTGGATTATGGTTTGTTGATGATGATTTTTGGCGCTCCTTTCCTGTTTGGTGGTGTGGAAGGTCATGATGTAGAGAACACGTGTTTGACCGACGTATCATTCGAGGCCGGCACACTCTTTGAAGAACGTGTTAGAGGCTTGATAGATGACTTCATTGAATGGGGTTTTTTGTAA

Protein sequence:

>DPOGS210331-PA
MGSLYEIDIKGVVEDEKKETNIFAKVILNPEVTTTVCIEDLYLMELFVYNELSKIFHDLQEEAKVPITGRFNIARSYQQTNSNMIVLENLNRKGFKVYNRLDVMPLKFAELCIQQLARFHGLSFVIEKKMPEFFNKRIKTLKHPVLYNDELEKLVKQNSAYTINLFEGDEKMRIEKFACSILQKFKKYNLDRNCVCCMTHGDFRISNVMVREKDGEAVEVVPVDYQLLDCGCPIRDFLYFIFSGTDQKFRRQHMNHLKDLYYDTMTNFLNYFDIAAEDVFPRKEFEQMYRERLDYGLLMMIFGAPFLFGGVEGHDVENTCLTDVSFEAGTLFEERVRGLIDDFIEWGFL-