Monarch geneset OGS2.0

DPOGS208773
TranscriptDPOGS208773-TA996 bp
ProteinDPOGS208773-PA253 aa
Genomic positionDPSCF300036 - 884802-886797
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0154181e-9168.35% 
BombyxBGIBMGA007633-TA5e-8564.22% 
DrosophilaCG15543-PA2e-0927.42% 
EBI UniRef50UniRef50_D6W8L24e-3138.66%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W8L2_TRICA
NCBI RefSeqXP_974188.18e-3238.66%PREDICTED: similar to HypB, AroK, ADK and rho factor domain containing protein RGD1303144 [Tribolium castaneum]
NCBI nr blastpgi|910767981e-3038.66%PREDICTED: similar to HypB, AroK, ADK and rho factor domain containing protein RGD1303144 [Tribolium castaneum]
NCBI nr blastxgi|910767982e-2840.00%PREDICTED: similar to HypB, AroK, ADK and rho factor domain containing protein RGD1303144 [Tribolium castaneum]
Group
Gene OntologyGO:00055242e-17ATP binding
GO:00061392e-17nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:00192052e-17nucleobase, nucleoside, nucleotide kinase activity
KEGG pathwaygva:HMPREF0424_02724e-12 
 K00939 (E2.7.4.3, adk)maps-> Purine metabolism
InterPro domain[82-207] IPR0008502e-17Adenylate kinase
Orthology groupMCL18328 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208773-TA
ATGCCACCACCTCCGACCGCTCCGGTTGCTGAGGACCCACGAACACCAGCCAGGACTTTTTTTACACAGGACTTCGAAGGCTTAAAGTTTGCCTACAAAGCGACCTTAAAGGAAGTTCATATCGAATCAGAGGACGTTATGGAACACATAGTGACGAAGTGCGTGAACGCAATAGCGGCGAGCGCGGCCGGAGCGCAGGGACCGGGGCAAGGCATCCACGCGACCGGAGCGCCCGGCGTGTACCGCGTCATACTCATGGGGCCCCGCGGCTCGGGCAGGAAGACACAAGCAATGGCTGCCGCTAAACATTTTGGTCTGGTGTATTTAAATTTTGAAGATTTATTTAATGAGGCCCTCGTAAAAAAAGACGACATTGGTGAAAAACTTAGAAAGCACGGCACAAGTGTACAATTGAAAGCTGAGGTAGTGAGACGCCGTATAGCACAAAAGGATTGCATAGACCACGGCTGGATTTTCACCGGATATCCTTCTAATGGCGTCGATTTCGAGTATTTAGACAACATGCCGACTCCGCCGAATCGTATTATAATCCTGAACACTGAGCGGTCGGTATGCAAGGCTCGCACTGAAAGCCGTGGAGTGGACTGGTGCACGGGCCGCGAGGCAGCGCTCGGTTCGGGGCCGCGGGTGCTGAAGCAGCGGGTGCCACTCAGAGCTTTAGAAGCAGAGGTAAGGCCGGAAAGGATATTCTTTTATATCAAAGGTGGCAAACGAGCAGACGGCCTCCTTGATAGAAAGTAGTCACCGTCGCCCGTGGACAGCTGCATCATAAGGAATGTTGAGGATGTGTTGCCGACCTTTGTTGTGGAAGAGGGTGAAGAGGATGGGAAAAGGGAAAGGACGGAAATAGTAGGAAAAGGGCGCGAAGGAATCGATGTCGATATAGAATGGAATTTTATAATAAAATCAAGGTGGAAAGTTTCAAGACGTTCATGCACAAATTTTGTACAGAACTTGTACTTTTATAAAGAATGA

Protein sequence:

>DPOGS208773-PA
MPPPPTAPVAEDPRTPARTFFTQDFEGLKFAYKATLKEVHIESEDVMEHIVTKCVNAIAASAAGAQGPGQGIHATGAPGVYRVILMGPRGSGRKTQAMAAAKHFGLVYLNFEDLFNEALVKKDDIGEKLRKHGTSVQLKAEVVRRRIAQKDCIDHGWIFTGYPSNGVDFEYLDNMPTPPNRIIILNTERSVCKARTESRGVDWCTGREAALGSGPRVLKQRVPLRALEAEVRPERIFFYIKGGKRADGLLDRK-