Monarch geneset OGS2.0

DPOGS207380
Transcript: DPOGS207380-TA (1398 bp)
Protein: DPOGS207380-PA (465 aa)
Genomic position: DPSCF300267, + strand, 23085-29107
RNAseq coverage: 8x (Rank: top 86%)
Annotation
Heliconius: HMEL012232 | E-value 2e-86 | identity 52.06%
Bombyx: BGIBMGA008877-TA | E-value 1e-81 | identity 68.66%
Drosophila: no hit listed
EBI UniRef50: UniRef50_UPI00005840EA | E-value 1e-15 | identity 29.50% | UPI00005840EA related cluster n=1 Tax=unknown RepID=UPI00005840EA
NCBI RefSeq: XP_001193753.1 | E-value 2e-16 | identity 29.50% | PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
NCBI nr blastp: gi|383860953 | E-value 1e-15 | identity 30.29% | PREDICTED: uncharacterized protein LOC100877632 [Megachile rotundata]
NCBI nr blastx: gi|72007065 | E-value 4e-16 | identity 28.77% | PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
Group
Gene Ontology: GO:0005515 | E-value 7.7e-07 | protein binding
KEGG pathway: tva:TVAG_441520 | E-value 3e-07
    K00942 (E2.7.4.8, gmk), maps to Purine metabolism
InterPro domain: [91-206] IPR008144 | E-value 7.7e-07 | Guanylate kinase
Orthology group: MCL25543 | Lepidoptera specific

Nucleotide sequence:

>DPOGS207380-TA
ATGCGGCGCCTGGGGTTGATTGACCTGCGCTCGAATCCCATTTGTAATGTTCCGGGTTACAAAGATGTGGTCATTAACACATTTCCAGTACTACTAAATTTAGACAACGAAGATTTGGATCCCATCGAACAAAGAACGCTAAAAATGAATATGACACCAGACATACAAACGTTTGCTTCGCGACGTCTTTCACGTCTCCTGTACATTGAACAGTTGTCCAGAGCTCGAGTATCTATGTACACTCCGCCAGCAGACAGCACAGATGTACCAATTGTTATACTTGTGGGATACGAAGCTGTTGGTAAAGGAACACTTGCCCGTCGTCTTGCGACAGAATGCAGTTCAAACGTTGAACTAGCTCTACAACATACAACTGCTTTCTATCACCAAATAGATAACTACATAGTTGTGTCTCGGAAAAATTTTGATGAAATGCTGCTGGATGGCGAATTTCTTACTTATAGTGAAATGGACGGCGAATCATATGGACTAACTCGTGAACAAGCGTTTGTGAAAAATGGTAAAGTAAAAATCGTCACAATGGATCTTATCGGTGCTTTAATGCTGCAACTAAGGGGATATCGCCCTTATTTGATACTAACATTTTGTCATGACAAACGATCACTGGCTCTTCGACAGCAAGAACGTAAGATAATACGAAACGCGGCATACGAGAAACGCTTGTCTATGGATCCGCCTACGGAATTGTCTACCTTGCAAGTTTTGCTGTCGGGAAGAATTATTATCAAAGGGTTGCTTAATGAAATTGTTCAGAATTTTTTAGAATACTCAGAATCCTCTGAATTCTTACTTGAGTCTCAATGTTCACTCTTGGACGTTGAATTAAGACGAAAACATAAAGATGAAAAAAAGGTTAGTTCCACCTCGCTTCCTAGAGAAGAAGTCAATAAAAGTCCTGCAGATTCAAGTTTATTTTCTTTATACAACCAACCCCAAGGCATGGATGAATATGTAAGCGATGTTTACGGACAAGTTTTATTCCATGAAAAGCTATCGAAAAGATCAATTGATACTCGATCACTTCCAGCAAAAGATCCAGGTAGCACCCATGGTCGGCACCATGCATCTTCTGCGTGGAAAGTTGGCAGCAGAAAGTCGTCAAAGTCCGTAACATTCACATCAGGAGTTGCTTATGAAGAACGTCAAGAACTCCATAATGTGGAAACTGATCCATCTGGAATGTTGGTCGAGACTCCACGTGAAGCAGACGCTGAACCACTCCGGCGAGAAACAAAAGTAGATGATCAGCAGTTTTTAATTAAACAACTAGAAAACTATAAAGAGTTGCCGTCGGAAACTTTTACAACAAATATAAGAGATGATTATGAAGAAATACATCATAACTGTCCAGGGCTATTTTGGGATACTGTATTTTGA

Protein sequence:

>DPOGS207380-PA
MRRLGLIDLRSNPICNVPGYKDVVINTFPVLLNLDNEDLDPIEQRTLKMNMTPDIQTFASRRLSRLLYIEQLSRARVSMYTPPADSTDVPIVILVGYEAVGKGTLARRLATECSSNVELALQHTTAFYHQIDNYIVVSRKNFDEMLLDGEFLTYSEMDGESYGLTREQAFVKNGKVKIVTMDLIGALMLQLRGYRPYLILTFCHDKRSLALRQQERKIIRNAAYEKRLSMDPPTELSTLQVLLSGRIIIKGLLNEIVQNFLEYSESSEFLLESQCSLLDVELRRKHKDEKKVSSTSLPREEVNKSPADSSLFSLYNQPQGMDEYVSDVYGQVLFHEKLSKRSIDTRSLPAKDPGSTHGRHHASSAWKVGSRKSSKSVTFTSGVAYEERQELHNVETDPSGMLVETPREADAEPLRRETKVDDQQFLIKQLENYKELPSETFTTNIRDDYEEIHHNCPGLFWDTVF-