Monarch geneset OGS2.0

DPOGS207601
TranscriptDPOGS207601-TA981 bp
ProteinDPOGS207601-PA326 aa
Genomic positionDPSCF300072 + 1060505-1064377
RNAseq coverage705x (Rank: top 18%)
Annotation
HeliconiusHMEL0164023e-13680.62% 
BombyxBGIBMGA009986-TA8e-15184.18% 
DrosophilaCG12567-PA6e-10860.51% 
EBI UniRef50UniRef50_F4WJN32e-11161.76%Uncharacterized protein YJR142W n=11 Tax=Pancrustacea RepID=F4WJN3_ACREC
NCBI RefSeqXP_973446.22e-11766.56%PREDICTED: similar to thiamin pyrophosphokinase [Tribolium castaneum]
NCBI nr blastpgi|2700140042e-11766.12%hypothetical protein TcasGA2_TC012698 [Tribolium castaneum]
NCBI nr blastxgi|2700140043e-11566.56%hypothetical protein TcasGA2_TC012698 [Tribolium castaneum]
Group
Gene OntologyGO:00167871.1e-42hydrolase activity
KEGG pathwayder:Dere_GG214181e-91 
 K06672 (SCC2, NIPBL)maps-> Cell cycle - yeast
InterPro domain[103-284] IPR0000861.1e-42NUDIX hydrolase domain
[78-292] IPR0157972.9e-20NUDIX hydrolase domain-like
Orthology groupMCL15935 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207601-TA
ATGAAAGATATGACTTCTATTTCCAAGACTAACTGTTCTGAATTAGCACTATTGGCAAGGAAATTCAATTGTTTTTATTTGTCAGGATTACGTCAAGGTATATGCAAACCATTTTTAGTAGCTGGACATCAAGTTGGACTTATTCGACCTGATGTACTTAAATATCTTCGAACATTTCCGGAGGTTTTCCGGATCACAGGGGAATATGTAGAACTCAATCCAGCCTTTAGAAATTATCAAGAAAGGACTTCCAAAGTTGCGGAAGTGTTGCAAAATCTAAGAAAAGAAAATGAAATTTGTGCTCTTAAAGGCTGGCGAGATGAGTGTTTTGAAGTGAGTACCCCCTTCCATCATGAGAGTCTACTGGAAATGGATAGAAGTGCTGTCTGCCTTTTTGGTATAAGAAACTATGGTATTAGTGTGAATGGCTACCTGTTTCATCCTTCTAAGGGTCTTTGCATCTGGTTACAGCAGAGAAGTTTCACCAAACAAACATGGCCAGGGAAGTGGGATTGTTTTGTCAGTGGTGGCTTGGCTGTGGGATTTGGTATTCTTGAAACTGCTATCAAAGAGGTGGCTGAAGAAGCTTCTGTTGTTGGAGAGCTAGTTAAGAAATTAGTGCCCGCGGGATGTGTTAGTTTCTATTTTGAAAGTGAGCGGGGTTTGTTTCCCAATACGGAATATGTGTATGATCTGGAGTTGCCATCGGAGTTTGTGCCGAAAAATGCTGACGGTGAAGTTGAAACATTTGAGCTTCTAACTGCTGAGGAATGTGTCCAAAGAGCTCTGACACCGCAGTTTAAGACAACAGGTGCACCGGTTCTACTGGACTTTTTGATCAGAAGGGGCTACATTAATCCCGAAAATGAGCCCAATTATAGACACATTGTGGAGTTGCTTCATGTGCCGCTGCAGACAATATACAACAGTTCTCTTAGAGATATAACATCAAATGGCGATGTGCAAAATTCTGAGAGTTAA

Protein sequence:

>DPOGS207601-PA
MKDMTSISKTNCSELALLARKFNCFYLSGLRQGICKPFLVAGHQVGLIRPDVLKYLRTFPEVFRITGEYVELNPAFRNYQERTSKVAEVLQNLRKENEICALKGWRDECFEVSTPFHHESLLEMDRSAVCLFGIRNYGISVNGYLFHPSKGLCIWLQQRSFTKQTWPGKWDCFVSGGLAVGFGILETAIKEVAEEASVVGELVKKLVPAGCVSFYFESERGLFPNTEYVYDLELPSEFVPKNADGEVETFELLTAEECVQRALTPQFKTTGAPVLLDFLIRRGYINPENEPNYRHIVELLHVPLQTIYNSSLRDITSNGDVQNSES-