Monarch geneset OGS2.0

DPOGS200718
Transcript        DPOGS200718-TA  879 bp
Protein           DPOGS200718-PA  292 aa
Genomic position  DPSCF300030 - 255398-257359
RNAseq coverage   245x (Rank: top 42%)
Annotation
Heliconius      HMEL010086        2e-119  71.23%
Bombyx          BGIBMGA001118-TA  8e-107  62.46%
Drosophila      CG3362-PA         8e-62   40.34%
EBI UniRef50    UniRef50_D1ZZV3   2e-65   45.64%  Putative uncharacterized protein GLEAN_07393 n=1 Tax=Tribolium castaneum RepID=D1ZZV3_TRICA
NCBI RefSeq     XP_001863256.1    1e-67   41.67%  conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp  gi|170054716      3e-66   41.67%  conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastx  gi|157127025      2e-65   42.36%  hypothetical protein AaeL_AAEL000258 [Aedes aegypti]
Group
Gene Ontology    GO:0000287  1.1e-84  magnesium ion binding
                 GO:0008253  1.1e-84  5'-nucleotidase activity
                 GO:0005737  1.1e-84  cytoplasm
KEGG pathway     dpo:Dpse_GA17406  3e-60
                 K01081 (E3.1.3.5) maps-> Purine metabolism
                                          Nicotinate and nicotinamide metabolism
                                          Pyrimidine metabolism
InterPro domain  [12-290] IPR006434  1.1e-84  Pyrimidine 5'-nucleotidase, eukaryotic
                 [20-292] IPR023214  8.3e-28  HAD-like domain
Orthology group  MCL11065  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS200718-TA
ATGTTAACAAGTATTGACGATATACCTGAACTGTGCAAGGAGAATGTATACATTAAAGACAAAGAAAGTTTATTAAAGAATATCAACAAGATCATTGCTGAAGGCCATAAGAAATTGCAAATTTTGACGGATTTCGATCACACATTAACAAGGCATGATGTGGATGGTGTACCCGTGCTTACTAGTTTTGGAATGTTTAAGGAGTGTCCCTCAGTACCCCAAAAGTACAAAGATGATGAAACAATGTTAGCCAATAAATATAAACCAATTGAAGTAGATGCTAATATGAGCATTGAAGATAAAGTGAAACACATGAAAGACTGGTACATGGCCTCACATAATTTAATGAAGGGTCTTAAGTTTCCTAGGAATGAACTTATGGATATTGGTCATAAGATGGTTGGATGTTTCCGGAAAGGTGTCAATGATTTGATTTCTTGGAGCGAGTGTCATCAGGTGCCCGTGCTGGTGTTCTCAGCTGGTCTCGGCGAGTGTGTCGTCGCTGCTCTTCAGGCCGCTAAATTCTTACTACCTAATGTCAAGGTGATATCCAATTTTCTTGCAATGGATGAAAATGATAACATAGTTGGCATTCAAGGTGAAATAATACATACTTATAATAAAAATGAAACAGCTATAAAACATACAGAATATTACGGTATGGTTAAGGAGAGGAATAACGTTCTGTTAATGGGCGACAATATCGGTGACGCGGGTATGGCAGAGGGGATGGAGCATTGTGATGTAGTCATCAAAATAGGATTCCTTGGCAGAAACACAGAAGCCAATCTACAGAACTATGTGTGCACGTTTGATATAGTCGTTGTTAACGAACATACTATGGATATAGCCAATGCAATATTAAAACTAGTGCTGTGA

Protein sequence:

>DPOGS200718-PA
MLTSIDDIPELCKENVYIKDKESLLKNINKIIAEGHKKLQILTDFDHTLTRHDVDGVPVLTSFGMFKECPSVPQKYKDDETMLANKYKPIEVDANMSIEDKVKHMKDWYMASHNLMKGLKFPRNELMDIGHKMVGCFRKGVNDLISWSECHQVPVLVFSAGLGECVVAALQAAKFLLPNVKVISNFLAMDENDNIVGIQGEIIHTYNKNETAIKHTEYYGMVKERNNVLLMGDNIGDAGMAEGMEHCDVVIKIGFLGRNTEANLQNYVCTFDIVVVNEHTMDIANAILKLVL-