Monarch geneset OGS2.0

DPOGS209163
TranscriptDPOGS209163-TA831 bp
ProteinDPOGS209163-PA276 aa
Genomic positionDPSCF300061 - 126510-129653
RNAseq coverage89x (Rank: top 63%)
Annotation
HeliconiusHMEL0097472e-10569.37% 
BombyxBGIBMGA011480-TA3e-8563.60% 
DrosophilaCG13690-PA1e-9463.50% 
EBI UniRef50UniRef50_Q9VPP51e-9263.50%Ribonuclease H2 subunit A n=41 Tax=Coelomata RepID=RNH2A_DROME
NCBI RefSeqXP_001965374.13e-9463.87%GF24795 [Drosophila ananassae]
NCBI nr blastpgi|1947665235e-9363.87%GF24795 [Drosophila ananassae]
NCBI nr blastxgi|2239675096e-9063.87%CG13690-PA [Drosophila melanogaster]
Group
Gene OntologyGO:00037237.8e-116RNA binding
GO:00045237.8e-116ribonuclease H activity
GO:00036761.3e-52nucleic acid binding
GO:00160701.8e-51RNA metabolic process
KEGG pathwaydan:Dana_GF247958e-94 
 K10743 (RNASEH2A)maps-> DNA replication
InterPro domain[1-274] IPR0013527.8e-116Ribonuclease HII/HIII
[1-226] IPR0123371.3e-52Ribonuclease H-like
[2-217] IPR0046491.8e-51Ribonuclease H2, subunit A
[179-222] IPR0231608.6e-17Ribonuclease HII, helix-loop-helix cap domain
Orthology groupMCL11203 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209163-TA
ATGCTTGGTGTTGACGAAGCTGGACGTGGGCCTGTTTTGGGTCCAATGGTATACGGAATTGCATATTGTCCAATAAATCAAAACAAAGTATTAGAATCGCTCGGCTGTGCGGACTCAAAAGCTCTTACAGAAGAAAAGAGAGACGAGATTTTTCTTAAAATGTTAACCGAGAAAGAAGCTGTTGATAATGTTGGCTGGATGGCTGAAGTTATATCACCAAATTATATTTCTAACTCTATGTATAAAAGAGCCAAACATTCTCTCAACGAGGTATCAATGAATTCCGCGATATCTTTGATAAAAAAAACTATTGAATTAGGTGGGAATATAACAGAGGTGTATGTGGATACTGTCGGCCCTCCGGAGAAATATCAGGCCAGGTTAAAAGAAATCTTCCCTGATATTAAGATCACCGTGGCAAAGAAAGCTGATTCCATTTACCCAATAGTGTCGGCGGCCAGTATAGTGGCTAAGGTCACAAGAGACCACGCCCTCAAGGTTTGGGAATTTCCCGAAGGTCTTGAGATCAATCACAAGGACTTTGGGAGTGGTTACCCAGGAGATCCATTGACTAAGAAGTTTATAAGGGAACAGATTGACAGAATATTCGGCTACCCCCTGTTGGTAAGGTTTAGTTGGTCCACGGCCGAACTGGCTCTCCAGGAGAGAGCAGCGAAGTGCAGCTTCGAGGACATAGACGACGAGAATACGAAGAAACCGAAAGGAACCCAGGCCATCAGCTCGTTCTTTTCACCGAAGAACGAGCGGAAACGGAAGAGGCATAAATTTTTCGAAGAAAGAAATTTGACAATGAGCAATGCTTTCGAATAA

Protein sequence:

>DPOGS209163-PA
MLGVDEAGRGPVLGPMVYGIAYCPINQNKVLESLGCADSKALTEEKRDEIFLKMLTEKEAVDNVGWMAEVISPNYISNSMYKRAKHSLNEVSMNSAISLIKKTIELGGNITEVYVDTVGPPEKYQARLKEIFPDIKITVAKKADSIYPIVSAASIVAKVTRDHALKVWEFPEGLEINHKDFGSGYPGDPLTKKFIREQIDRIFGYPLLVRFSWSTAELALQERAAKCSFEDIDDENTKKPKGTQAISSFFSPKNERKRKRHKFFEERNLTMSNAFE-