Monarch geneset OGS2.0

DPOGS211542
TranscriptDPOGS211542-TA1320 bp
ProteinDPOGS211542-PA439 aa
Genomic positionDPSCF300159 - 166975-168294
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0132942e-1042.67% 
BombyxBGIBMGA001603-TA7e-3645.51% 
Drosophila% 
EBI UniRef50UniRef50_A3FMR31e-3531.00%Pol-like protein n=2 Tax=Biomphalaria glabrata RepID=A3FMR3_BIOGL
NCBI RefSeqXP_001946779.15e-3229.30%PREDICTED: similar to pol-like protein, partial [Acyrthosiphon pisum]
NCBI nr blastpgi|1259017874e-3531.00%pol-like protein [Biomphalaria glabrata]
NCBI nr blastxgi|1259017872e-3331.08%pol-like protein [Biomphalaria glabrata]
Group
Gene OntologyGO:00036764.4e-20nucleic acid binding
GO:00045232.2e-17ribonuclease H activity
KEGG pathway 
InterPro domain[231-373] IPR0123374.4e-20Ribonuclease H-like
[234-358] IPR0021562.2e-17Ribonuclease H domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211542-TA
ATGTACAACAATATACCTCTGCCTTGGCTTAAGGAAAAGAAATTCTTAGGCATTTGGTTAGACCCAAAACTAACCCTAGAATGTCACATCAATAATGTTGAAAGAAATGCTTGTAAAGGCCTTAATGTTATGAGATCTTTGGCAGGTGTATATTGGGGTTCAGATCCTAAAACTCTAGCTATGATGTATAAAACAATAGTAAGAAGTCATTTTGATTACAGCACTTTGGCATACATAAATGCTAACATAAGCTTGTTAAGAAAATTAGATATTCTACAAAATAGAGCTTTACGTATAATAACAGGAGCAATGTGCTCAACACCCATTAACAGTATGGAATGTGAATCTTGTATACCACCATTGTTATTGAGAAGAATACAAATAGCAGAAAGATTTTGTTTAAAGTTAATGTCTTTAAACAATAACTATACCCTAAACCATATTTTACCTCCCTCTTACAATTTGATCAACAGTGAACCATACATGGATTGCAAACAATTGATGTCTGGATTTTCACCCACACTTCTAAGAATATGTGTATTTATAAAATCTGTTTTTGTAAATATGAACATAACTGATTCATGGCCAATGTATAGCTTAAGTTTTAGTGCCTTAATTCATCCAGTAAATATTAGTAATAAGAAAATTCTTACTCAGTCAGATCTCCATGAATTTATTGGTGACAATAATGATGTGTATAGAATTTACACTGATGGGTCTAAGAGCTCAGATGGTGTAACATCAGCATTTTACGATCCTCAACTGAAAATTTCCAAATGTTTTCAAATTAATGATAACTGTACAATTTATACAGCTGAATGTTATGCTATATTGAAAGCTCTAGAATATGCATGTAATGTTAATAACTGTCATATAATTATATTAACTGATTCTCAAAGTGCACTCCTAGGTTTGGAAAAAACTTGTCTGAAATATAATACTAGCTATATTCTTTATGAAATTAAGAAGATGTTATATGATATGCACATTCATGGTAAAGTAGTCCAATTACAATGGGTACCTTCACATAATGGTATAATTGGAAATGAACTGGCTGATCAGGCCACCAGAGGACGCGCCGATGGAAACCACAGCAACTGGATGAAGACTCCATATACTGACTTTCGCTGCACATTCACCATGGCCCTGAAATCATTATACAAAGAGTACTGGAAGACTGTAAGCAAAGAAGAAGGAACATGGTACGCAGACATACAGAAAGCCCCGCCCGCCCAAATTTGGTATAACAAATTAAAGCAGTATAACAGAAAATGTATAGTTACAATCATTACACTGCCGGACGCCCAGTCATTAATATAA

Protein sequence:

>DPOGS211542-PA
MYNNIPLPWLKEKKFLGIWLDPKLTLECHINNVERNACKGLNVMRSLAGVYWGSDPKTLAMMYKTIVRSHFDYSTLAYINANISLLRKLDILQNRALRIITGAMCSTPINSMECESCIPPLLLRRIQIAERFCLKLMSLNNNYTLNHILPPSYNLINSEPYMDCKQLMSGFSPTLLRICVFIKSVFVNMNITDSWPMYSLSFSALIHPVNISNKKILTQSDLHEFIGDNNDVYRIYTDGSKSSDGVTSAFYDPQLKISKCFQINDNCTIYTAECYAILKALEYACNVNNCHIIILTDSQSALLGLEKTCLKYNTSYILYEIKKMLYDMHIHGKVVQLQWVPSHNGIIGNELADQATRGRADGNHSNWMKTPYTDFRCTFTMALKSLYKEYWKTVSKEEGTWYADIQKAPPAQIWYNKLKQYNRKCIVTIITLPDAQSLI-