Monarch geneset OGS2.0

DPOGS207296
Transcript        DPOGS207296-TA  2463 bp
Protein           DPOGS207296-PA  820 aa
Genomic position  DPSCF300008 + 702429-706010
RNAseq coverage   66x (Rank: top 67%)
Annotation
Heliconius       HMEL002177        0.0      71.01%
Bombyx           BGIBMGA012027-TA  0.0      68.95%
Drosophila       Pms2-PA           4e-108   46.60%
EBI UniRef50     UniRef50_D6WH35   0.0      46.27%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WH35_TRICA
NCBI RefSeq      XP_001947410.1    0.0      46.31%  PREDICTED: similar to DNA mismatch repair protein pms2 [Acyrthosiphon pisum]
NCBI nr blastp   gi|91079030       0.0      45.22%  PREDICTED: similar to DNA mismatch repair protein pms2 [Tribolium castaneum]
NCBI nr blastx   gi|157136917      0.0      45.56%  DNA mismatch repair protein pms2 [Aedes aegypti]
Group
Gene Ontology    GO:0006298        2e-94    mismatch repair
                 GO:0005524        4.2e-70  ATP binding
                 GO:0030983        2.9e-22  mismatched DNA binding
KEGG pathway     api:100164488     0.0
                 K10858 (PMS2)     maps-> Mismatch repair
InterPro domain  [10-816]   IPR002099  0        DNA mismatch repair protein
                 [13-386]   IPR014763  2e-94    DNA mismatch repair protein, N-terminal
                 [13-224]   IPR003594  4.2e-70  ATPase-like, ATP-binding domain
                 [207-408]  IPR020568  2.4e-34  Ribosomal protein S5 domain 2-type fold
                 [635-779]  IPR014790  6.9e-34  MutL, C-terminal, dimerisation
                 [306-406]  IPR014721  1.8e-29  Ribosomal protein S5 domain 2-type fold, subgroup
                 [308-404]  IPR013507  2.9e-22  DNA mismatch repair protein, C-terminal
Orthology group  MCL12794          Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207296-TA
ATGGAAGAAACAAATATACAAAACAAGCAAATCAATACTATTAAACCAATTAACAAGGATGCTGTTCACAAAATATGTTCTGATCAAGTGGTTCTTAGTTTAGCAGTAGCAGTGAAGGAACTAGTTGAAAACTCTTTAGATGCTGGGGCTACTAATATTGAAGTCAGACTTAAAAACTATGGCACAGAATTAATAGAAGTTTCAGATAATGGATCTGGTGTAACTGAGGATAATTTTGAAGCCTTGACCTTAAAATATCATACATCAAAATTAAACGATTACTCGGATTTGCTTGGAGTATCTAGCTTTGGCTTCAGGGGAGAGGCTTTAAGTTCACTTTGCTCTTTAGCCAACCTTACAGTGACAACCAGACATGAAACAAGTAAACATGCTACCAAAATTGAATATGATCAGAAAGGCCATATATCAAGCAAAACACCTTGCTCCCGTCAAGTGGGAACAACAGTGACTTTAACTAACCTTTTCTATACATTGCCAGTAAGACAAAAAGAGTTTCATAAAAATGCAAAACGGGAGTTCAATAAAATGACCAGTCTTTTATATGCATATTGTTTAATTTCTAAAGGGGTAAAAATAACATGTAGTAATCAAACAAACTCAAATTCTAAGTCACTAGTTGTTGCAACCCAAGGCTCTAATTCCTATAAGGATAATATTGCAAGTGTGTTTGGAGTTAAGCAATTACAAAGTATTCTAGATGTTAAAACTGAGCTTGTTTCTAATATCAAAGATAATATATTCAGAGGACTATCGGGAGAAGCAAAGGTAAATGAAGAAAGTATTAATATAGAAGACATTGAAATTGATTTATCTGAAGATTCCAATGATGCCCAAACTGATGAAAATAATTCAAGTCAAATACCATTACCTCAAAGATCACAGGGTTACAAAAATATACCAAATCCTGTTGAGCTCACAGGATATATCTCTTCATGTGCTCATGGTAGTGGAAGATCAAGTACAGACAGACAATTCTTCTATATCAACTCCAGGCCTTGTGAGCCGGTGAAAATTATTAAACTGATCAATGAAATATATCGACAATACAACCCACATCAGTATCCATTTGTATTTTTAAATGTTAATATTGAAAGAACATCAGTTGATGTAAATGTGACTCCTGATAAGAGGAAAGTATTTTTAACCAAAGAGAAAGCTATATTGGATGTTGTTAAATGCTCTCTTTTGAAAATGTTTGAGGATATTCCTAGATCTGTTAAAGTCGAGGCTCCGTCCATTGTCGCTGCGGTGAAAACTGAGCCTGAACTTTCTCAGCCCAGGATATTTCAGTCATTTCTCAAACAATTTAGCAACAAATCAAGTTCTATCAAACCTAGTGAGTCTAACAATCCTGATAAATGTGAATTAAAAAGGAAATCATCTTCAGTTTTGGACAATTTTATTCAGATAAAAAAGACTCTTGTTACAAAACAAGAAGATAATTTAATTGAAGAGGAAACTGAAGAAAAATGTATGTTAGATGAAAATAAGGAACATAATATACTAAATGTATCTACTGAAGAAAATATTGAAAATGCTAATGAGAGAAATATTAATAATTCTTTGGAAAATAGAGATACTACCATTGTTGAAGAATCTCATACTATTTACTGCAATACTAAAAGTATAGAAACAACAAAACCAAAAGGAAATAAAGTGATAACAGATAAAGAACAGTTAGGCAAGACTGTGAGAATGGAAGTCACATTAAAGACGTCTATGGAACAAATAAAAAAACTTTCTGATACATACAAAAAAAATAAAGACAATTCAAAACCGGATAGAATAAGATTCAAAACTAAAATAGATCCAGTATTTAATAAGAAATGTGAAGAAGAATTAAGTAGGGAAATAGAAAAACAATCTTTTAAAAAAATGAAAATTATTGGTCAGTTTAATCTAGGCTTTATAATAACTAGACTTGATGACGATCTTTTTATTATTGATCAACATGCTACGGATGAGATATACAACTTTGA
AACCTTACAGAAAACTACAGAACTTACGAGTCAAAAGTTAGTTATCCCACAGCAACTTGAACTCACTGGGGTCAATGAACAAATATTAATGGACAATCTAGATATTTTCAAAAAGAATGGCTTTACTTTTGCAATAGACGAAACTGCTGCTCCTACCAAAAGAGTTAAACTTTTAACTCTTCCTATGTCCAAAAATTGGATATTTGGAAAAGAAGACATTGAGGAACTCCTATTTATTCTGAAGGAAAATCACTCAGAATATTGTAGGCCCAGCAGAGTAAGAGCAATGTTCGCGTCCCGAGCGTGCAGAAAGTCTGTTATGATCGGAACGGCGCTCAGTAAGGGAGACATGAGAAAACTAGTTGACCACATGGCTGAAATAGACAAGCCTTGGAATTGCCCTCACGGAAGACCAACAATACGGCATCTCATAAATCTAGCGATGGTACACACTGTTGACTAA

Protein sequence:

>DPOGS207296-PA
MEETNIQNKQINTIKPINKDAVHKICSDQVVLSLAVAVKELVENSLDAGATNIEVRLKNYGTELIEVSDNGSGVTEDNFEALTLKYHTSKLNDYSDLLGVSSFGFRGEALSSLCSLANLTVTTRHETSKHATKIEYDQKGHISSKTPCSRQVGTTVTLTNLFYTLPVRQKEFHKNAKREFNKMTSLLYAYCLISKGVKITCSNQTNSNSKSLVVATQGSNSYKDNIASVFGVKQLQSILDVKTELVSNIKDNIFRGLSGEAKVNEESINIEDIEIDLSEDSNDAQTDENNSSQIPLPQRSQGYKNIPNPVELTGYISSCAHGSGRSSTDRQFFYINSRPCEPVKIIKLINEIYRQYNPHQYPFVFLNVNIERTSVDVNVTPDKRKVFLTKEKAILDVVKCSLLKMFEDIPRSVKVEAPSIVAAVKTEPELSQPRIFQSFLKQFSNKSSSIKPSESNNPDKCELKRKSSSVLDNFIQIKKTLVTKQEDNLIEEETEEKCMLDENKEHNILNVSTEENIENANERNINNSLENRDTTIVEESHTIYCNTKSIETTKPKGNKVITDKEQLGKTVRMEVTLKTSMEQIKKLSDTYKKNKDNSKPDRIRFKTKIDPVFNKKCEEELSREIEKQSFKKMKIIGQFNLGFIITRLDDDLFIIDQHATDEIYNFETLQKTTELTSQKLVIPQQLELTGVNEQILMDNLDIFKKNGFTFAIDETAAPTKRVKLLTLPMSKNWIFGKEDIEELLFILKENHSEYCRPSRVRAMFASRACRKSVMIGTALSKGDMRKLVDHMAEIDKPWNCPHGRPTIRHLINLAMVHTVD-
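
The transcript length, protein length, and genomic position at the top of this record can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: the function and variable names are hypothetical, and it assumes that the transcript is coding sequence only (one stop codon included) and that the genomic coordinates are 1-based and inclusive. None of these assumptions is stated by the record itself.

```python
# Values copied from the record above; all names here are illustrative.
TRANSCRIPT_BP = 2463
PROTEIN_AA = 820
GENOMIC_START, GENOMIC_END = 702429, 706010


def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def genomic_span(start: int, end: int) -> int:
    """Length of a closed interval; assumes 1-based inclusive coordinates."""
    return end - start + 1


cds_bp = expected_cds_length(PROTEIN_AA)
span_bp = genomic_span(GENOMIC_START, GENOMIC_END)

# Does the listed transcript length match a coding sequence with one stop codon?
print(cds_bp == TRANSCRIPT_BP)
# Genomic bases not present in the transcript (for example, intronic sequence).
print(span_bp - TRANSCRIPT_BP)
```

Under these assumptions the transcript length matches the protein length exactly, and the genomic interval is longer than the transcript, as expected for a gene with introns.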