Monarch geneset OGS2.0

DPOGS213735
TranscriptDPOGS213735-TA1716 bp
ProteinDPOGS213735-PA571 aa
Genomic positionDPSCF300278 + 199630-208455
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0025653e-12056.52% 
BombyxBGIBMGA011490-TA4e-16062.21% 
Drosophilaspel1-PA1e-7237.41% 
EBI UniRef50UniRef50_E2C8G17e-7837.13%DNA mismatch repair protein Msh2 n=9 Tax=Formicidae RepID=E2C8G1_HARSA
NCBI RefSeqXP_001121207.13e-7936.53%PREDICTED: similar to mutS homolog 2 [Apis mellifera]
NCBI nr blastpgi|3407098393e-8139.32%PREDICTED: LOW QUALITY PROTEIN: DNA mismatch repair protein Msh2-like [Bombus terrestris]
NCBI nr blastxgi|3407098393e-7938.81%PREDICTED: LOW QUALITY PROTEIN: DNA mismatch repair protein Msh2-like [Bombus terrestris]
Group
Gene OntologyGO:00055243.7e-25ATP binding
GO:00062983.7e-25mismatch repair
GO:00309833.7e-25mismatched DNA binding
GO:00055150.0001protein binding
GO:00082700.0001zinc ion binding
KEGG pathwayame:7253489e-79 
 K08735 (MSH2)maps-> Colorectal cancer
    Pathways in cancer
    Mismatch repair
InterPro domain[234-389] IPR0076963.7e-25DNA mismatch repair protein MutS, core
[79-219] IPR0078601.4e-13DNA mismatch repair protein MutS, connector
[507-564] IPR0130831.6e-08Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL11845 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213735-TA
ATGGGCATCGAGCCTAACAAACTAGACTATTTGGTCCTGTCGAAGGGAAACTTTGAGATACTCATCAGGAAATTACTATTGGTACGGAGATACAGAGTCGAGATATTTGTGTCGGAGGGATCAGTGAAGTCCTGTGATTGGTCGCTCAGGTACAAAGGTTCTCCTGGATACCTGTCCCAATTGGAGGAAATTGTCGGGGACGGTTTAGGATCCGCCAATGAGCAATCTACATGCTTGATGGCCGTCAATGTCAAGAGTGACGCCATCAGTAAGGGCCGCCTAGTGGGCATAGCGTGCGTGTATCAGAACGATTACACTTTATCAGTGTCGGAGTTCACTGATGATGTTGACTTCACCCAGCTAGAGTCGATCGTCGTACAAGTGGCGCCCTCTGAGTGCGTTGCGGCGCCGGCTGATAACGATTATAAAGCTTTAAAGAAGGTTATGGACAGAGCGAGTGTGACGGTGACGAAGGTCAAGAAGTCGGAGTTCACGACGGAAGGTCTCATCCAGGATCTGAACAGACTTCTCAAGTTCAAAGAGGATCAGCAAAAAGATGCCAATGGATTCCAGGAAACCAAACTACCAGTGGCCATGAGCGCTCTGGCAGCCGCCGTTAGATATACATCGCTGTTAAACGATGACACGAACTTTGGAAGGTTCCGCATATCGTCAGTGAAGGCCGACTACCTTCAGCTGGACTCCTCGGCCCTGTCGGCACTGAATGTGTTCCCTGAACTCGGTGATACGAACACTTCGCCAACCAGGAGCATCTACGGACTACTCGACAGATGTAGAACACAGCATGGAAAACGACTTCTGTGCCAGTTGCTTCGTCAGCCTCTTAGAGACATCAACCTGATCAACGAGCGCCTGGACATTATCCAGCTGTTGGCTATCAACCGCATTCCCGTCCTATTGAAGTGTCTGTCTGAGTTCAACGACCCCACGATACATTCGGTGCTCTGTGAACCGATAGCTGAACTTAACAACGACCTGGAAAAGTTCCAGCAGATGATTGAAACTACCATCGACCTAGAAGCTGTTGACAGAGGTGATTTTCTCGTGAAGCCATCTTTCGATGAAGAGTTACAGGTACTAGCGAATGATCTGGAAAAATTACAAAACTCAGCTGAGAAAGAATTAAACAAAGCGGCCAGGGATCTTGNTGAGAAGACTACAAGCAACAGGAACACCTTTCACGTCATCAGAAGCCCTAATAGATGCTGTGTTGGACGGCCAGTTGAACGAGGAGGCCTGGCTGGTTTCACCTTCGTCCCTGGCAGCGAGGGACCTCCTGGCTGGCACTCTGAGAGGACTCGTGCCTACATCCCGTTCTTTCCCCACAGCCTGTTCCATTCTAACGTGGAAGACGATAGGTCAGACAGTCGGGCTTCACAATCCGACAGTCCTGTCGCGTCGCCCACACCGAGCGTCACCCCGCAAACTGAGACACAAACAGACGCACAGCCAGAGGTGCCACAGAAAAACACTAATAACAATGAACCAGAGGAATCAATGAAGCTCAGTCTCCAAGAAGAAAACAGACAGTTGAAGGAAGCTAGGATGTGCAAAGTTTGTATGGACAGTGAGGTAAGCGTGGTGTTCCTTCCGTGCGGCCATCTTGTGTCGTGTGCGGGTTGTGGCGCAGCCCTGGGGGCGTGTCCTCTCTGTAGGGCTCCAGTGAAGGCCCTAGTAAGAGCCTACCTCGCTTAG

Protein sequence:

>DPOGS213735-PA
MGIEPNKLDYLVLSKGNFEILIRKLLLVRRYRVEIFVSEGSVKSCDWSLRYKGSPGYLSQLEEIVGDGLGSANEQSTCLMAVNVKSDAISKGRLVGIACVYQNDYTLSVSEFTDDVDFTQLESIVVQVAPSECVAAPADNDYKALKKVMDRASVTVTKVKKSEFTTEGLIQDLNRLLKFKEDQQKDANGFQETKLPVAMSALAAAVRYTSLLNDDTNFGRFRISSVKADYLQLDSSALSALNVFPELGDTNTSPTRSIYGLLDRCRTQHGKRLLCQLLRQPLRDINLINERLDIIQLLAINRIPVLLKCLSEFNDPTIHSVLCEPIAELNNDLEKFQQMIETTIDLEAVDRGDFLVKPSFDEELQVLANDLEKLQNSAEKELNKAARDLXEKTTSNRNTFHVIRSPNRCCVGRPVERGGLAGFTFVPGSEGPPGWHSERTRAYIPFFPHSLFHSNVEDDRSDSRASQSDSPVASPTPSVTPQTETQTDAQPEVPQKNTNNNEPEESMKLSLQEENRQLKEARMCKVCMDSEVSVVFLPCGHLVSCAGCGAALGACPLCRAPVKALVRAYLA-