Monarch geneset OGS2.0

DPOGS215534
TranscriptDPOGS215534-TA2064 bp
ProteinDPOGS215534-PA687 aa
Genomic positionDPSCF300129 - 696429-700607
RNAseq coverage43x (Rank: top 72%)
Annotation
HeliconiusHMEL0061672e-15568.10% 
BombyxBGIBMGA012027-TA3e-2241.53% 
DrosophilaMlh1-PA1e-6245.00% 
EBI UniRef50UniRef50_D6WX753e-6446.99%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WX75_TRICA
NCBI RefSeqNP_001154957.19e-6937.35%mutL homolog 1 [Nasonia vitripennis]
NCBI nr blastpgi|2388596652e-6737.35%mutL homolog 1 [Nasonia vitripennis]
NCBI nr blastxgi|2388596655e-6837.35%mutL homolog 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00056343.2e-142nucleus
GO:00055243.2e-142ATP binding
GO:00062983.2e-142mismatch repair
GO:00309833.2e-142mismatched DNA binding
KEGG pathwaynvi:1001241993e-68 
 K08734 (MLH1)maps-> Colorectal cancer
    Pathways in cancer
    Endometrial cancer
    Mismatch repair
InterPro domain[3-688] IPR0111863.2e-142DNA mismatch repair protein Mlh1
[3-688] IPR0020993.2e-142DNA mismatch repair protein
[7-126] IPR0035942.5e-48ATPase-like, ATP-binding domain
Orthology groupMCL12828 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215534-TA
ATGTGCGAACCAGGAATAATACGAAAACTTGACGAAGAAGTAGTAAATCGTATCGCTGCCGGTGAAATAGTACAACGACCAGCAAATGCTTTAAAAGAACTGATAGAAAATAGTTTAGATGCCCATTCTAATAATATAATAATAACAGTAAAAGCCGGTGGCTTGAAGTATCTACAGATTCAAGACAATGGCACAGGAATTCGCAGTGACAACTTAGACATAGTCTGTGAGAGATTCACTACATCTAAATTAAAGCAATATGAAGATTTACAAGCTATTAGTACTTATGGATTCCGCGGAGAGGCTTTGGCCAGCATCAGTCATATTGCCCATTTGACTATACTTACAAAGACTGCTAATGAGAAATGTGCTTATAACGAGGAGTTATTCTACGAAACCATTCTCTATGATTTCCAAAATCTCGGTTTAATTAAATTATCGAATCCGTTGCCCTTAGAGGAACTGTTGGTACTAGGACTCGGTTCGCACGAAGAGGAATGGGACAACGACTTAGGTGACATGACAGAAATGGCTGCACAGATGACAAAATTACTCATAAGCAAAGGTCCTATGTTATATGAATATTTTTCGATGGAAATTAACAACAAAGGCGAGTTATTGTCTCTGCCTTTGCTATTGGACGGTCACACCCCCTTCATGGGAGCATTGCCAGTGTACTTAGTTAGACTAGTAACTGAAGTGAACTGGGAGTCGGAGAAGGAGTGTTTCGATACGTTCAGCCGACAGACAGCTATATTTTATTCACAACCAAACAGAGATTCACAGGAAGACGTCTCCGAGACCGAGTCCTGGAAACAGGAGCACATAATATTTCCAGCTATAAGAAGGAATTTCTTGCCGCCTAGCAGTTTCGTCAGCAACGGATCTATACTACAAATAGCTAATTTATGCGACCTCTATAAAGCTCGCCTGCCAGGTGCGACTATGGTCGAGGAAATCATCAAAAAGACAACAGACGGTGCAAAAGTATACGCAAAAGATTTAGTACGAGTGGATTCGGATGCTCAGAAAATTGATAAATTCTTCAAAGTAACTACAATAGATAGGTCAAAAGAAAACGAAACAAATGAAACAAGAAATGAGACAACGAATCAAGATAGTAAGGAAAGAAATGAAGGGACAGAAGCCGTTGAAGTTATTGATAATAACGAAGTTATTGATGAAACAGATGATATAATTAATAATTCTCTTGTATCGGAACCAAATGAGACGAAGGCAAAGAAATATAAAACAGCTATCAGTAGCAACGTCACATACATAGATCCTAAAGAATCATTTAAAACGAGGACCTTCAAACATCAGAGAGTGGAAACTAAGTTGACCAGTGTACACCAGCTGAGATTGGACGTTGAGAACAAATGTAATATGAATATGAGGGAGATTCTGGCAAATCTCATTTTTATAGCCTGCATCGACTGCGAGCGCTCCCTGATACAGCACTCAACAAAGCTTTATTTGTGTGACACTACTCGATTGACCGAGGAGTTATTCTACGAAACCATTCTCTATGATTTCCAAAATCTCGGTTTAATTAAATTATCGAATCCGTTGCCCTTAGAGGAACTGTTGGTACTAGGACTCGGTTCGCACGAAGAGGAATGGGACAACGACTTAGGTGACATGACAGAAATGGCTGCACAGATGACAAAATTACTCATAAGCAAAGGTCCCATGTTATATGAATATTTTTCGATGGAAATTAATAACAAAGGCGAGTTATTGTCTCTGCCTTTGCTATTAGACGGTCACACCCCCTTCATGGGAGCATTGCCAGTGTACTTAGTTAGACTAGTAACTGAAGTGAACTGGGAGTCGGAGAAGGAGTGTTTCGATACGTTCAGTCGACAGACAGCTATATTTTATTCACAACCAAACAGAGATTCACCGGAAGACGTGTCTGAGACCGAGTCCTGGAAACAGGAGCACATAATATTTCCAGCTATAAGAAGGAATTTCTTGCCGCCTAGCAGTTTCGTCAGCAACGGATCTATACTACAAATAGCTAATTTATGCGACCTCTATAAAGTGTTTGAGCGTTGTTAA

Protein sequence:

>DPOGS215534-PA
MCEPGIIRKLDEEVVNRIAAGEIVQRPANALKELIENSLDAHSNNIIITVKAGGLKYLQIQDNGTGIRSDNLDIVCERFTTSKLKQYEDLQAISTYGFRGEALASISHIAHLTILTKTANEKCAYNEELFYETILYDFQNLGLIKLSNPLPLEELLVLGLGSHEEEWDNDLGDMTEMAAQMTKLLISKGPMLYEYFSMEINNKGELLSLPLLLDGHTPFMGALPVYLVRLVTEVNWESEKECFDTFSRQTAIFYSQPNRDSQEDVSETESWKQEHIIFPAIRRNFLPPSSFVSNGSILQIANLCDLYKARLPGATMVEEIIKKTTDGAKVYAKDLVRVDSDAQKIDKFFKVTTIDRSKENETNETRNETTNQDSKERNEGTEAVEVIDNNEVIDETDDIINNSLVSEPNETKAKKYKTAISSNVTYIDPKESFKTRTFKHQRVETKLTSVHQLRLDVENKCNMNMREILANLIFIACIDCERSLIQHSTKLYLCDTTRLTEELFYETILYDFQNLGLIKLSNPLPLEELLVLGLGSHEEEWDNDLGDMTEMAAQMTKLLISKGPMLYEYFSMEINNKGELLSLPLLLDGHTPFMGALPVYLVRLVTEVNWESEKECFDTFSRQTAIFYSQPNRDSPEDVSETESWKQEHIIFPAIRRNFLPPSSFVSNGSILQIANLCDLYKVFERC-