Monarch geneset OGS2.0

DPOGS207851
Transcript: DPOGS207851-TA, 1293 bp
Protein: DPOGS207851-PA, 430 aa
Genomic position: DPSCF300042 + 1366688-1368413
RNAseq coverage: 19x (Rank: top 80%)
Annotation
Heliconius: HMEL015322, 3e-24, 77.59%
Bombyx: BGIBMGA009815-TA, 2e-88, 44.89%
Drosophila: spel1-PA, 2e-24, 34.55%
EBI UniRef50: UniRef50_UPI00022C9A7A, 5e-43, 40.17%, UPI00022C9A7A related cluster n=2 Tax=unknown RepID=UPI00022C9A7A
NCBI RefSeq: XP_001943834.1, 1e-42, 45.03%, PREDICTED: similar to predicted protein [Acyrthosiphon pisum]
NCBI nr blastp: gi|383847430, 2e-45, 41.30%, PREDICTED: mutS protein homolog 4-like [Megachile rotundata]
NCBI nr blastx: gi|383847430, 3e-44, 31.76%, PREDICTED: mutS protein homolog 4-like [Megachile rotundata]
Group
Gene Ontology: GO:0005524, 2.7e-45, ATP binding
GO:0006298, 2.7e-45, mismatch repair
GO:0030983, 2.7e-45, mismatched DNA binding
KEGG pathway 
InterPro domain: [3-173] IPR000432, 2.7e-45, DNA mismatch repair protein MutS, C-terminal domain
Orthology group: MCL15929 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207851-TA
CTTTTCCAGTTAGGTTGCTTTGTACCAGCAACTAACGCCGTGATCAGACTATGCGATCGTATCTTCTCAAGATTAGGTTTCAACAATAGTATTGAATTCAATGCTTCTAGTTTTGTTATAGAGATGAAGGAAAGTCAGCATATATTAAAAGGGCTTACATCTTCGAGTTTGGTCATAATCGACGAACTCTGCCGTGGAACGAACGTTGAAGAGGGTACGAGTATAGCGTGGTCGATTTGTGAGGAACTCTTGATGAGTGAGGCGTATACATTTTTAACAACCCATTTCATGTATTTAACAAAACTTGAGGACTTATACTACAATGTTATAAATGTCCATACAGCTGTGAAAGAGGAATCCCAAGGTCCAGATGTACTGGAAAAGAGATTGATATATCAACATAAAATTGAACCTGGAATTACACAGATTAAACATTACGGTATAGCATTAGCTGCTAAGACAAATCTACCACAAGATATTGTTAGTTTGGCTAAAGAACTTGCGGAACTAATAGAAAGCAACACAAAGCCAATGTCAGGTTCATCGCAAAAAGAAACAGATTTAAAACTATTATATGATTTGAATGCCAAAATTCAGATGGAATCTAGAAAGAATTATAATAATGAAGAATCTATAAGAAATATATTGAGACAATTTAAGAATAAATATCCACACATAGTAGAGGGATTAAAGTTAGAAAGAAATTCGAGAAATATTCATAATTATTCATCCCATGAGAGTCCTAAAGTACCAGAATTTGATACAGAAAAAAACACAATTTCCCCCCAAAAACAAACGTCAACGTCAGTATATAGTAACTCCGAAAAAGAATGCGTAGAAATAAATACGAATATAATTGCTACGGTCAAATCATCTACACAACTTCTGAATACATTTGATGATAGTACGAATTATATACATACAAAAAATCATACTCGACATGCAAATAACAATAACATCACCGGTATTATTGATGCAAATGAACAAAGCCACAATATCATAGTAAAAGCAGACATCCATCATAATAATTTACCAAAAACTTCATTTCCTTCTCAACTAGATAATAATATGAAAGCATCATTTGATGGGAATTCTTCGACAGATTCGGATATAGCTGAGGCTTTGACTCAAATTATCGAAGAGAATTCGGAATCGGATGCTGGGGAAAACATCTCGGTCGATGAAGAATTAATGAGGGAAACGTTGGATGAAATTAACAAAGATTTGAATGTGTCCGTTGATTCTATAAGCGATATTATTCTCACTCCGCCGATGAAATTTAGGGATTTTTAA

Protein sequence:

>DPOGS207851-PA
LFQLGCFVPATNAVIRLCDRIFSRLGFNNSIEFNASSFVIEMKESQHILKGLTSSSLVIIDELCRGTNVEEGTSIAWSICEELLMSEAYTFLTTHFMYLTKLEDLYYNVINVHTAVKEESQGPDVLEKRLIYQHKIEPGITQIKHYGIALAAKTNLPQDIVSLAKELAELIESNTKPMSGSSQKETDLKLLYDLNAKIQMESRKNYNNEESIRNILRQFKNKYPHIVEGLKLERNSRNIHNYSSHESPKVPEFDTEKNTISPQKQTSTSVYSNSEKECVEINTNIIATVKSSTQLLNTFDDSTNYIHTKNHTRHANNNNITGIIDANEQSHNIIVKADIHHNNLPKTSFPSQLDNNMKASFDGNSSTDSDIAEALTQIIEENSESDAGENISVDEELMRETLDEINKDLNVSVDSISDIILTPPMKFRDF-
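
The transcript and protein lengths listed above agree with each other: 430 residues plus one stop codon make 431 codons, or 1293 bp. A minimal Python sketch of that check follows, together with a translation of the first and last codons of DPOGS207851-TA. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; the function names are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame nucleotide string, writing stops as stop_symbol."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(nt), 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_transcript_length(protein_aa: int, has_stop: bool = True) -> int:
    """Coding length in bp implied by a protein length, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


# Length check against the record: 430 aa plus a stop codon.
print(expected_transcript_length(430))   # 1293

# First five codons and last three codons of DPOGS207851-TA.
print(translate("CTTTTCCAGTTAGGT"))      # LFQLG
print(translate("GATTTTTAA"))            # DF-
```

The translated fragments match the start (LFQLG) and end (DF-) of DPOGS207851-PA. The first codon is CTT (Leu) rather than ATG.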