Monarch geneset OGS2.0

DPOGS207850
TranscriptDPOGS207850-TA1440 bp
ProteinDPOGS207850-PA479 aa
Genomic positionDPSCF300042 + 1362141-1366686
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0153223e-15671.84% 
BombyxBGIBMGA009815-TA6e-13362.96% 
Drosophila% 
EBI UniRef50UniRef50_G3NYX21e-4735.66%Uncharacterized protein (Fragment) n=1 Tax=Gasterosteus aculeatus RepID=G3NYX2_GASAC
NCBI RefSeqXP_001627391.12e-4245.86%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|1563673713e-4145.86%predicted protein [Nematostella vectensis]
NCBI nr blastxgi|3072055428e-4239.17%MutS protein-like protein 4 [Harpegnathos saltator]
Group
Gene OntologyGO:00055249.1e-12ATP binding
GO:00062989.1e-12mismatch repair
GO:00309839.1e-12mismatched DNA binding
KEGG pathway 
InterPro domain[100-235] IPR0078609.1e-12DNA mismatch repair protein MutS, connector
[283-425] IPR0076962e-06DNA mismatch repair protein MutS, core
Orthology groupMCL15929 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207850-TA
ATGAATGTGATTCAATCTGATAAGCGTCCCTTTAAATCGTTTACTAAAACTAAAAATGAATTGGTACGCAAGGATGCCGCAATGGGTATACGTGGCTCACTGTTTCTACCTGTTAAGCCTCATAAAGCCGTGAATAATATTCGTAAATTCAACAAGTCTACAGCGCCTCGGTATAAATTTGATGACAATATTTCTTTATCTAAAAGACCCATGGGCCCTCCATCAGGACCTGCACCGAGTATCAAACGAAATGGGATAAGAACAGCTAAATCTACCAGCACTTCAGTTGTGGACGCTAGTGTTATACTGGCTATATCTGAAGGTCGGGGGATGGCTCGGGGTGAGATTGGTATAGCAGCTGTGGATTTGAGACGTCCAAATCTGATTCTCTGCCAGTTCAGTGACACGCTCCTATACACACACACACTCACCAAGATAAACTACTTCAATCCCATTGAAATAATAGTACCTCACACATTCTGCGAAGGTGCCCAAACCAATCAGCTGTACCAGTTAATAAAGGATCACTTCCCATTAGTTAACGTAACAGCTGTTCAGAGGAGACATTTCAACGACGCAGCCGGGAGGCAGAACATACACACGCTATGCGCCCCTCAGTACAGCGCTGTATATCTCCTTGTTATACATAAATTTTATGCCCTAACAGCTGCCGCGGCTGTTCTGAAGTACGTGGAATATATACAGTGTATTGTTTTCGCTAGAGAATCTCTGAAGATCGAGTATCACTCGTCCGAAAACACTATGATTATTGGTATCACATATATAATACATTTTCATATATATCAGTATATTTTGCTACAAGTGCCGCCATCAGCCTTCGAAAACCCACACTACGAAGAAATATCTAATCGAATTAGGACAGTCATACAAGAAGATGCACATTTAGAAAAAGGCGCTATGGGAAGCATGCAGAGATGCTTTGCGGTCAAACCAGAAATCAACGGCCTCTTGGATGTAGCTAGAAGAACGTACTCAGAGCTCATTGAGGATATACAAAAAATCGTGGAACAACTAAGCGAGACGTACGACCTCCCGTTGAGGCTTAACCAGAACGTCATGAAGGGTTTCCATATAGTGTTGCCAGTCGCACCCAAGAATAGGAGACAATTTAACGTTGAAGAATTACCATCTATATTCATACAGGTTGTATTCAATGGAGCTAGTGTTACAATGACCACAGAAGAAATAGTTGTACTCGACCAACAAGCGAAGGAGTCGCTTAATGAGATACAAAAAATGAGCAACATGTATATAACTACAGACAGTTTTAATAACTTCATATTATTGTATTATTATATTATATTTATATTAATAAGTCGTGGTGGCCTTTCACTATTCATTATACCAAAAAGTATTACTAACGGAATATATATTTTTACTTTGGCTGTATATTTATTGACCATATTTATCCTACTATGA

Protein sequence:

>DPOGS207850-PA
MNVIQSDKRPFKSFTKTKNELVRKDAAMGIRGSLFLPVKPHKAVNNIRKFNKSTAPRYKFDDNISLSKRPMGPPSGPAPSIKRNGIRTAKSTSTSVVDASVILAISEGRGMARGEIGIAAVDLRRPNLILCQFSDTLLYTHTLTKINYFNPIEIIVPHTFCEGAQTNQLYQLIKDHFPLVNVTAVQRRHFNDAAGRQNIHTLCAPQYSAVYLLVIHKFYALTAAAAVLKYVEYIQCIVFARESLKIEYHSSENTMIIGITYIIHFHIYQYILLQVPPSAFENPHYEEISNRIRTVIQEDAHLEKGAMGSMQRCFAVKPEINGLLDVARRTYSELIEDIQKIVEQLSETYDLPLRLNQNVMKGFHIVLPVAPKNRRQFNVEELPSIFIQVVFNGASVTMTTEEIVVLDQQAKESLNEIQKMSNMYITTDSFNNFILLYYYIIFILISRGGLSLFIIPKSITNGIYIFTLAVYLLTIFILL-