Monarch geneset OGS2.0

DPOGS211920
TranscriptDPOGS211920-TA1545 bp
ProteinDPOGS211920-PA514 aa
Genomic positionDPSCF300011 + 141461-145054
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0177197e-11643.74% 
BombyxBGIBMGA001063-TA1e-14954.17% 
Drosophila% 
EBI UniRef50UniRef50_G7YVP81e-2826.09%HORMA domain-containing protein 2 n=5 Tax=Clonorchis sinensis RepID=G7YVP8_CLOSI
NCBI RefSeqXP_002579820.13e-2925.90%hypothetical protein [Schistosoma mansoni]
NCBI nr blastpgi|3582553155e-2826.09%HORMA domain-containing protein 2 [Clonorchis sinensis]
NCBI nr blastxgi|3582553154e-2826.09%HORMA domain-containing protein 2 [Clonorchis sinensis]
Group
Gene OntologyGO:00070675.9e-23mitosis
KEGG pathwaycgr:CAGL0M12771g5e-12 
 K12778 (HOP1)maps-> Meiosis - yeast
InterPro domain[29-231] IPR0035115.9e-23DNA-binding HORMA
Orthology groupMCL19719 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211920-TA
ATGACAGCAACAGCAACATCTGCTACACAAGCAGTATCAGAATGGGTGAAAGTTTTTCCTAAGCAGGTAACTGAGAATTATACAAGTTCTGTTACCTTCATGAAACAATTGACAGTGGTAGCAGTGAGTACTATAACGTACTTAAAGAACGCATTCCCAGAGGACAGTTACACAGTGGAAACTTTTGGTGGCGTCAAATTGCGTATTCTTAAGAAAAAATGTAGAAACGAACTAGCGCAGTTTTTAAGCACTGCTTTGACTCAAGCATTTGAAGCGTTCGATAAAAAGTTTCTCCACCAGTTGGCGCTGTGTTTCTACGAGGACGAGTGTAAGCCGGAGAATCTCATAGAGTATCATATATTCGAGTACTCGTACAATGACGACCGCGTCACTCTCGACCTACACTCTAAGAGCCGCCACAGCCGCGGTCGCCACAGTCGCCACACCGAGCACACGTTCGAGAGCGTTCGCGAGCGCACGGTTCACCTCATACGGGCTTGTGTTGTTATCATGCAGTCCTGTCAGACGGAGCTGCCCGCCGCGTACGACGTCAGTCTGCGACTGTACTACAACCAGGACGCCCCCTCGGACTACCAAGCCCCGGGGTTCCTCAGTACCGATGAATCCGAGGACCACCTGGAGCCCTATCTGGTAGACGCCATCAAGCTGGGCTGGGTGGAAACTCCTTACCATAAGCTCATAGCGAGATCCTTCATCAAGGAACACGTCCTCGCAAGCCACGAGGCTATCCCTTCACAGAACCGTCCGATACTATCCAACGAGGTTCAAGCTTCCGGCTCCAACATCGCACCTGACACCGATATCCAAATAGTCTGTCCCTGTAACAGACATGAAGAGGGATGTGACGTCAGCGAGCTGCTCAAGTGTCTGTTCTGCGGCACGTACCAGCACGCGTGTTGTTACGGCGTGTGTGACGTCACGGCCGCCGCGTCTCACTGCTGTGTGTCCTGTTATCAAACAGACACACACAGGACTCCCACTGATAAGAAGCTCGCCACTCTCACACACGCCAAGAGAGAGAGCCTGTGTATATTCCGCCGCACGTTGGAGTGGTGCGCCAGAATACCAAGCATAGATGGCGCGAGCATATCACAGAGGTTCCGCATATCAGAGACGAACGCCCACAAGCTGATGAGACTGCTGCACTCACACGGAGTGCTGCCCGAGGAACCCACCGATTCGAAAACGCCGCAGAAGATAATAACGGTGGCGCTTAATTCGGTCATGTCCAAATTCTTCAATATGAACAGAGATGTTGTGGAGCGGCTCCTGGCCGAAACCCTGCTGGAGGAGTCTGACCCGCTTTCGGGGGTCATCAGCCCCTTCGAACAGGTGAGCCTGACATCCGACGGCAATAAGACAGGGGTCACGACCCCTGACACAAACAGACACACACTGGATCAGTACAAACAAGCTTTCTCCGCCGAGGACGATGTGACGTTGGATGAGAACCGCACTAAGAGGAAGTTGAGCGAGAGGAATCTGAGGAAAGGGGTCCGCACCAAGAGGTCCAGACTCCAGTAG

Protein sequence:

>DPOGS211920-PA
MTATATSATQAVSEWVKVFPKQVTENYTSSVTFMKQLTVVAVSTITYLKNAFPEDSYTVETFGGVKLRILKKKCRNELAQFLSTALTQAFEAFDKKFLHQLALCFYEDECKPENLIEYHIFEYSYNDDRVTLDLHSKSRHSRGRHSRHTEHTFESVRERTVHLIRACVVIMQSCQTELPAAYDVSLRLYYNQDAPSDYQAPGFLSTDESEDHLEPYLVDAIKLGWVETPYHKLIARSFIKEHVLASHEAIPSQNRPILSNEVQASGSNIAPDTDIQIVCPCNRHEEGCDVSELLKCLFCGTYQHACCYGVCDVTAAASHCCVSCYQTDTHRTPTDKKLATLTHAKRESLCIFRRTLEWCARIPSIDGASISQRFRISETNAHKLMRLLHSHGVLPEEPTDSKTPQKIITVALNSVMSKFFNMNRDVVERLLAETLLEESDPLSGVISPFEQVSLTSDGNKTGVTTPDTNRHTLDQYKQAFSAEDDVTLDENRTKRKLSERNLRKGVRTKRSRLQ-