Monarch geneset OGS2.0

DPOGS206564
TranscriptDPOGS206564-TA3129 bp
ProteinDPOGS206564-PA1042 aa
Genomic positionDPSCF300108 - 580346-592142
RNAseq coverage319x (Rank: top 36%)
Annotation
HeliconiusHMEL0118372e-14033.61% 
BombyxBGIBMGA013799-TA0.039.03% 
DrosophilaSmc5-PE2e-11729.93% 
EBI UniRef50UniRef50_D6WRS74e-15533.40%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WRS7_TRICA
NCBI RefSeqXP_975667.17e-15633.40%PREDICTED: similar to structural maintenance of chromosomes 5 smc5 [Tribolium castaneum]
NCBI nr blastpgi|910874051e-15433.40%PREDICTED: similar to structural maintenance of chromosomes 5 smc5 [Tribolium castaneum]
NCBI nr blastxgi|910874051e-15933.14%PREDICTED: similar to structural maintenance of chromosomes 5 smc5 [Tribolium castaneum]
Group
Gene OntologyGO:00055244.9e-16ATP binding
GO:00056944.9e-16chromosome
KEGG pathwaypic:PICST_304605e-62 
 K01553 (E3.6.4.1)maps-> Purine metabolism
InterPro domain[15-997] IPR0033954.9e-16RecF/RecN/SMC
Orthology groupMCL13040 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206564-TA
ATGTCTAGAATAATAAATAAAGGAAGTTTCAAACCCGGTAGTATATATAGAATAGCGCTAGAAAACTTTGTAACCTATAAAGAGGTGGAATTTTATCCAGGCAAATCATTGAATTTAATAATCGGACCCAACGGAACTGGGAAATCAACATTTGTTTGTGCTATAATACTAGGCCTCTGTGGAAATCCTAGGGCTATTGGTAGATCGAAAAACTTGGAAGGTTTTGTCAGACAGGGGTGTGAGAGAGGATCTATAGAGATAGAGTTATATAATAAACCTGGTGAGAGGAATATCATTATAAAAAGGACACTGGATGCAAAGAAATGTTCATCAATATGGAGCTTGGATTACAAAACTGTAACGGAGAAAAGAGTTCAAGAAATTGTTAAATCACTTAATATTCAGGTCGAAAATCTCTGTCAATTGCTACCACAGGATAAAGTCCACGATTTTTCTAAGTTGAATCCCAAAGAACTATTGCATAGTACATTGACAGCTATCGGGGACTTTGATAGTATTAAAGATTGGGACAAATTGATCAAGCTGCAAAATGATCAGAAGGAATTGACTTCGACTCTCAAAAATGGCGAAACAAAACTACAAGAGGAAAAGAGAAAAAACCAAGGATTAAAAGAAGTGATCGATGCTATGAATCAGAGGAAAGCTATAAAGAGGGAGATAAAGATTTGTGAGAAGAAGCTGTTGTGGGCCGAGTACAAGGAGCTGTATGATGCTGTGGAGGAGATCAAGAGGCAGCAGGTCGAGGCCAAGAGAGTGGTTGAGGAGAATAATAATGTCATTGAACCTATGAAACGTGAACTGGATGCGATGAAGCAGCGTATCGGTGTGTTGGAGAGCGGGAAGCGACGGAGTATTGAAAAAATTCGTGATCTCAAAGCTAAACTGCAAGAGACGATATCGACTTTTGAAATACACGAATCGAAGCTGAACGGGATAGACAGAACGTTTCAAGAGAAATATGACGCGCAAAGGAATATAGAGAGAGAATTGACGGAGGCTAGGATCGAAGAGGAAAAACTACAATCGGATAAGAGGGAGCTGGAAGAGAAGGGTGGGAACGAACAGAGCTTGATATTGGAATTACAAAAATTTGAGAAAGAAAGGGCTATAATAAATGCAACGCTTGAAACATATAGGAATAGCAGAGGGCGCCAGTTTTATCCGTTGGACAACGAAATGAGATCGCTCACACATAAGATAAAGAGTTTGGAAAACGTTGAAAGGGGCCGTCTCGACAAACTGAAGACTAAACATAGAGATACATATAAAGCGTGGGTCTGGCTGAAAGAGAATATGCACGAGTTCAAACACCCTGTATATGGACCGATGATGCTTAACATTAACTTCAAAGAACCAAAATTCGCACGTTATTTAGAATCCACGGTGCCGGTGAGGGATTTGAAGGCTTTTACGTTTGAATCCAAAGAGGACATGAACAAGTTCAATAAGATAGTTCGAGAGGAGTTAAAATTAAGACAAGTGAATGCTGTACACAGTGAAGGAGGAGATTTTGACATACGGCCCATAGATATAAGAAACTTGAGTTACTTGGGATTCTACACGTGTATCCTGGACACGATCTCAGCTCCAGCCGCCATCTTGCGCTACCTGTGTTCTGTGTATCGTGTTCATGATATACCGATTGGGAACAACCATACATTTGACAACGTCGAAAGAGTTCCAGACAAGATTAGGTTTTACTTCACAGAAAAGCATAGAATTAGTGCGCGAGTGTCGTACTACAAGGTGCGGTCGACTACGACCATAGAAATAAGAAACGCGGATCTGTTAGCGGACAGTGTTGATTATGAATATGTTAACGCTTTGAAATCTCGGTTATCTGAGGTACAAAAAGAGAAGACAAATTTGGAGAGTCAATACGAAGCGAGACTGAATGTAGAAGGAGATAAACTGAAAGAAATAGTTGGAAAAACAAAGGAGAAGACTGACTCGTTAGAGAAGATTAAATCCATAAATCTAAAAATACACTTCCAAAAGCAAAAAGTTCTTGCATTAGAAAGCGAACCGGCCATAAACATAGAGGCGGAAAAGAGGAAGTGCAAAGAAGATAAACAAGAGTGCGTCCACAAACAGTGCGCAGCGCAGAAGGAGATGTACAACATATTACAACACATACACGAGGAGACAGTCAACATGGAGAAGAATACGATTCACTTATCTGTACATAGAAACGAATTCGTTCAAAAGGAAGCCCAATATAGAAGATTAACAAGCGAGTTCGAGGCGGCCAAGACGATACTAGAAAACGTAAACAACGATATGAAAAGAGCGAGAACGAGAGCGAAGGAGAAGCTGGAACAAGCGAAGTCCAGCTGCGGAGACAAGATGATCAACGCGGACGACTTCCCGTACGCGGACGAGTTCAACGACCTGCCCTCGGACAGAGAACAGCTGCAAATGTACAGGAGCGAGCGAATGGCCAAAGTATCGCTCATGGACAAGGGTGACAACCAGGTACTAAAAGAATATGAGGATAGAGAGAGGGAGATAAGGAATCTTGAAAAGAAATTGAGTTCGTCGACCGACACCAAGAAAATGATAAGAGATGAAATTAAAACGATAACATCAAGATGGCTGCCACCGCTGGAAAATCTAGTGAGTGAGATCAGAGAGAATTTCTCATCCATGTTCCAGAAGTTGGGGTGTGTCGGTGACGTCATACTGTACAAGGGAGCCAATGATGAGGAGTTCTCGTGTTACGGGCTACACATCATGGTGCAGTTCCGTGTTGGTGAGCGTCTGCGACAGTTGACTAGAGACACACAGTCGGGCGGGGAGAGGGCGTTGTCTACGGCCCTATACCTCCTTGCTCTACAGGCAAGGGTCGCTGTACCCTTCAGATGTGTGGATGAAATCAACCAAGGAATGGATGCGAAGAATGAAAGGGACATGTTACAGTTACTGATCAAGGCGACCACTGAATCCGATTCCCAGTACTTCCTCCTCACACCAAAGCTGCTGCTAGATCTGGACTACAATGAGAAGACAACCATACATACAGTGATGAACGGCTGTCATATAATGAATTACAAGAAATGGAATATGAGTGAATTCCTTCAGAACGCTAACAAAATCAACCAAATATGA

Protein sequence:

>DPOGS206564-PA
MSRIINKGSFKPGSIYRIALENFVTYKEVEFYPGKSLNLIIGPNGTGKSTFVCAIILGLCGNPRAIGRSKNLEGFVRQGCERGSIEIELYNKPGERNIIIKRTLDAKKCSSIWSLDYKTVTEKRVQEIVKSLNIQVENLCQLLPQDKVHDFSKLNPKELLHSTLTAIGDFDSIKDWDKLIKLQNDQKELTSTLKNGETKLQEEKRKNQGLKEVIDAMNQRKAIKREIKICEKKLLWAEYKELYDAVEEIKRQQVEAKRVVEENNNVIEPMKRELDAMKQRIGVLESGKRRSIEKIRDLKAKLQETISTFEIHESKLNGIDRTFQEKYDAQRNIERELTEARIEEEKLQSDKRELEEKGGNEQSLILELQKFEKERAIINATLETYRNSRGRQFYPLDNEMRSLTHKIKSLENVERGRLDKLKTKHRDTYKAWVWLKENMHEFKHPVYGPMMLNINFKEPKFARYLESTVPVRDLKAFTFESKEDMNKFNKIVREELKLRQVNAVHSEGGDFDIRPIDIRNLSYLGFYTCILDTISAPAAILRYLCSVYRVHDIPIGNNHTFDNVERVPDKIRFYFTEKHRISARVSYYKVRSTTTIEIRNADLLADSVDYEYVNALKSRLSEVQKEKTNLESQYEARLNVEGDKLKEIVGKTKEKTDSLEKIKSINLKIHFQKQKVLALESEPAINIEAEKRKCKEDKQECVHKQCAAQKEMYNILQHIHEETVNMEKNTIHLSVHRNEFVQKEAQYRRLTSEFEAAKTILENVNNDMKRARTRAKEKLEQAKSSCGDKMINADDFPYADEFNDLPSDREQLQMYRSERMAKVSLMDKGDNQVLKEYEDREREIRNLEKKLSSSTDTKKMIRDEIKTITSRWLPPLENLVSEIRENFSSMFQKLGCVGDVILYKGANDEEFSCYGLHIMVQFRVGERLRQLTRDTQSGGERALSTALYLLALQARVAVPFRCVDEINQGMDAKNERDMLQLLIKATTESDSQYFLLTPKLLLDLDYNEKTTIHTVMNGCHIMNYKKWNMSEFLQNANKINQI-