Monarch geneset OGS2.0

DPOGS205300
TranscriptDPOGS205300-TA1839 bp
ProteinDPOGS205300-PA612 aa
Genomic positionDPSCF300021 + 1036738-1038831
RNAseq coverage35x (Rank: top 74%)
Annotation
HeliconiusHMEL0162132e-13042.95% 
BombyxBGIBMGA011042-TA2e-8654.41% 
Drosophilamus81-PA6e-4236.77% 
EBI UniRef50UniRef50_UPI00021A77CA6e-6431.70%UPI00021A77CA related cluster n=2 Tax=unknown RepID=UPI00021A77CA
NCBI RefSeqXP_967772.18e-5635.32%PREDICTED: similar to MUS81 endonuclease homolog (yeast) [Tribolium castaneum]
NCBI nr blastpgi|3407210452e-6331.70%PREDICTED: crossover junction endonuclease MUS81-like [Bombus terrestris]
NCBI nr blastxgi|1607732625e-6328.79%Mus81 protein [Danio rerio]
Group
Gene OntologyGO:00036773.7e-28DNA binding
GO:00062593.7e-28DNA metabolic process
GO:00045183.7e-28nuclease activity
GO:00038241.9e-08catalytic activity
KEGG pathwaytca:6561302e-55 
 K08991 (MUS81)maps-> Homologous recombination
InterPro domain[348-497] IPR0113351.4e-29Restriction endonuclease, type II-like
[349-502] IPR0208193.7e-28DNA repair nuclease, XPF-type/Helicase
[352-450] IPR0061663.9e-17ERCC4 domain
[15-92] IPR0109961.9e-08DNA-directed DNA polymerase, family X, beta-like, N-terminal
Orthology groupMCL14002 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205300-TA
ATGGCTGTTAATAGTAAAAGAATAACTTTAAAACGAACCAGACCAAATCCATTATTTCAACGATGGCTTCAGGAGCTACAGGATGAGGCAAAGCTTGAATTGAACAATTTAGAATATTCATTGGATGAAGCTATCAGTTCATTATCCAAGTACCCTTTACCGTTAGAAAGTGGCGCGGAGTGCGCTATACTAAAAGGTTTCGATAATAAACTATGCTCTTTCCTCGATAAGCGTCTACAAGCTTACACAAGCTCAAATAGATTGGATAATAATTCTTCCAAAGTTACTAGCACAACCCTACCTGAACTAGAGCCAATAGATATCAGTTCTAAAAAAGCCAAAAACAATTCCATTACCAAAGATGACTTTGAGCAACAATCTGCAAGAAACATTTTAAAGTGCACCAAGCTTACTACTGATGTGCCTTGTGGAACAGATGACTTATCTGACAATGAAGTACAAGAAGTTCAAGAGGTACCATCCCAAAGTAATGTGCTAAAAGGATCTCGAAAAGATTACTGTTCTAATTTAATTCAAGCTAACTACAGCTCTCTGAGTCCTGAATTAGAAAAATCACTTAGAGGAAGGGAAAGGAAATTAAAATACAAACCTATCTACAAGTCTGGTAGTTATGCTATAGTGATGGGTCTCTGGGAACATTCAATAGTGAATTCAAAGCAGGGTATTAGTAAGATGGATTTGCTAGAACTAGCCCATAAATACATTGAGAGTTCTATGAAAAACGCTTCAGATGCTTTACATAATTTGTTATGGGCAAATATGAATAACCTTGTATCAAAGGGCCTTGTAACGAGAAAGAATGGAGAAACTCCAGTATTTAAATTGACAAAATTGGGTATTAAAACTGCAAAGGTACTGTATAAAGAATACAAAATTAGAGAAAAACCTAAAGTTACAAATTCGAAACAACCCGAAGAAGAAGATTGTGACGGATCTAAAAATAATGTTCCAATTAATTCCAGAAACCGCATTGACACTTGTAATACAGAGGTTGAAGAAGTAGTGGAATTTGAAGCTGGTTCATATGATATTATTTTATTTGTAGATGTTAAAGAAACTTCTGGCTTAGCTAAGAAGAATGATCCTCTGATGCTCCAAATGAAGAAATATCCTAATCTTCAACACGAGTTTAGATCTTTGAGTGTCGGTGATTTTGCATGGATAGCAAGGCACAGGTTAAGTAAAGAAGAATTAGTGCTGCCTTATATAGTTGAGAGGAAAAGAATGGACGATTTCGCTAATAGTATAAAAGATGGTAGATATCATGAACAGAAATTCAGATTGAAGAAAAGTAAAGCAAAAGTTGTTTACTTGGTTGAAAATTATGATAGTAAATATGTTGGTTTGCCCTATCAGACGTTAATGCAAGGATTGGTCAATACGAGAATTAGGGATGAGATTCAGGTACATCGAACAGATTCATTGGCGGCTACTGTTAGATTCTTAGCCATTCTGACAATGAAAATAATTAACGAATATCAGAATTGTTCTGTTAAGGGTCACCACAAAATGGCGGAAGGCGACATGTTGATGACATTCAATTATTTTAAAAAAGCTCTCGTAAAAAATAAACCTTTGTCTTTGAAATGTACCTTTATAAAAATGCTTTTACAACTACGAGGATTAACAGCAGATAAAGCTGTGGCTATAACTAATGAATACGGTACGCCAAAATTATTAATGGATGCATATGAAAATTGTGATAAAAAAAAAGGTGAACTGTTGTTAGCTAATATTAAAGGCAAGAGTAAACGTAATGTAGGACCTCGTGTAAGTAAAAAACTGTACAAATTGTTTACATTGAGAGAATTAACGTAA

Protein sequence:

>DPOGS205300-PA
MAVNSKRITLKRTRPNPLFQRWLQELQDEAKLELNNLEYSLDEAISSLSKYPLPLESGAECAILKGFDNKLCSFLDKRLQAYTSSNRLDNNSSKVTSTTLPELEPIDISSKKAKNNSITKDDFEQQSARNILKCTKLTTDVPCGTDDLSDNEVQEVQEVPSQSNVLKGSRKDYCSNLIQANYSSLSPELEKSLRGRERKLKYKPIYKSGSYAIVMGLWEHSIVNSKQGISKMDLLELAHKYIESSMKNASDALHNLLWANMNNLVSKGLVTRKNGETPVFKLTKLGIKTAKVLYKEYKIREKPKVTNSKQPEEEDCDGSKNNVPINSRNRIDTCNTEVEEVVEFEAGSYDIILFVDVKETSGLAKKNDPLMLQMKKYPNLQHEFRSLSVGDFAWIARHRLSKEELVLPYIVERKRMDDFANSIKDGRYHEQKFRLKKSKAKVVYLVENYDSKYVGLPYQTLMQGLVNTRIRDEIQVHRTDSLAATVRFLAILTMKIINEYQNCSVKGHHKMAEGDMLMTFNYFKKALVKNKPLSLKCTFIKMLLQLRGLTADKAVAITNEYGTPKLLMDAYENCDKKKGELLLANIKGKSKRNVGPRVSKKLYKLFTLRELT-