Monarch geneset OGS2.0

DPOGS206445
Transcript: DPOGS206445-TA, 3774 bp
Protein: DPOGS206445-PA, 1257 aa
Genomic position: DPSCF300070 - 329246-341020
RNAseq coverage: 170x (Rank: top 51%)
Annotation
Heliconius        HMEL014001        0.0      70.45%
Bombyx            BGIBMGA005449-TA  0.0      57.44%
Drosophila        rad50-PD          1e-165   32.27%
EBI UniRef50      UniRef50_Q7QAJ3   6e-164   30.80%   AGAP003676-PA n=6 Tax=Culicidae RepID=Q7QAJ3_ANOGA
NCBI RefSeq       XP_002092422.1    7e-165   31.74%   GE14184 [Drosophila yakuba]
NCBI nr blastp    gi|224068135      3e-166   31.34%   PREDICTED: RAD50 homolog (S. cerevisiae) [Taeniopygia guttata]
NCBI nr blastx    gi|224068135      0.0      30.78%   PREDICTED: RAD50 homolog (S. cerevisiae) [Taeniopygia guttata]
Group
Gene Ontology     GO:0006281   3.2e-09   DNA repair
                  GO:0005524   3.2e-09   ATP binding
                  GO:0008270   3.2e-09   zinc ion binding
                  GO:0004518   3.2e-09   nuclease activity
KEGG pathway      tgu:100221910   4e-167
                  K10866 (RAD50) maps-> Non-homologous end-joining
                                        Homologous recombination
InterPro domain   [626-667]   IPR007517   3.2e-09   Rad50 zinc hook
Orthology group   MCL10921   Multiple-copy universal gene

Nucleotide sequence:

>DPOGS206445-TA
ATGGCGGGAATCAAATCATTAGCAGTCAGAGGGATTCGAAGTTTTGGACCTGAAGAATGTGATGAACAGCGTATTACATTTGAGAAGCCCTTAACATTAATTTTGGGTCAAAATGGTTGTGGTAAAACAACAATTATTGAATGTTTGCGATATGCTATCACAGGGCAAATGCCTCCCGGTAGTCGCTATGAGTGTTTCGTTCATGATACAAAGGTGAATAGATCAGTAGAAGTCATGGCTCAAGTAAAATTAAAGATTGTAAATGCAAAGGATAAATTATTGGAGGTTTCAAGGTCTATGAAGGTTACAGCGGTTAAAAACAAAAAGCCTAAATTTCAGACCTTAGATTCATTTCTTTCAGTGGATGATGGAAGTGGAAAAACAAAGGACATTTCATCTAGATGTGCAGACTTAGACTTTGTTATGCATGAGGAACTAGGTGTATCAAAAGCCATATTAAATTCTGTAATATTCTGCCATCAAGAGGACTCTAGTTGGCCATTGGATGAAGGGAAGAAGGTGAAAGAAAGATTTGATGAAATCTTTGATGCTGATAAGTACAGTGATTGTTTTGATCGCCTCCGAAAGATTCGGAAGGAGTACGAGCATGAAATTAAATCTCTTGGTCAGCAAGTTTCATATTGGACTGAGAAAAAAGAAGATCTTGATAAGAAAAAATTAGATCTGGTTAACACTAAAACTAGAATGTCTGAAGCTGAAGAGAAAATTTTGGAACTCAGCACTGAATTGAGACCAATATCAGAGAAACTCAACGCCATTGAAACCTTGCAAAAAAACCTAGTATCATTTGAATCGGCCAGAGAGAAAATTAAAAATAGATTAGAACACCAACAAGATTCTGTGAAGGAATTGATGAAATCCATTGATAAATTGTATGAAGGTACAACTGAAGAACTGCATGAAAGATACACAAACTATGGTGCCACCATAGAGGCGAAACACACAGAACTTGATAAATCATACAAACAAAACTTTTCGTTCAATAAAGAGGAGGAGCGTATTGCAAATGAAAAGACTAACAATGAAGTTAAATTCAATAAATTGATTCTTTTGGAGAGTCAAAACCAAGAAAAGATCGATAGAAGGAACAATATGGTATTGGATACAGCCAAGCTTGCTGATGTGGAAGTCCAAAAAATAGAAACAGATGGTGAAGCTGATCAGTGTAAAGCTGCTGTGATGGAAAAAATCAAAGATTTAATGAGAAAATTGGAACAACACAAGTTAGATGCCGACAAGGCAGAGAAAGAATATCAAAAACATGTCGATGACAGTAGAGATGCACTTTCCCGCCACAAACAAAAAATATCAAACAAAGAAACAGAAATACAAACAGTGAAGAAAGAAATAATGAAGATGGGGCAGCAAATAACCGACGCTAATAAATCTAAACAGAGATTAGAAAAATTGGACGCAAAATTAAAAACCGCTGAGGATACAATATTGAAAGAAAAGGATATGATCGAGGAATCTTTGAAACAGAAAGAGAAGCAACTTAATGTACTGAAGAACAAACATCGCTCTGCCATCACCGAGCTGTTGGGGAAAATGATTGAAAACAATTTTGCAATATCTATCAATCAGTTTGAATGTCAAACGAGGACCGAGCTGGAAACCATTAAGAAGAATATATCCGAGAAACAAAATGAGATAACTCGTGTGGAGACGGATCGCGATCACATCAATCGGCAGCTCCGCGAACGACGCGAGGAACTTTCCGCCGCTGAAGACCGGATGTATCGCGAGTGTGGAGCGCAGACTTACGACAACACACTCGCTAAGATCACAGCCACCGTAGACAAGCTACAGGATGAACAAAACGTACTGCAGTCCTCAATGTTCATTATAACGAAATACAAAGGTCAAATAAAGGACAATAACTGCTGCCCCTTATGCAGCCGCGGCTTTGATAATGAAGATGAGGTGAATGACCTCATATCCCAACTAACAACTCAAGTGATGAACGTGCCGGCTAAACT
AGAGAAGGTAACAGAGGAGTTGCAGAGGACTTCCGCTAAGAAAGATAACTTGCTAAGTATGAGGTCTTTGAATGAGAGGATTGTGGTCTTGAAGGAAAAAGATATTCCGGACTTGGAGAAACGATTAGTTGAGGCGGATAAGGTGATAGCTAGTTTAACAGAATCAGTTGATGACTTGACAATGCTCTCGAAGGAGCCAGAACAGAAAATGTCAACTTTGCGTCAGATACAGGGAGATATGCCACTGTTGGATAAATTTACCAACGAAATTAGAAGTAGTAAGAAAGACCTCGAATCTGTAAAAGCAAAATGTGCCGATTTTGAGTGCGATATTTCTTTGGACACCGCTACCGTGAAACAAACTGATTTGAGACAGAAGATCGGGATTTTGAAGACCAAGATCAAAAGTAACCAAACGAAATTAAACGAACACACCAAGAAGATACAAAAGCAAGCAGAAGAGAAGAATAAACTTAAAGAAGAACTTCTTAATATACAGAAAATGGTTCAAGAATTGTACAATTTACAAGAGACCCTAAAGCAAATGGAATCTAATAAAGAAAAATATTCGACGGAATTAAAAGAACTGGAAAATTCAACAGAAAAACTGGAAATGGAACTGAAAGAGAAAGAAAAGGCTAAAAATAATGCCGTAACGAAAAATAGGCATGAGATTCAAGAAGCGAGTACATATTTAACAAGAGTAAGCAACGCCTTTGATAAGATAAAAGCAATAGACTCTGAGATCAAGCAACACAAAGATAGGAACATCCAAAAAGAAATGGATCAGATAAAAGAAGCCAACGACAGGCTGAACTGTCGCCACAAACAGATCATTAATGACAGAGACACACTCACTAAGAAAATTGACAGTTTGAAAGATGAAATTGCGAAACAGGAGATATACAAACGTACATTAGAAGACAATATAAAGCTACGAAAAGCGGAAACGGAAATAGAATCCTGTAATAAAGAGCTCCTTGAAATAAACGATAAGTTAAAAGGTGTGAACACAGACATGATCTCAGAAAAGGAACCGCTGATTATGAAACAAACTAAGATATTTAGAGAGAAAGCGCAAACAGAGGGACAATTGGAAGAATTGAAGAAAGTTTACAAACAAAATCAATTGGAACTTAAAAAAGCTCAAAATCAAGAGGTTGAAAAGAAATACAAAGAGAAACTCTATGAATTACACGTTACGAAAGCTATAGATGCAGATATTCGAGATTATTCTATAGCCTTGGATAAATGCCTTATGGAGTTCCATAAGGAGAAAATGGAAAATATCAATCTCATTATAAGAGAACTTTGGAGGAAAATATACAGAGGCAATGATATAGACTATATAGAGATAAAGACAGAGGGAAGTATGTCAGCAGAATCTGAACGCCGAAAATATGATTACAGAGTTGTTCAATGTAAGAACGGCGTGGAGATTGATATGCGTGGAAGATGCAGTGCTGGTCAGAAAGTATTGGCATGTCTTATTATAAGACTTGCGCTGGCAGAAACGTTTAGTTCAAGGTTTGGTATACTAGCTTTAGATGAACCTACAACGAATTTAGACCAGGAAAATGTAGTAAGTCTGTGTTCTGCCCTCGGTGATATAGTACAAGAAAGGATGTCGCAGAGAAATTTCATGTTCATTATCATAACTCATGACAGAGAATTTATTGATACCTTGGGCAATATTGATAAGGTAACTCACTACTATGAAGTGTCCAGAAATGAAGAAGGTAAATCAAGAGTTAAGAAGATTAGATTCACATAG

Protein sequence:

>DPOGS206445-PA
MAGIKSLAVRGIRSFGPEECDEQRITFEKPLTLILGQNGCGKTTIIECLRYAITGQMPPGSRYECFVHDTKVNRSVEVMAQVKLKIVNAKDKLLEVSRSMKVTAVKNKKPKFQTLDSFLSVDDGSGKTKDISSRCADLDFVMHEELGVSKAILNSVIFCHQEDSSWPLDEGKKVKERFDEIFDADKYSDCFDRLRKIRKEYEHEIKSLGQQVSYWTEKKEDLDKKKLDLVNTKTRMSEAEEKILELSTELRPISEKLNAIETLQKNLVSFESAREKIKNRLEHQQDSVKELMKSIDKLYEGTTEELHERYTNYGATIEAKHTELDKSYKQNFSFNKEEERIANEKTNNEVKFNKLILLESQNQEKIDRRNNMVLDTAKLADVEVQKIETDGEADQCKAAVMEKIKDLMRKLEQHKLDADKAEKEYQKHVDDSRDALSRHKQKISNKETEIQTVKKEIMKMGQQITDANKSKQRLEKLDAKLKTAEDTILKEKDMIEESLKQKEKQLNVLKNKHRSAITELLGKMIENNFAISINQFECQTRTELETIKKNISEKQNEITRVETDRDHINRQLRERREELSAAEDRMYRECGAQTYDNTLAKITATVDKLQDEQNVLQSSMFIITKYKGQIKDNNCCPLCSRGFDNEDEVNDLISQLTTQVMNVPAKLEKVTEELQRTSAKKDNLLSMRSLNERIVVLKEKDIPDLEKRLVEADKVIASLTESVDDLTMLSKEPEQKMSTLRQIQGDMPLLDKFTNEIRSSKKDLESVKAKCADFECDISLDTATVKQTDLRQKIGILKTKIKSNQTKLNEHTKKIQKQAEEKNKLKEELLNIQKMVQELYNLQETLKQMESNKEKYSTELKELENSTEKLEMELKEKEKAKNNAVTKNRHEIQEASTYLTRVSNAFDKIKAIDSEIKQHKDRNIQKEMDQIKEANDRLNCRHKQIINDRDTLTKKIDSLKDEIAKQEIYKRTLEDNIKLRKAETEIESCNKELLEINDKLKGVNTDMISEKEPLIMKQTKIFREKAQTEGQLEELKKVYKQNQLELKKAQNQEVEKKYKEKLYELHVTKAIDADIRDYSIALDKCLMEFHKEKMENINLIIRELWRKIYRGNDIDYIEIKTEGSMSAESERRKYDYRVVQCKNGVEIDMRGRCSAGQKVLACLIIRLALAETFSSRFGILALDEPTTNLDQENVVSLCSALGDIVQERMSQRNFMFIIITHDREFIDTLGNIDKVTHYYEVSRNEEGKSRVKKIRFT-
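
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, assuming the standard genetic code (NCBI translation table 1) and the record's convention of writing the stop codon as "-". The names translate and lengths_consistent are hypothetical helpers introduced only for illustration.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    in the protein record (an assumption about the record's convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """Check that transcript length equals 3 * (residues + one stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


# First five codons of DPOGS206445-TA give the first five residues of -PA.
print(translate("ATGGCGGGAATCAAA"))    # MAGIK
# Last four codons, including the TAG stop, give the end of -PA.
print(translate("AGATTCACATAG"))       # RFT-
# 3774 bp corresponds to 1257 residues plus one stop codon.
print(lengths_consistent(3774, 1257))  # True
```

To check the full record, join the wrapped lines of the nucleotide entry into a single string before passing it to translate, then compare the result with the protein entry.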