Monarch geneset OGS2.0

DPOGS204693
TranscriptDPOGS204693-TA2121 bp
ProteinDPOGS204693-PA706 aa
Genomic positionDPSCF300170 + 426195-428694
RNAseq coverage120x (Rank: top 58%)
Annotation
HeliconiusHMEL0082530.057.92% 
Bombyx% 
DrosophilaKu80-PA3e-1920.71% 
EBI UniRef50UniRef50_B0WVS59e-7828.99%Ku P80 DNA helicase n=3 Tax=Culicidae RepID=B0WVS5_CULQU
NCBI RefSeqXP_001861497.12e-7828.99%ku P80 DNA helicase [Culex quinquefasciatus]
NCBI nr blastpgi|1700508493e-7728.99%ku P80 DNA helicase [Culex quinquefasciatus]
NCBI nr blastxgi|1571376791e-7629.12%ku P80 DNA helicase [Aedes aegypti]
Group
Gene OntologyGO:00054882.8e-56binding
GO:00036771.4e-26DNA binding
GO:00063031.4e-26double-strand break repair via nonhomologous end joining
GO:00040031.4e-26ATP-dependent DNA helicase activity
GO:00168174.2e-16hydrolase activity, acting on acid anhydrides
KEGG pathwaycqu:CpipJ_CPIJ0114525e-78 
 K10885 (XRCC5, KU80, G22P2)maps-> Non-homologous end-joining
InterPro domain[213-520] IPR0161942.8e-56Spen Paralogue and Orthologue SPOC, C-terminal-like
[222-427] IPR0061641.4e-26DNA helicase, ATP-dependent, Ku type
[420-520] IPR0051605e-20Ku70/Ku80 C-terminal arm
[10-167] IPR0051611.8e-16Ku70/Ku80, N-terminal alpha/beta
[561-669] IPR0148934.2e-16Ku, C-terminal
Orthology groupMCL15286 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204693-TA
ATGGCGCCAACGAAAGTAGATCAAGGTACTATCATAATTTTAGACGTTGGAAAAAACGTTTCAATATTAGAAGACAAAAATCAGAAAAGTTTTTTTGAAAGTGCAAGAGAATGCGCCGTTCGTATTATAGAGAGAAAAATTTTAAGTCAAGGTAAAAACTTGCTGGGCATAATATTACTCGGATCCAAAATAAGCAAAAATAATCTGTCCGAACAAACGCCGGGTTGTTGTCGGAATATTGAACTTTTAGCGGAACTGCAGTACCCCACCTGGAAAATGATACGAGACTTACCTACACAACCGACAAAATCGACTGGCAATTGGTTGGATGCGCTTATAGTAGCTGTAGACCATTTCAAAAGTCATACATCAAGTTTTAAAATTGCTGATAAAAATATAATTCTGTTGACCAACTTTGAAGCACTATCTGACTTAGAAGAAAGTGATATTGAAACGGCCATTTCGGGCTTTCAAGAAGATGGCTTTGAATTGGATGTGATCGGTCCAGAATTATACAATGAAGACAACAAAAACTCTGATATAGATCTAGCAAGACAATTTGTTGAAGGAACGAATGGCAGCACAGCCACTTTTGACTATGCCATGAGATATTTACTATTCCACAAAAAGAAAACCGTTAATTCTAATCCATGGAATGTCGATTTAAGTATTGGACCGAACATTAAAATACCAGTTTCCGCATACATCAGAATTAAAGATGAACCAGTTGTCAAAAATTTTAACAAGTCTGTTAGAAATCCCGTCACAGAAAAATCAAGTGCTACTGAATATATTGAGAGAAAGAAGACATTCATTAACACAGAAGCTCAAATGGAAGTGGAGTCTACCGAGGTTATTAAGGGTTATCAGTACGGGGAACAGGTGATACCATTTTCAGATTTTGATAAAAGTATGATTTATGATGCTGGAAACAAATCACTCAATGTATATGGCTTTACAAAATCGGGTAATATTACTTGGCAGAACTTGAATGGAGATGGACTATATTATGTTTTTGGACAAAAAGGAGATAAAAAGAGTGAATATGCCGTCAGATGTCTCGTAGAATGCCTTTTGGAGTTGGATTTAGTGGCTATTGTTAGAAGAGTGTATAATAATGGCAATGCTCCAAGAATGTTTGTATTAATGCCGGTCATTGATTCAGAAAATTTTGTTGGTCTTTCAATGGCTGGTTTATGTTACAAAGAAGAAATTAAGAGCATGGCTTTTCCGGCCACTAACCTCAAGAAATATAATTGTTCTGAACAACAGGTTGAAGCATTTAAGGAGCTTATTAAAGCTATGGATCTCACCAAAGCATATGATGAATCAGATTTTGATGATACGGAAGCATTCCCTATAGCAAAGGTTGTTAGTCCATCGGCACAATATATTCTTGACTGTATAGCTTTCAGAGCAATGAATCCTCACCAGCCCTTGCCACAGCCCAGAGATGACATCATGGTGCTGTTTAAAGTGCCTCCTCTTATAGAAAAAAGGGCTAGAGATCCTATGGAAAAGTTAAAAGAGCTTTTTGAATTAAATAGAGTTGAAGTTAAAAAGCCGAAAAGAAAGACAGTTCCTATGGATATTGATGAAAAACCAGGCACTTCTAGAGAACCAGAAATTTCTGATGATATGCCGAAAGTAAACCTTAATGTTGTTAAAAGACCTGATATTATAATTGGAACATTAAATCCTATAAATGATTATGAAAAGCTTAAGAATGAGGGTCGGACGATATGTGATCTGTACAAACAAATGATAGAAGCTATTGAGAGTCTAATACACGGCAACATCGATGGTGACTTTACCAAAGCATTGGATGCAATGGCCTATCTCAGATCAGAATCATGTAAGTCGGATCCGTCATATTATAATAATTGGATAAAGAATTTCAAACTCGATCTAATAGATCGGAAGCAGAATAAAGTTTTGCATTTGATAAGTGAGGAAAATTTGAGCTACATACTTAAAAGTGAGAACAACTTGAGTAATTTCGCCTCCAATGTGAGCGATGAAAGTCAGATGTATGAGAATGACACAGTTCCAACATTAACTCAAGTCAACATAAGTTCGGAAGTCGACAACATGTTTGATAATATGTTTGATGATATGTGA

Protein sequence:

>DPOGS204693-PA
MAPTKVDQGTIIILDVGKNVSILEDKNQKSFFESARECAVRIIERKILSQGKNLLGIILLGSKISKNNLSEQTPGCCRNIELLAELQYPTWKMIRDLPTQPTKSTGNWLDALIVAVDHFKSHTSSFKIADKNIILLTNFEALSDLEESDIETAISGFQEDGFELDVIGPELYNEDNKNSDIDLARQFVEGTNGSTATFDYAMRYLLFHKKKTVNSNPWNVDLSIGPNIKIPVSAYIRIKDEPVVKNFNKSVRNPVTEKSSATEYIERKKTFINTEAQMEVESTEVIKGYQYGEQVIPFSDFDKSMIYDAGNKSLNVYGFTKSGNITWQNLNGDGLYYVFGQKGDKKSEYAVRCLVECLLELDLVAIVRRVYNNGNAPRMFVLMPVIDSENFVGLSMAGLCYKEEIKSMAFPATNLKKYNCSEQQVEAFKELIKAMDLTKAYDESDFDDTEAFPIAKVVSPSAQYILDCIAFRAMNPHQPLPQPRDDIMVLFKVPPLIEKRARDPMEKLKELFELNRVEVKKPKRKTVPMDIDEKPGTSREPEISDDMPKVNLNVVKRPDIIIGTLNPINDYEKLKNEGRTICDLYKQMIEAIESLIHGNIDGDFTKALDAMAYLRSESCKSDPSYYNNWIKNFKLDLIDRKQNKVLHLISEENLSYILKSENNLSNFASNVSDESQMYENDTVPTLTQVNISSEVDNMFDNMFDDM-