Monarch geneset OGS2.0

DPOGS215095
Transcript: DPOGS215095-TA, 4704 bp
Protein: DPOGS215095-PA, 1567 aa
Genomic position: DPSCF300139 - 420713-442658
RNAseq coverage: 66x (Rank: top 67%)
Annotation
Heliconius: HMEL022535, 6e-65, 35.70%
Bombyx: BGIBMGA009586-TA, 4e-41, 24.33%
Drosophila: (none listed)
EBI UniRef50: (none listed)
NCBI RefSeq: (none listed)
NCBI nr blastp: (none listed)
NCBI nr blastx: (none listed)
Group
Gene Ontology:
GO:0005488, 4.6e-08, binding
GO:0005634, 5.8e-05, nucleus
GO:0004677, 5.8e-05, DNA-dependent protein kinase activity
GO:0003677, 5.8e-05, DNA binding
GO:0005524, 5.8e-05, ATP binding
GO:0006303, 5.8e-05, double-strand break repair via nonhomologous end joining
KEGG pathway 
InterPro domain: [210-1409] IPR016024, 4.6e-08, Armadillo-type fold
Orthology group: MCL20506, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215095-TA
ATGGGGAAGTCTATGGACAGCGACCTTAGATCCTTAACGGGGCAACAGTACGCTCTCTTCTTCCTCAGGTGCGGGGGGAAGGGGTGCTCCGGCCAGGATGCTAGTTTCAAGGGACATTCCTTGCGTCTAAGTGAAAGGTTATCTCGCGGGTTGAGCTTGTTGTTGAGAAAATATGAAAAGATAAAAAAAATGGCCAGTTTGGACAAAAGATACAGGTTACAATATCATGATTTTCTGGTCAATATGAGGAAGTGGCATACACCGAACATGTCTCTGGCGTGCCGCGATCGTGTCACTGATTTATTTTTGAGGGTGAGTCACTGCCACCGCCGGGGCGAAGCTCCACAGGGCTCGGAGCCCGGCCCGGGGACCGCGGCGCGTACTGTGGCACCGTCATTGATGCAAATCATGGAGGAATGGGAAAAGTTTATAGATCACTACGAAATAAATAGTGATTGTAACAAGTTTCGTCTGGCATTAAAAAGTGCGAAAGTTGCTTTGGACGAAGACTTGGCCCCATTCGAATCAGAATGGATATTCTCAACACTCTTTCATAATCCCGTAAACTTGCTGGAGGTGGTGTCTAAGCGATGGTCAGATGAAGAGTACACTAAAAGCACAATGGATGCTTTAAAACTATTAGATGATGCTATTGTTAAATACGATAATATTGAAAAATACTATGAAGATATTGTACAGCTATGCCTTCTTCCATATCCGACGACTCAGCGCCACAATGCTCTGGCTTGTCTCACACAAGTGGCCCGAAGATCTGTTTGTGGAGCTAGACATTATTATAAACATGTCGCCGCTCTGCAGCAGGGAGTGTGTATGACGCCACTGGTGTGTCTAATAGGTGTTATATGTGAGTTTCACCCACACGTGGTGTCGGAGGATGTGGACAACATCTGGAGAGTGTTCCTCAATATACTGGACTCGAATAAATATTCAGGGACTGTTATGAAGGCGGTGCTGCAAGCTATCCTGCGCTTGTTCAAGAACTTTGGCGAGGATCTGCCTTCGGGGGAATTGATCAGGTTCTACGACCAGCTAGTGAGACACTTTGAAAGCTGTCAATCAGTGTGCATCGAGATTCTGACCCACCACGCCGGTCTATTCTTCACGTGCGTAACCCGCGACTCTCGCACGCGTGCCCACCTGTGGAAGTTGCGGGCGAGCGACGCGCTGGCAGCGGTCTATAAAGTCTGTGATAAGGAGGTCCTGCAGGAAGTGAAGCAATACGTCACATCGGCGGATTACCGGGAAAGGTACACCGCGTTCCGTATCCTGGGGGACGAAGCTCCCGGCCTTGAACTACAGGAATTGGAGTTCCAGCTCAGGAACGGGAACGTGGACTATGAATTGTGTGAAGGAATATCGTGGTGTATACAGACAAACACACCCTCGGTCCACCGTCTGCTCCACGCCACCATCGTTCTATACGAGTCCATACCAAAAACCAAACGACCTGAAATCCTCAAAGCCTTAATGGATTCACACACGGATATTGTTAAAGCCGCAATAACATTTATTATATCGGAAATTGTGAAAGATAAGAAGCTGTCTGTATGGACCGACATATTGATGAGTGAGGACAATCGGTTTGTGTTTGAGATCATAGAAAACTACATTAATATGACGCTGGAGGGTCCGGAACAGTCGGTGGAGGGGCCTCCCGTGAGCTCCCTGGTTCCTCTCCTCATGTCGCTGCCGCCGGTTCCTCTTAAGATCCGCCACCTCTCCGAGTCGTGTCCCCACACCACTGCGCTGCTCGGAGCATTCAGCGGTAGATTACAGAGTAATTCAATAAGCCGATGTACAGACGATGTGAAATTGAAATGCAGTTTAGTAGTTCTGGAGCTGACGGACCAACATACATATCCAGAATCAGAATTACTGACAGCCCTGAGGACAATTATCTCCGATAATGATACTGAGGCGGCCATTTTAAGTAAGGCTGTGAATGCACTGGACTCGTTTGTAATGAATCATTGTGTCGA
GGATTATATACTTGCTGAGATTATAAATAATTTGAATACTATAAGAAGACGGACGGACAGGACTAAGAGAGATCATAGAATATTATACAGAGATGTATTGATGTTTGTTGGTAAATTCGAAGAGCCGGTAAAATATAACTGTGAAGTTAACTATCTCATATCACAGTTAAAATCTAACCTCAGCGTGGACTTGCCGTACGACGGTGGTTGTGTTAAGTTAAATTTAAACCGTATGCTCTGTAACGCCTTGCTGTGGGACGATCGAGAAGCCTTGAACCTTCTACTGACAATCCTGTGCGCTAACTTGTCCTCCTGCCCCGGTCCGTCCCTCCGCCGCGTATTGCTCCGCGTGTGTGTCCGTTTGTCACATCACAAGGTCTCAGCTCGAGTCGTGACCGCTCTCGGCATGTTAGATAACAACTCTGATGAACTCGTACAGATTATCATAAACGAGGCGTCTTTCTCGTCACGGAGCACCCTCACGGACGGTCTGGAGCTGATGATACAGAAAAATCCCAGTACATTGGAACGTGTACTCGAGAGGCTCATACGTGTTGAAGGAAATTACAGTCAAGCGTACATGGAGCTGCTGGAGAGAATTCTGGATTTGTTGAGGCACAGAAACGCTGTGTGTGAGAGGTTCCTGCCAGGACTCGTGCTGAAGGTGTTGGAGGCGGAAGACGCGAACGTCATCATAGACAAGTTGTTGAGTCTGTTCCTGGAAAAAATATTTTTATTCGAGACGTCATTGCAGAGCTTCATGGAGGATGTGTTTGTCGTTATAAACGAAGCGAAGTGCCGCGTCAAAATTGATAAAACTATTGCTTTTATGAGGAATGTACTGAATGAGAAGAGGATGGACGGCGATCTGTTCGCACGCGTCGCGCCGGGCCTGAGGGCGTGTGTGGCGATGAAGGGATGGCTCGATTTTGATGAAACTATGAGGAGCGCGGTCGTCCGGGCCGCGGACGGGACGGGCGGCGGCCTGGACGCGGATATTACATTGGAGTTTTTATCAGTCTTCTCAGAAGTTCTTTTTCAAGCAAACTTTGAGGCAACACTGACACTATTATCAAACGTCATAGACAATGCAACACCAAAACAGATTATAAAAAATGGCAAACACTTCGTGTTGTGTGTGCAAATTATCAGAAACATAAAATATGGCTTAAACGATACTAACGACACTGTCGTCCGATCGGCCAGAGTTATTATAGAGGCGTGCGATGGATCGATTTTGAGCAAATCGGACGAATTTATAGTCTCATGGAAGGATATGTTTAATAGTTTTTTGGATTCGCCTCATCTCAGGATGTGTTTAGAGACCAGCTCCGAGGGTCTGACGGATTTGACGTACGTCGCCTTGACATATATGGATGATGTGGAAATTAAAAATATCCTAACGAGAGGTCCAGGTGAAACCGGGCTAAAGTTCGCAGCCTCCCTAATATCAGTGTTCTATCCCTTCTTGAATGCAGCACTACTAGAGTCTTGTAGTTTGCTGTTGTACGATGTTGTACGTTATGCGAACAGAAGGGGGCGGAGGGGGGAGGTCGAGGACGTCATTGAACAGATTTGGCCTCACTACGAAAAAATAGCGACAAACAAACAGAAACAGCAGTTCTTTATGGAATGTCTTCCCGGAGTTAGGAGTGAAGACTGCGTGGTGTACAGGGGACTCGTGGATCTGTTGAAGTCTCGCGAGGACCTGGAGACCAAACTCATGATGATAGATGTTCTACCGAAGTCTCTCCCCCTCTTGTTGTTCCCCCTCCTCCCCTCCCGCCTGTCGGAGCTCCGCGGCTCTTTGGCCAACTGTTTCAGAGCTATACTGGACGCGCTGGCTACGAACGCACATTATCTCGTTATAAAAGCTGTCGCCACACTCGCAGCCACGGACCCCACTCCCGGGTGGTGGGACTCCGCCCTGGACTCGTGCATGAAGTCCCTAGCGAGGACGGAGCACCGCGGCTCCTGGCAGGCCATCTATGATACATGCA
TTAATCTCGAATACAGCGCTTTCATCAGACTTATGGTGCCATTATTAAGGCACTCTGACTCCTATGACGTGGAGTGGTTCGCGTCCCCCCTACTCCCCGAATTTCTGCGCTGTCTCCGTCAACGCCCGCGTGGCTCGGACACTCGCGTAAACCGGAAGTGTCTCACGGAACAGACGCGAGCCTTCAACATATTGGAGATCATCTTCCAAAAGGTACCAAAACACCACATAGAGTCACCCCAGTCTCTCCTCTACAGATCTCTTTCTGTGGAACCACACTCTCTCTACTATCTAGTCTCTACCGTGTGTAAGCTGTGCGTGGCAACGAGGACCCAATACGATAAAACGGGAGTCGAAGACGAATACAGATTGTTCCAACTCGCTAACTTCGCCTGTCTTTCCTCGGCTCTTCTGTGTCGTGAGCCGCGCGCTCCAGTGTACTCCTGCGTGTTTGATGTCAAAGTCTGGAGCGAGCTGGTCTCCGAGGAGGTGTCTGCAGCGAAGCCTCAGTGGGGTCACCGTGTCACTCGGTACGCCCCCTCTGTCCCCTCTGTCCCGTCCGTCTCGTCTGTCCCGTCCGTCCCGTCCCGCTTACCTTCAAAAAGCCTCAGTCTTCGCTCCCCCGTGTTTCTGAGGACGCTCTCCGAGAATCCGTTCATGTATGATCTCGTGGAGGAGAAAGAGGAGGTGAGTTTCAGGGAATAA

Protein sequence:

>DPOGS215095-PA
MGKSMDSDLRSLTGQQYALFFLRCGGKGCSGQDASFKGHSLRLSERLSRGLSLLLRKYEKIKKMASLDKRYRLQYHDFLVNMRKWHTPNMSLACRDRVTDLFLRVSHCHRRGEAPQGSEPGPGTAARTVAPSLMQIMEEWEKFIDHYEINSDCNKFRLALKSAKVALDEDLAPFESEWIFSTLFHNPVNLLEVVSKRWSDEEYTKSTMDALKLLDDAIVKYDNIEKYYEDIVQLCLLPYPTTQRHNALACLTQVARRSVCGARHYYKHVAALQQGVCMTPLVCLIGVICEFHPHVVSEDVDNIWRVFLNILDSNKYSGTVMKAVLQAILRLFKNFGEDLPSGELIRFYDQLVRHFESCQSVCIEILTHHAGLFFTCVTRDSRTRAHLWKLRASDALAAVYKVCDKEVLQEVKQYVTSADYRERYTAFRILGDEAPGLELQELEFQLRNGNVDYELCEGISWCIQTNTPSVHRLLHATIVLYESIPKTKRPEILKALMDSHTDIVKAAITFIISEIVKDKKLSVWTDILMSEDNRFVFEIIENYINMTLEGPEQSVEGPPVSSLVPLLMSLPPVPLKIRHLSESCPHTTALLGAFSGRLQSNSISRCTDDVKLKCSLVVLELTDQHTYPESELLTALRTIISDNDTEAAILSKAVNALDSFVMNHCVEDYILAEIINNLNTIRRRTDRTKRDHRILYRDVLMFVGKFEEPVKYNCEVNYLISQLKSNLSVDLPYDGGCVKLNLNRMLCNALLWDDREALNLLLTILCANLSSCPGPSLRRVLLRVCVRLSHHKVSARVVTALGMLDNNSDELVQIIINEASFSSRSTLTDGLELMIQKNPSTLERVLERLIRVEGNYSQAYMELLERILDLLRHRNAVCERFLPGLVLKVLEAEDANVIIDKLLSLFLEKIFLFETSLQSFMEDVFVVINEAKCRVKIDKTIAFMRNVLNEKRMDGDLFARVAPGLRACVAMKGWLDFDETMRSAVVRAADGTGGGLDADITLEFLSVFSEVLFQANFEATLTLLSNVIDNATPKQIIKNGKHFVLCVQIIRNIKYGLNDTNDTVVRSARVIIEACDGSILSKSDEFIVSWKDMFNSFLDSPHLRMCLETSSEGLTDLTYVALTYMDDVEIKNILTRGPGETGLKFAASLISVFYPFLNAALLESCSLLLYDVVRYANRRGRRGEVEDVIEQIWPHYEKIATNKQKQQFFMECLPGVRSEDCVVYRGLVDLLKSREDLETKLMMIDVLPKSLPLLLFPLLPSRLSELRGSLANCFRAILDALATNAHYLVIKAVATLAATDPTPGWWDSALDSCMKSLARTEHRGSWQAIYDTCINLEYSAFIRLMVPLLRHSDSYDVEWFASPLLPEFLRCLRQRPRGSDTRVNRKCLTEQTRAFNILEIIFQKVPKHHIESPQSLLYRSLSVEPHSLYYLVSTVCKLCVATRTQYDKTGVEDEYRLFQLANFACLSSALLCREPRAPVYSCVFDVKVWSELVSEEVSAAKPQWGHRVTRYAPSVPSVPSVSSVPSVPSRLPSKSLSLRSPVFLRTLSENPFMYDLVEEKEEVSFRE-