Monarch geneset OGS2.0

DPOGS207031
Transcript: DPOGS207031-TA (3603 bp)
Protein: DPOGS207031-PA (1200 aa)
Genomic position: DPSCF300001 + 1604793-1615225
RNAseq coverage: 198x (Rank: top 47%)
Annotation
Heliconius      HMEL009419        0.0     59.67%
Bombyx          BGIBMGA012974-TA  0.0     63.30%
Drosophila      Cap-G-PF          8e-76   25.87%
EBI UniRef50    UniRef50_E2AIZ3   4e-104  31.35%  Condensin complex subunit 3 n=2 Tax=Formicidae RepID=E2AIZ3_CAMFO
NCBI RefSeq     XP_001600779.1    1e-96   30.17%  PREDICTED: similar to mCG21477 [Nasonia vitripennis]
NCBI nr blastp  gi|383859903      1e-113  32.58%  PREDICTED: condensin complex subunit 3-like [Megachile rotundata]
NCBI nr blastx  gi|383859903      4e-116  32.24%  PREDICTED: condensin complex subunit 3-like [Megachile rotundata]
Group
Gene Ontology    GO:0005488           5.4e-21  binding
KEGG pathway     ame:413691           5e-92
                 K06678 (YCG1, CAPG)  maps-> Cell cycle - yeast
InterPro domain  [22-793] IPR016024   5.4e-21  Armadillo-type fold
                 [115-266] IPR011989  3.8e-06  Armadillo-like helical
Orthology group  MCL12404             Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207031-TA
ATGCCACCTACCGATGCAGAAGTGCGTCGAGAAATTGTAAAAGCAAATCCACGAAATGATAAGACAATGTTTAAAATATTTCAAAATGTTCAATATAATGTTGTTCAACATAGGAAGTATGTGAAAGAAATGACGAAACTTTACAAAAAGACTGAAGCGGATGACTTTAAGGAAAGCTTTAAAAATGCATTATTTTACCTCTTTACTTTTGGAGATACAAGTACAAATGTAGATCGTGTTATTCAATTTGTAGCAACATTCTGTACATTACTTGACGATGAAGAGGAGTTTCTAATGTTTATATTTGATATTATTTTCGAATGTCAGTGTGTATCTGGTCAGTCAGTAAGATACCGAGCAAGCCAGTTGCTGGCAGCCGTACTAGCTGCACTCGGTGATGAAGCCTCTTTAGATGATGACCTCTGTGACAAGCTGTTACTTCATCAGATGCAACGTCTCCAAGATACACGTGGTGCTGTCAGATGTCGGGCAGCATTAGCCCTCAATAGGTTGCAAAACCCAAGTGATCCAGATGACGAGGTAACCAGGGGTTACCGATTTCACATGAGCTGTGACCCTAGCTCCTCTGTTAGAAGGGCTGTAGTGATGTCAATAGCAAAATGCACTCGGAATGTCCCCTTTGTATTGGAGCGCCTCTGTGACGTTGATGAAGCTGTAAGAAGAGCCGCATTCCTATACATAGCGGCTATGAATGTAACACAATTAAGAGTTAGACAGAGAGTTCTTACATTGAAGGTTGGCCTCACTGAACGCAGCCCGCGAGTGCGTCGTGTGGTAGAAGAGATTTTAATACCCAGCTGGTTGAGTACCTTCCAAGGCAACATCATAGACTTTCTTAAAGCAATACGTCTGGATAATTCACACGACGCGAAAGATTCGCAATACGTCGCAGAGAAGCTCTTGGAGTCGCTTTTCAAACGTCTACCGATATCAGAGCTTCTAGAATGGCTGCCAACTGACAAGTCACTCCGAGTTATCCCCGCTGACAAGTTGAACAAGGAAACAGTTTGGTACTGGCGCCACCTTGCGGAGCATTTACAAAAGAATGATGATGACGAGACCCTCGAGACTGTGCTACCTGATCTAGTTGTACTGACTGGATATATTAAAGCTATCGTGGAATCACCATGTCCGAATGAGGAGGCGGATCCGGTGTCGTATAGCACTCGTCAGTATGTGCTTCACGAGCTGGCGAGATTACTACGGACTTACGACGCCAGCGACCCCGCAGGTAGAGACGCCCTACAGACATTGATCACCGACACACTTACAGGTGACTACGGTCCTATGAGCGGGGACGTGATCCGCGCGTTTGTATCAGCCCTGCAGTTGGTTTTGCCAGATGTGACGAGTAGAGTGGAACTTGTATGCAATGTTCTGTCAACACTACGGGAACCACCGGAAATGGAGGAGGAAGTACCTCCGCCGACCTTGGACGATACAGAGGCAAAATTACAGAGAGCCAGATTGCGTGTCTCCCTAAATGTGGCAATGGAAGCTCAAGAAGAAGCTGTTAGACATGAAAACTATACTCTTGCTGCTCAATGCAAAGCTAAGGTTGCCGACATTCAGAAAAAATTGGAGGAATTAACATTTCAAACAAAACCAGAACAACCATTAACTACAATCAAAGAGAAACAATGTGATGTGACAACATTAAATAAGTGTCTAATAATACTAAATACATTACTGGACACACCACAACTGAACAATGTAACACCGATGTTGAATCTTATGTTCAGCGAGCTAGAAGTTGAAATATTTTCCAAGCCCGAACTATTGGACAATGCTCTTGAAACGGTGGCACTATTTGGCATGCTGGATAAAGAATTTGCAAGAGATCATAAATCATTCTTCTTTGCTAATTTAGTTGATTCAACAAACGAACCAACAGTGTGTAAAGTGCTTAAGTGTATAGTTGACCTGTTGTGTGTACACGGAGCTAAGGTTTTTGACGATGGTACAGAATCCATCGAGGC
TTCCAGGAACAGATCTAAACATTCCATAAATACAACTACTATGGATTTTGATGAATCTGTATTGTCATCATCTCAAGCGCACAGCAACGTTATTGAATTACTCCTTAAGTTGATGGATAACGCTTGCCCATCATATAGACTGATAATAGTAGAGGGTCTATGTCGCCTGATGTATCTAGGACATTTAGAATCGCCTTATATACTGAGCAGATTGATACTACTCTGGTTTAACCCAGTCTCAGCGGAAGAAGATGTACTACGACAGACTATAGGCATTTTCTTCCAGACGTTCCCTAGTACTGTCGACGGTGCCCAAGATCAAATACAAAAATCTATGATACCGACACTGCGTGCTCTTTGTTGTGCGCCGTCAAGCTCTCCGATCTGTGAGATCGACCAGGAGGCGGTTGTGAAGTTTTTTGTATCACTAACAAAAGTCAGCTCGGAGTTGACAGACAGCCAGGGTGCTATGGCGTTGACACTATGCGAGTATCTAGTTCGTAAACCGACGGGTCCCGCGTCTGCTCTACTTTGTCGGGCGCTGGCTCTTCTCTCACCGCCTAAAGACGTTCGCACTGCTGCTAATCTGGCGACTATGATCAAAGATCTCTGTCTGAAATTACCAGATAAACAATCCTGTAGGAATCTGACGCGTTATCTCGGTGCCCTGGAGGCGTTGGAAAAGAGTAATCTCAATAAGATGTCTAACATAGGTGAAACAATTGACAGTATACAGTGTGAAGATACAATGAACATGATGGGACGTTCGGCCACCTCATTACCTCAACCTTTAGCTCGTAGTACCCACGTGGTTGTTAATGAGACTGTAGAAGAAGAACCAGAAATTGAAGAAACTTCTTCGGGTGAATCTCCTACCGACCCTATCTCTAGGATATCTGAAGAAAACAAAAATCAAACCGAAGACATGACGGCCGCTGAGCAAACGGAGACGGAAGTTCCTGAAAAGGAGGACAGTGGATCAGACAGCAGTTCTGTGTCGCCTGTAAAGAAAGCTAAAATAAGCAAAGACAAGAAAAAGAACATGACATCAAAGAATGAAAAAGATTTGCGTAAACAAAGGCAACCGAGAAATAAGAAGGACGCTAAAGATAAAGTCAGAAATGAAGAGGATAAAGAGAAGGGAGTCAAGCGTAGTTCACGATCCACAACAGCCGCAATAAGAGCCGAAACTGATAAGAAGTGTCAGGAGCATGACTTACAAGAATCACCACCATCGGACGGTTCTAATACGACGGTACGTAGATCAAGCCGTGGCCTGCAATCCGGGTCAAGCACCGAGTCGACCGGATCAAAAGGGAAAAAGAAAGCCAGTCAGATGGCCAGTAGCGAGCGCTCGTCCCCGAGTCACAGCAGCAATGACAGCGCTCAATTTGATTCTGACACCACAGAACTACACACCATCGTCTACGACGCACCATTGGAACAAGAACTTCTGGATGACTCAATTGAGTTGATGGACAGCAGACGATCGTCCAGAAATAACGTCACTATTCCTGAAACTCCAGAGGCTAGTGAAGAGTCCGACTCTGAACTTGAAGTGGCCATAAAAGGAAAGAGGAGATGTAAGGGAAAAAAGAATTAA

Protein sequence:

>DPOGS207031-PA
MPPTDAEVRREIVKANPRNDKTMFKIFQNVQYNVVQHRKYVKEMTKLYKKTEADDFKESFKNALFYLFTFGDTSTNVDRVIQFVATFCTLLDDEEEFLMFIFDIIFECQCVSGQSVRYRASQLLAAVLAALGDEASLDDDLCDKLLLHQMQRLQDTRGAVRCRAALALNRLQNPSDPDDEVTRGYRFHMSCDPSSSVRRAVVMSIAKCTRNVPFVLERLCDVDEAVRRAAFLYIAAMNVTQLRVRQRVLTLKVGLTERSPRVRRVVEEILIPSWLSTFQGNIIDFLKAIRLDNSHDAKDSQYVAEKLLESLFKRLPISELLEWLPTDKSLRVIPADKLNKETVWYWRHLAEHLQKNDDDETLETVLPDLVVLTGYIKAIVESPCPNEEADPVSYSTRQYVLHELARLLRTYDASDPAGRDALQTLITDTLTGDYGPMSGDVIRAFVSALQLVLPDVTSRVELVCNVLSTLREPPEMEEEVPPPTLDDTEAKLQRARLRVSLNVAMEAQEEAVRHENYTLAAQCKAKVADIQKKLEELTFQTKPEQPLTTIKEKQCDVTTLNKCLIILNTLLDTPQLNNVTPMLNLMFSELEVEIFSKPELLDNALETVALFGMLDKEFARDHKSFFFANLVDSTNEPTVCKVLKCIVDLLCVHGAKVFDDGTESIEASRNRSKHSINTTTMDFDESVLSSSQAHSNVIELLLKLMDNACPSYRLIIVEGLCRLMYLGHLESPYILSRLILLWFNPVSAEEDVLRQTIGIFFQTFPSTVDGAQDQIQKSMIPTLRALCCAPSSSPICEIDQEAVVKFFVSLTKVSSELTDSQGAMALTLCEYLVRKPTGPASALLCRALALLSPPKDVRTAANLATMIKDLCLKLPDKQSCRNLTRYLGALEALEKSNLNKMSNIGETIDSIQCEDTMNMMGRSATSLPQPLARSTHVVVNETVEEEPEIEETSSGESPTDPISRISEENKNQTEDMTAAEQTETEVPEKEDSGSDSSSVSPVKKAKISKDKKKNMTSKNEKDLRKQRQPRNKKDAKDKVRNEEDKEKGVKRSSRSTTAAIRAETDKKCQEHDLQESPPSDGSNTTVRRSSRGLQSGSSTESTGSKGKKKASQMASSERSSPSHSSNDSAQFDSDTTELHTIVYDAPLEQELLDDSIELMDSRRSSRNNVTIPETPEASEESDSELEVAIKGKRRCKGKKN-
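
Length check:

The lengths reported in this record can be cross-checked with simple arithmetic: a 1200 aa protein plus one stop codon needs 3 x 1200 + 3 = 3603 bp, which matches the transcript length. The sketch below is illustrative only. The function names are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive and that the transcript holds coding sequence only (it begins with ATG and ends with TAA).

```python
# Values copied from the record above.
TRANSCRIPT_BP = 3603
PROTEIN_AA = 1200
GENOMIC_START, GENOMIC_END = 1604793, 1615225


def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding length in bp for a protein of `protein_aa` residues.

    Hypothetical helper: three bases per residue, plus three for the
    stop codon when `include_stop` is True.
    """
    return 3 * protein_aa + (3 if include_stop else 0)


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


cds_bp = expected_cds_length(PROTEIN_AA)
span_bp = genomic_span(GENOMIC_START, GENOMIC_END)

print(cds_bp == TRANSCRIPT_BP)  # True: 3 * 1200 + 3 = 3603
print(span_bp)                  # 10433
```

Under these assumptions the gene spans 10433 bp on the scaffold while the transcript is 3603 bp, so the difference would be intronic sequence.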