Monarch geneset OGS2.0

DPOGS214220
Transcript: DPOGS214220-TA (2163 bp)
Protein: DPOGS214220-PA (720 aa)
Genomic position: DPSCF300014 + 789061-794470
RNAseq coverage: 537x (Rank: top 23%)
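The lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the 2163 bp transcript is coding sequence only (including the stop codon), and that the genomic coordinates are 1-based and inclusive. The variable names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 2163
protein_aa = 720
genomic_start, genomic_end = 789061, 794470

# Assumption: the transcript is pure CDS, so its length is a multiple of 3.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3            # 721 codons in total
coding_codons = codons - 1             # 720 once the stop codon is excluded
assert coding_codons == protein_aa

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1   # 5410 bp
non_transcript_bp = genomic_span - transcript_bp # 3247 bp

print(codons, genomic_span, non_transcript_bp)   # 721 5410 3247
```

Under these assumptions the gene spans 5410 bp on the scaffold, of which 3247 bp are absent from the transcript, which would be consistent with intronic sequence.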
Annotation
Heliconius        HMEL016413               0.0      72.18%
Bombyx            BGIBMGA005946-TA         0.0      65.90%
Drosophila        cnn-PA                   1e-13    40.00%
EBI UniRef50      UniRef50_UPI00016E703F   4e-13    44.87%   UPI00016E703F related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E703F
NCBI RefSeq       XP_392107.2              2e-19    38.73%   PREDICTED: similar to centrosomin CG4832-PA, isoform A [Apis mellifera]
NCBI nr blastp    gi|328782988             2e-19    38.92%   PREDICTED: centrosomin [Apis mellifera]
NCBI nr blastx    gi|328782988             4e-26    24.12%   PREDICTED: centrosomin [Apis mellifera]
Group
KEGG pathway 
InterPro domain: [50-122] IPR012943, 2.3e-22, Spindle associated
Orthology group: MCL25186, Lepidoptera specific

Nucleotide sequence:

>DPOGS214220-TA
ATGGCAACGGCAACGTTACCTCGTATGAATGCATCTAAGCATGGGTCAAATGGTATCCCGTCACCCTTTAGTACCAGCCCACGTAAATTGCAAGAGATGACTCTCCCAGCTGGTGATGGTCATGATGTGACTACAGCCAGCGGCGTGAGCATGAAGCAATATGAAGAGCAATTGAACGGTTTGAGAAAAGAAAACTTTCATCTGAAACTACGAATTTATTTTTTAGAAGAGAAACTAGGTAGTGGAGCACCGCCAGCGGTACAGGGTCTTTTAGAACATAATGTTCGTCTGCAAGTCGAAGTTGAGGAACTAAGAAGGCAATTAAGCGATAAACAAGAATTATTAGCTGCTGCCGCTGAAGCTATAGACGTCTTGGAAAATCAAGGGTCGATTTCAACGGAAGCTGCGGAAGTATCGATTAATAACAGTGTCACAGAACGGAAAGATGTACAGGAACCAGCAAAAGAGACAGAAAACCAAGCAAACGTAGTAGATGAGATTTACCAGTCGGAATCAGCTGGAGATTACTTGGCGGTAACAGTCAATAACGAAGAAATGGATCAAATAACAAAACTGAAAAAAGAAAAGTGTAAAGCTGTAAAAATTATAAAAGGTTTTGCGAAAAAATTACAACAGCAGGACAGAGAAATAAAGAGATTGAAACTGGACAAAGAGTTATCATACGCCCACGTCAGTGACTCTCAACTTGAAAAATACGAAGAAATTTTGAAATGTAAGGATGAACATATCAAGGACTTAGAAAACAAAGTACGGCAGCTACAGAATAACGTTGATAACATGCAAAGCAGCGATGATTCAGGAAATCTACAGCAACTGCTAACAAAGAGACTACAGGGCTTAACGTATTTCCTGGATAAGTTGTTGTCGCATAAATGTGTTTTAGGAGAAGAAAAGAAAAAGCTAGCTGAAAGTATTTTAGAACAAAGTTTAGCGCTGCCGGTTGGGTTGCAGTTGGACGAGTCGATAGCACCAAATGAACTAACAATGGACGAGAGTAATCTGGATATATCGCAGTCCATGAGAATAGAAGACCTGACCATGATGTTCGGACATCATTTGATGTGCAGCGACGATAATCGTTATTCCCTCAAAAAAGCATTAAGAAATATTCCTCAAGCAGATGCTATATCGGAATCTGAATGTTGGTCAGAGCCGGATAGAAACGTGTCACTGGCTCGCGTCGGTCTTTGCGATACAGGAGTCGCTGTCAGAGAATCGCTGTCAGAAGTCGCCAGATGTCGCGCTCGGCGATCGCGATTGTCTCAAGGTTCACGTTCAGATGAGAATAAATGTTATGATCCAACTTTACAAGAACAGTTAGAATCAGTTTCAAGACGCAATCAAGTGCTCGAAGAGGAGAAGATGGATTTGACCGAAAAACTTGAGATAGCATTGAAGCAAATAAAAAGTAATTTAGACGAGAAAGAAGAACTGAAAGCTACTATCAGCGCTGAAAGAGATAACGCAAATGAAACTACCAAAAATATGGAATTAATGAAAAATACTATCTCATCTCTAGAAATTCGAATTAAGGAAAATGAGCAGCAACTGAATTACAATATTCAACTTATAGAAAGGTTACGTACGGAAAAAATTGAGCTGGAGTCTACATTTATGGAAACGGAACGTTCACTTCGAAGAGCAGCTGATGAAGCGACTGTTCAAGCGTCGCAAGCGGCCTTGGAACGAGCTCGTTTACAACATGACAGGTTGCGAATTGAAAGAGAATTAGAAGAAACTAGAGATAAATTAAACGCAGCGCTGGAAGCTAATGCACAATTAGAAATTGAAGTTACTCGCAAATTGGCCTTCGAGGCTGATAATAAAATTAATGAACTTCAAGTTGGAATGCCGTATGAGGAAGAACGTCCTACATCCCCAGACCAAGGAATTGATAGCGACAGGCTTTCCAGCTTAGAACAGAACGACGTAGTCAACTTATCGCCGCGCTCACTTTACGAAGAAAATGTTACATTAAA
GCAGAAGCTAGCAAGAACCAAAGCTACTCTCGCCGAAACACTAACACAGCTCAATGCCGCAAACATGAGAAAGAGAAACGTACAGCGCGCTATTTGCCGGGAGATTCACAAGACTCAAGGAGTGTTACGAAAGGCTCGCGATCAATTGGATCCTCATAATTAA

Protein sequence:

>DPOGS214220-PA
MATATLPRMNASKHGSNGIPSPFSTSPRKLQEMTLPAGDGHDVTTASGVSMKQYEEQLNGLRKENFHLKLRIYFLEEKLGSGAPPAVQGLLEHNVRLQVEVEELRRQLSDKQELLAAAAEAIDVLENQGSISTEAAEVSINNSVTERKDVQEPAKETENQANVVDEIYQSESAGDYLAVTVNNEEMDQITKLKKEKCKAVKIIKGFAKKLQQQDREIKRLKLDKELSYAHVSDSQLEKYEEILKCKDEHIKDLENKVRQLQNNVDNMQSSDDSGNLQQLLTKRLQGLTYFLDKLLSHKCVLGEEKKKLAESILEQSLALPVGLQLDESIAPNELTMDESNLDISQSMRIEDLTMMFGHHLMCSDDNRYSLKKALRNIPQADAISESECWSEPDRNVSLARVGLCDTGVAVRESLSEVARCRARRSRLSQGSRSDENKCYDPTLQEQLESVSRRNQVLEEEKMDLTEKLEIALKQIKSNLDEKEELKATISAERDNANETTKNMELMKNTISSLEIRIKENEQQLNYNIQLIERLRTEKIELESTFMETERSLRRAADEATVQASQAALERARLQHDRLRIERELEETRDKLNAALEANAQLEIEVTRKLAFEADNKINELQVGMPYEEERPTSPDQGIDSDRLSSLEQNDVVNLSPRSLYEENVTLKQKLARTKATLAETLTQLNAANMRKRNVQRAICREIHKTQGVLRKARDQLDPHN-
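
The relationship between the two sequences above can be spot-checked by translating short stretches of the transcript. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the `translate` helper and its `stop_symbol` parameter are hypothetical names, and `-` is used for the stop codon only because the protein record above ends in `-`. Only short excerpts copied from the record are translated here, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a nucleotide string in frame 1, ignoring whitespace and line breaks."""
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3   # drop a trailing partial codon, if any
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")   # X for codons with ambiguous bases
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 27 nt and last 24 nt of DPOGS214220-TA, copied from the record above.
first = translate("ATGGCAACGGCAACGTTACCTCGTATG")
last = translate("GATCAATTGGATCCTCATAATTAA")
# Excerpt spanning the line break in the nucleotide record.
junction = translate("GAAAATGTTACATTAAA\nGCAGAAGCTAGCA")

print(first)     # MATATLPRM
print(last)      # DQLDPHN-
print(junction)  # ENVTLKQKLA
```

The three translated excerpts match the start, the end and an internal stretch of DPOGS214220-PA, and the third shows that the line break inside the nucleotide record falls within a codon without affecting the reading frame.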