Monarch geneset OGS2.0

DPOGS215105
TranscriptDPOGS215105-TA1983 bp
ProteinDPOGS215105-PA660 aa
Genomic positionDPSCF300139 - 138580-234127
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0142241e-12398.11% 
BombyxBGIBMGA009595-TA5e-10092.35% 
DrosophilaCstF-50-PA2e-7164.93% 
EBI UniRef50UniRef50_Q9V9V03e-6964.93%CstF-50, isoform A n=40 Tax=Bilateria RepID=Q9V9V0_DROME
NCBI RefSeqXP_393185.22e-7865.18%PREDICTED: similar to CstF-50 CG2261-PA, isoform A isoform 1 [Apis mellifera]
NCBI nr blastpgi|3072127591e-7765.92%Cleavage stimulation factor 50 kDa subunit [Harpegnathos saltator]
NCBI nr blastxgi|3838495478e-7566.37%PREDICTED: cleavage stimulation factor subunit 1-like [Megachile rotundata]
Group
Gene OntologyGO:00055151.3e-15protein binding
KEGG pathway 
InterPro domain[114-210] IPR0159431.3e-15WD40/YVTN repeat-like-containing domain
[109-206] IPR0110463.7e-15WD40 repeat-like-containing domain
[170-205] IPR0197815.5e-10WD40 repeat, subgroup
[166-205] IPR0016802.5e-08WD40 repeat
Orthology groupMCL21054 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215105-TA
ATGAAGGAAACGCCAACTCCGGAAGTAGATACTAGAAATGTCGTGAAGAACAGAGAACTTCTTTATCGAATGATTATAAGTCAGTTACATTACGATGGTTTCCAACCTATAGCCGCGACTCTATCAGCCGCTGTACACGCAGACCCACCGTGCCCTCCGAGCGACAGATTATTAAATCTAATGATGGTCGGTCTACAACATGAACCGGACCGGAAGGACAGGCTGGCGGCATCCAGCGGGGCGGAACATCTGCTGGGAACTACCGGCTTCGATCTCGAGTTTGAGATGGACGCGTCCTCGCTCGCCCCTGAGCCGGCCACGTACGAGACGGCGTATGTGACGTCACACAAGATGTCTTGTAGGGCCGGAGCGTTCAGCGCCTGTGGTCAGCTGGTGGCTACCGGCAGTGTGGATGCTAGCATTAAGATTCTGGACGTGGAGCGGATGTTGGCTAAATCAGCTCCCGAGGAAGTTGATCCCGGGAGAGAGCAGCAGGGACATCCGGTGATACGAACATTATACGATCACACCGACGAGATAACCGCCCTGGATTTCCATCCCCGGGAACAGATCCTGGTATCAGCGTCCCGGGACTGCTGCATCAAACTGTTCGACATAACCAAAGCCTCGGCCAAGAAAGCCTATAAGTCCATTACGGTGAAAATCGACAACACTCTAAAATTCATTCTCGGTATACAATCTGATGCTACGGAACCAAATCATGCAATGGTGACCAAACAGCATCACTTCACACCAAAAACTACAAATTTGGAAGCTATGAGGGATTGTATGCTGAAAAAGATTCAGGCCAGCCCTACCCTACCTTGCACGACGATCCCGGTCTTCCAAATCCCGGAGTTCCTGAACCATGCATGCACGTGCAGCTGCCATGCATGCGAACTGCCGGCGTGCATAATCATCGCATGCCAGATATGCTTCCTAGAAGCGTCCGCGTACTTCAGAGCCAAAGAAAACGAAATAGCCAGGAACTACTTCAATGGAGCCCTGAATGCTTTTGAGATGGCTGAAGTTAAATTGAAAAAGACTTTCGAAGTTTACAAAAGGTATTTAAGAGCTGCCATTGTGGATACCGAGAGACGAAACGTCGAGCGAGACTTTAAACAAGTGCAGATAGAGTTCTATGTTGAATTGTCGTATTTTGAGCTCAGTCAAGGCGATTTTGAGACCTCTGACGAATACGTCCTTAAGATTCACGAAATTATGTCGGATATGAGGGACCTCGATCCTTATCTGAGGAACGAAGTGTACAATTTAATGATAGCGTCAGCACAAATCAGGAAAAATGTCAGAAAACCCAAAGAAATCGGCCTGGAGGTGGAGTTGGAAAATCTGAAACTCAGTCCCGACAAGGACGTGGAGTTACAGAAGACGCCGGAAGCGAAAGCAACCGTCCCGAAGATAGGAACAGTGAGGGTAGTTAAGGACGAGGAAATACCAAAGAGACGGAAGGTCATCAAACTTAACTTGGACGAAGCCAGCGAGGAGAGTACTGAAGAAAGAACAACTAGGAGCAAAACCAAAAAGCCGCAGTTCAAAATACCGGTCCCGGTGACTGCGAGACCCGTGTTAGAGACCATCACACCCAGAGCGACCAGGTCTAGACCGGAAATAATCATACAACAACCATCTCTAGACCACACAGACATCAAAATCTTCACCCCCAAAACCTCGAACACCAACGAATTCTTCACCCCCCGCGAGTCGACCCCAGCCGAACAATTCTTCACACCACAGACATCAATAAAGACTTACTCCAGACGAATCATAAAAAATCTGGACAAGGAATTTTCAACGCCCAAAGGAAAGGAGAATTCACAAGATAATGTGGGGAAGAAAGTTGACACTGGTTCCGTAAAAGTATTGAGAGACAAAAGGGTATTGAGACGAGCGACCAGTCCGGGGAAACTGGTTCAGAAACCCGAGTCTAGACCGAGAAGGATAAGGCAGCCAGTTATAGACTAA

Protein sequence:

>DPOGS215105-PA
MKETPTPEVDTRNVVKNRELLYRMIISQLHYDGFQPIAATLSAAVHADPPCPPSDRLLNLMMVGLQHEPDRKDRLAASSGAEHLLGTTGFDLEFEMDASSLAPEPATYETAYVTSHKMSCRAGAFSACGQLVATGSVDASIKILDVERMLAKSAPEEVDPGREQQGHPVIRTLYDHTDEITALDFHPREQILVSASRDCCIKLFDITKASAKKAYKSITVKIDNTLKFILGIQSDATEPNHAMVTKQHHFTPKTTNLEAMRDCMLKKIQASPTLPCTTIPVFQIPEFLNHACTCSCHACELPACIIIACQICFLEASAYFRAKENEIARNYFNGALNAFEMAEVKLKKTFEVYKRYLRAAIVDTERRNVERDFKQVQIEFYVELSYFELSQGDFETSDEYVLKIHEIMSDMRDLDPYLRNEVYNLMIASAQIRKNVRKPKEIGLEVELENLKLSPDKDVELQKTPEAKATVPKIGTVRVVKDEEIPKRRKVIKLNLDEASEESTEERTTRSKTKKPQFKIPVPVTARPVLETITPRATRSRPEIIIQQPSLDHTDIKIFTPKTSNTNEFFTPRESTPAEQFFTPQTSIKTYSRRIIKNLDKEFSTPKGKENSQDNVGKKVDTGSVKVLRDKRVLRRATSPGKLVQKPESRPRRIRQPVID-