Monarch geneset OGS2.0

DPOGS201658
Transcript        DPOGS201658-TA  966 bp
Protein           DPOGS201658-PA  321 aa
Genomic position  DPSCF300550 + 7460-10723
RNAseq coverage   273x (Rank: top 39%)
Annotation
Heliconius      HMEL012076        2e-13   21.59%
Bombyx          BGIBMGA013842-TA  9e-69   93.01%
Drosophila      Cpsf160-PA        3e-107  57.10%
EBI UniRef50    UniRef50_D6WFP3   1e-126  63.66%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WFP3_TRICA
NCBI RefSeq     XP_624382.2       2e-134  66.97%  PREDICTED: similar to cleavage and polyadenylation specific factor 1 [Apis mellifera]
NCBI nr blastp  gi|350413821      4e-134  67.27%  PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Bombus impatiens]
NCBI nr blastx  gi|350413821      2e-129  67.27%  PREDICTED: cleavage and polyadenylation specificity factor subunit 1-like [Bombus impatiens]
Group
Gene Ontology    GO:0005634  7.5e-77  nucleus
                 GO:0003676  7.5e-77  nucleic acid binding
KEGG pathway     ath:AT4G05420  1e-21
                 K10610 (DDB1) maps-> Ubiquitin mediated proteolysis
                                      Nucleotide excision repair
InterPro domain  [5-285] IPR004871  7.5e-77  Cleavage/polyadenylation specificity factor, A subunit, C-terminal
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201658-TA
TCCAGGTCTGGTCTCCGCGGTTACATCGCGATAGGAACTAATTATAACTACGGAGAGGATATTACTTCTAGAGGGAGGATTTTGATTTACGATATAATAGATGTCGTACCAGAACCGGGCCAGCCGTTGACCAAAAATAGGTTTAAAGAAATATACGCGAAGGAACAGAAGGGTCCCGTGACAGCTCTCACACAAGTTTTAGGGTTCCTCATATCGGCTGTGGGTCAGAAGATATATCTCTGGCAGCTGAAGGACAACGACCTCGTCGGCGTAGCGTTCATTGACACCCAAATTTACGTCCACAGAATGTTGGCTGTTAAGAATCTGATATTGGTAGCTGATGTTTACAAATCAATATCCCTCCTGAGATACCAACACCAACACAGGACGCTGTCGCTCGTGTCCAGGGACCTCAGGACGGCTCAGATATACGACATGCAGTTCATGATAGACAACACGAGTCTGGGCTTCCTCGTGAGTGAGTCGGAGGGTAACTTTGCTATGTACATGCACCAGCCTCAAGCCAGAGAGAGTTACGGAGGTCAGCGTTTGATTCGTAAATGTGATTATCATCTGGGACAAAGAGTACACGCCATGTTTCGTTTGGCGGCTAGAGGGGAGAGACAGACACACGTCACTATGTTCACAACCCTCGACGGGGGAGTAGGCTACGTGCTGCCTGTCTCAGAGAAGGTCTACCGAAGATTATTGATGTTGCAGAACGTCATCAACAACTACTGCTGCCATCTAGCGGGACTCAACCCTAAAGCTTACAGGACGTACAAAGTATCGCGTCGCGCCTTGTGTGGTGGAGCTGCGCGAGGAGTGCTCGACGGAGATCTGGTCTCATTGTACACGAGCATGCCACGGACAGAACAACAGGATATCGCGAGAAAAATTGGTACGAAAGTTGAGGAGATTATGTCTGATCTATATGAGATAGACAGACAGACGGCTCATTTCTAG

Protein sequence:

>DPOGS201658-PA
SRSGLRGYIAIGTNYNYGEDITSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKEQKGPVTALTQVLGFLISAVGQKIYLWQLKDNDLVGVAFIDTQIYVHRMLAVKNLILVADVYKSISLLRYQHQHRTLSLVSRDLRTAQIYDMQFMIDNTSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRKCDYHLGQRVHAMFRLAARGERQTHVTMFTTLDGGVGYVLPVSEKVYRRLLMLQNVINNYCCHLAGLNPKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQDIARKIGTKVEEIMSDLYEIDRQTAHF-
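
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is translated in frame 1 with the standard genetic code and that the trailing "-" in the protein record marks the stop codon; the record itself does not state either point. The helper names translate and expected_protein_length are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated with
# bases in T, C, A, G order. Assumption: the record uses this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as stop_symbol, mirroring the trailing "-"
    in the protein record. Codons with ambiguous bases become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count implied by a transcript that ends in one stop codon."""
    return transcript_bp // 3 - 1


if __name__ == "__main__":
    # First 18 bases and last 12 bases of DPOGS201658-TA, copied from the record.
    print(translate("TCCAGGTCTGGTCTCCGC"))   # SRSGLR
    print(translate("GCTCATTTCTAG"))         # AHF-
    print(expected_protein_length(966))      # 321
```

Both excerpts agree with the start (SRSGLR) and end (AHF-) of DPOGS201658-PA, and 966 bp with one stop codon gives the listed 321 aa. Passing the full nucleotide sequence to translate should reproduce the whole protein record under the same assumptions.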