Monarch geneset OGS2.0

DPOGS202255
TranscriptDPOGS202255-TA1359 bp
ProteinDPOGS202255-PA452 aa
Genomic positionDPSCF300032 - 569222-571460
RNAseq coverage280x (Rank: top 39%)
Annotation
HeliconiusHMEL0050970.075.33% 
BombyxBGIBMGA004993-TA7e-4357.93% 
DrosophilaCG2100-PA2e-12855.16% 
EBI UniRef50UniRef50_B7Q4B38e-13253.86%Poly(A) polymerase, putative n=11 Tax=Coelomata RepID=B7Q4B3_IXOSC
NCBI RefSeqXP_001863506.17e-13755.88%CCA-adding enzyme [Culex quinquefasciatus]
NCBI nr blastpgi|2700097438e-14560.39%hypothetical protein TcasGA2_TC009040 [Tribolium castaneum]
NCBI nr blastxgi|2700097434e-14259.90%hypothetical protein TcasGA2_TC009040 [Tribolium castaneum]
Group
Gene OntologyGO:00063962.7e-31RNA processing
GO:00037232.7e-31RNA binding
GO:00167792.7e-31nucleotidyltransferase activity
KEGG pathway 
InterPro domain[73-195] IPR0026462.7e-31Poly A polymerase, head domain
Orthology groupMCL14795 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202255-TA
ATGAATCTAGTTTTCTTGAAGAATCTTGTTGTTAGATCATTTAGAGATTACAGCAATATATCAAATTTAAAACGGGTGAGGACAAGGAGAGATATGGAGATTAAGACGCGGGAAGATCCTATAGTTTTAAAATTAGATACAGCAGAGTTCCACAACATATTTACTCCAGAAGTGATAGACTTGAAAAAACTTTTTGAGAAATATCAATATGAAATTAGAATAGCCGGTGGTGCTGTCAGGGACTTGTTACTCGGTCAGCCTGCTAAGGATCTTGATTTCGCAACAACAGCAACACCACAACAGATGAAAGAAATGTTTACCGCTGAGAATGTGAGGATGATTAACATGAGTGGTGAAAAACATGGAACTATCACTCCAAGGATAAATGACAAAGAGAACTTTGAAGTCACCACGCTCAGAGTGGATATAGTCACGGACGGCAGACATGCTGAGGTTGAATTCACTAAGGACTGGAAGCTGGATGCTAATAGACGAGATCTGACAATAAATTCTATGTTTTTAGGGTTTGATGGATCTGTCTATGATTACTTTTATGGATACGAAGATTTGATGAAAAGAAAGGTTGCATTTGTCGGTGATCCCGATATAAGGATAAAAGAAGACTTCTTAAGGATAATGCGGTACTTCCGTTTCTATGGCAGGATCTCTGAAAAACCTGATAACCACGACCGGCACACCCTTGATGTCATAAAGCAAAACGCTGAGGGACTTCAAGGTGTATCAGGAGAGAGAATATGGATGGAATTAAAGAAAACATTACAGGGAAATTTTGCTGGTGACCTACTAAAAACTATGCTTAAACTGGATATCGGCAAATACATAGGTTTACCGAAACCGAATTTGGAAGAATTTGAGGGTCTACTAAAGAGAAGCGAACACCTTAGCTTGCATCCTATGACCTATCTGGCAGGATTATTAAACACTATAGACGATGTAACAATATTACATGCGAGACTCAAATTTTCCAGTTACGAAAGAGACATGGCTTATTTTATTGTTGAACATAGACCAGATAAAGATGCCTCCAGGCCCCTTCTGCCCTACGAAAAACTGGTTCTGAACACAAAAATAAAACAGAAAGACGCAGTTGATTACGTGCGTGAAGTGCTTAAATACCGCGGAGACGAAAAGTTACTCGACATGTTCAATAAATGGGAGGTGCCCAGGTTTCCGATGACAGGGAAGCTTCTTAAAGAAAATGGCGTACCGCCTGGGAAGATGTACGGCCAAATTATAAATAGGCTGAAGGAATACTGGATAGAACAGGAATATAAAACCTCTGCTGAGGATTTAACTAAACTAATACCAAGTTTAATAGATGAGTGTAAGACGAAATAA

Protein sequence:

>DPOGS202255-PA
MNLVFLKNLVVRSFRDYSNISNLKRVRTRRDMEIKTREDPIVLKLDTAEFHNIFTPEVIDLKKLFEKYQYEIRIAGGAVRDLLLGQPAKDLDFATTATPQQMKEMFTAENVRMINMSGEKHGTITPRINDKENFEVTTLRVDIVTDGRHAEVEFTKDWKLDANRRDLTINSMFLGFDGSVYDYFYGYEDLMKRKVAFVGDPDIRIKEDFLRIMRYFRFYGRISEKPDNHDRHTLDVIKQNAEGLQGVSGERIWMELKKTLQGNFAGDLLKTMLKLDIGKYIGLPKPNLEEFEGLLKRSEHLSLHPMTYLAGLLNTIDDVTILHARLKFSSYERDMAYFIVEHRPDKDASRPLLPYEKLVLNTKIKQKDAVDYVREVLKYRGDEKLLDMFNKWEVPRFPMTGKLLKENGVPPGKMYGQIINRLKEYWIEQEYKTSAEDLTKLIPSLIDECKTK-