Monarch geneset OGS2.0

DPOGS205534
TranscriptDPOGS205534-TA1848 bp
ProteinDPOGS205534-PA615 aa
Genomic positionDPSCF300056 + 431752-440111
RNAseq coverage734x (Rank: top 18%)
Annotation
HeliconiusHMEL0112870.085.96% 
BombyxBGIBMGA000084-TA0.082.10% 
Drosophilahrg-PA0.062.96% 
EBI UniRef50UniRef50_Q17AA50.066.30%Poly a polymerase n=7 Tax=Coelomata RepID=Q17AA5_AEDAE
NCBI RefSeqXP_001650777.10.066.30%poly a polymerase [Aedes aegypti]
NCBI nr blastpgi|3838624070.071.05%PREDICTED: poly(A) polymerase gamma [Megachile rotundata]
NCBI nr blastxgi|3072012880.069.30%Poly(A) polymerase gamma [Harpegnathos saltator]
Group
Gene OntologyGO:00056342.6e-304nucleus
GO:00046522.6e-304polynucleotide adenylyltransferase activity
GO:00436312.6e-304RNA polyadenylation
GO:00063513.1e-106transcription, DNA-dependent
GO:00311236.4e-40RNA 3'-end processing
GO:00037236.4e-40RNA binding
GO:00167796.4e-40nucleotidyltransferase activity
KEGG pathwayaag:AaeL_AAEL0053560.0 
 K00970 (E2.7.7.19, pcnB)maps-> RNA degradation
InterPro domain[10-616] IPR0144922.6e-304Poly(A) polymerase
[29-375] IPR0070123.1e-106Poly(A) polymerase, central domain
[376-535] IPR0110686.4e-40Nucleotidyltransferase, class I, C-terminal-like
[377-534] IPR0070101.9e-39Poly(A) polymerase, RNA-binding domain
[107-184] IPR0029341.6e-08Nucleotidyl transferase domain
Orthology groupMCL10471 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205534-TA
ATGTGGCCGGCATCTCAATATTCGCATACAAATCACCAGGCCAACGCCTCCAAGTCCAATGAACACCAAAATCAACAGAACCTGAAGACGCTCGGCATGACTTCAGCTATTTCTATGGCAGGTCCGAAACCCATCGACATTGAAAAGACAAATGAGCTCAAGGAATCCCTGGTGCCGTTTGGTGTGTTTGAATCCGAGGCTGAGATGCATCACAGGATGGAGGTGCTCGGATCCTTACATCGGCTGGTCAGGCAGTGGATAAGAGACGAATCCTTGAGGAAGAACATGCCACCCAGCGTAGCTGACACAGTCGGAGGCAATATATATACATTCGGATCATACAGGCTCGGGGTGCACCACCGAGGCGCGGATATTGACGCCTTGTGCGTGGCTCCAAGACATATCGACCGGTCGGACTACTTCCAGTCATTCTACGAACTGCTCAAGGAACAACCTCAAGTGAAAGATCTCCGAGCTGTGGAGGACGCGTTCGTGCCCGTCATTAAGATGAACTTCGACGGTATCGAAATAGATCTGTTGTTTGCCAGACTAGCTCTCAAGGAAATACCAGATTCCTTCGACCTCCGAGACGACATGCTCCTCAAGAACCTGGACCAGAAGTGCGTGAGGTCGCTGAACGGGTGTAGAGTCACCGATGAAATACTGAGATTGGTCCCCGATATAAATACCTTTAGACTCACCTTGAGGGCTATCAAGCTGTGGGCCAAACGGCATGGGATATATTCTAATACCCTGGGCTACCTCGGCGGAGTGTCCTGGGCCATGCTAGTGGCGCGAACCTGTCAGTTGTATCCCAATGCGTTACCAGCTACATTACTACACAAGTTCTTCCTCGTCTTCAGCCAGTGGAAGTGGCCGCAGCCAGTACTCCTCAAACCACCGGACTCAGTCAATCTGGGATTCCCCGTTTGGGATCCGAGGGTTAACATGTCGGATCGCTACCACCTGATGCCCATCATAACACCGGCTTACCCACAACAGAACTCCACGTTCAATGTGTCGTCATCCACGAGGACGGTCATCATGGAGGAGTTCAGGCTGGGTCTTGCTATAACTGATGAGATAATGCTCGGAAAGTGTGGCTGGGAACGGTTGTTTGAAGCTGCAAATTTCTTCTCCCGCTACAAACACTTCATAGTACTGCTTGCATCATCGGCTAACACCCTGGATCAGCTGCCCTGGTGCGGGCTGGTCGAGAGCAAGATACGACACCTCATCACCACACTGGAACGAAACCAGCATATAACAATTGCTCATGTGAACCCGGAGTGTTACAACTCCGTGCCTCTCAATACTAACAACGGACATCCGCTCGCCTTACCTCCAGGTACACCAGTACAAACAGAGGAACACGGCGCCGCTGAAGTTAAAAATGATAAGGGCGAGATAGTGGCAAACGTCTGCTCAATGTGGTTCATAGGTCTGGTGTTTGACAAGACCAATGTCAATGTTGACCTCACATATGATATATCGTCATTCACAAAGGCCGTACACTACCAGGCCGAGAACACTAATGTACTTAGAGAGGGAATGACTATAGAGGCTCGTCATGTTCGTCGTAAGCAACTTCATCAATACCTGTCTCCGTCACTACTAAGGAGAGAAAAAGTTAACAAGAGAAAGAATGAAACACTCGCTGTTCATACAAAGAAGGCTAAGAGGGTATCGGAAAGCAGTGCGGATGAGGTGAGCGTGCTATCGTACACCGAGGACTCGAACTCGTCTAACATGTATGAAGTGAACGTACAGAACGGCGCGCATCAAGAACAGAAGACGAGCGAGAAGGTCGACAGGGGGTCCAGCTCGAGCGGCATAGCGTGCACGTAG

Protein sequence:

>DPOGS205534-PA
MWPASQYSHTNHQANASKSNEHQNQQNLKTLGMTSAISMAGPKPIDIEKTNELKESLVPFGVFESEAEMHHRMEVLGSLHRLVRQWIRDESLRKNMPPSVADTVGGNIYTFGSYRLGVHHRGADIDALCVAPRHIDRSDYFQSFYELLKEQPQVKDLRAVEDAFVPVIKMNFDGIEIDLLFARLALKEIPDSFDLRDDMLLKNLDQKCVRSLNGCRVTDEILRLVPDINTFRLTLRAIKLWAKRHGIYSNTLGYLGGVSWAMLVARTCQLYPNALPATLLHKFFLVFSQWKWPQPVLLKPPDSVNLGFPVWDPRVNMSDRYHLMPIITPAYPQQNSTFNVSSSTRTVIMEEFRLGLAITDEIMLGKCGWERLFEAANFFSRYKHFIVLLASSANTLDQLPWCGLVESKIRHLITTLERNQHITIAHVNPECYNSVPLNTNNGHPLALPPGTPVQTEEHGAAEVKNDKGEIVANVCSMWFIGLVFDKTNVNVDLTYDISSFTKAVHYQAENTNVLREGMTIEARHVRRKQLHQYLSPSLLRREKVNKRKNETLAVHTKKAKRVSESSADEVSVLSYTEDSNSSNMYEVNVQNGAHQEQKTSEKVDRGSSSSGIACT-