Monarch geneset OGS2.0

DPOGS203818
TranscriptDPOGS203818-TA1158 bp
ProteinDPOGS203818-PA385 aa
Genomic positionDPSCF300010 + 2248508-2250277
RNAseq coverage421x (Rank: top 29%)
Annotation
HeliconiusHMEL0133315e-14071.69% 
BombyxBGIBMGA003724-TA1e-17273.77% 
DrosophilaCG15925-PA1e-1824.93% 
EBI UniRef50UniRef50_E2AX482e-7741.14%Poly [ADP-ribose] polymerase 16 n=1 Tax=Camponotus floridanus RepID=E2AX48_CAMFO
NCBI RefSeqXP_624248.21e-6837.53%PREDICTED: similar to poly (ADP-ribose) polymerase family, member 16 [Apis mellifera]
NCBI nr blastpgi|3071691827e-7741.14%Poly [ADP-ribose] polymerase 16 [Camponotus floridanus]
NCBI nr blastxgi|3071691822e-7541.28%Poly [ADP-ribose] polymerase 16 [Camponotus floridanus]
Group
Gene OntologyGO:00039504.2e-12NAD+ ADP-ribosyltransferase activity
KEGG pathwayppp:PHYPADRAFT_1687429e-08 
 K10582 (UBE2Q)maps-> Ubiquitin mediated proteolysis
InterPro domain[163-314] IPR0123174.2e-12Poly(ADP-ribose) polymerase, catalytic domain
Orthology groupMCL11464 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203818-TA
ATGGATGGCGAACCAGAAGTTTTTTCGGATACCGGTACAAATTGCAATTTTGCATCGGCATCGGAAACGTTGTCAGACAACAATCAAGCTAAGTTGGACTCATTAGAAAAAAAGGCGGTTCACTTAAGACTAGTTTTAGAAAAAGATTTTAAGGCCGCTGATATAAAATGGAGTCTATTTGTCGCAGCAGCATTTAGTTTTAGGTACGAAAGCTGTCTGAGGCCTTTTCCACCGATATTTATGAAAAATGGAATCAAGGATATGGATGAATTACTCAGCGTTATTACGGATGTACCAGCTTTGGATTTGGTTTTACAACAGTTGGACAATCTCGATAACTTGGCAAATATAAGTGACATTCTTGATTTACTTTTTTATGTGTTGGTTAGATTAAAAGAACCTAGTTTAAAAACAATACCACCAGAGGCTCATGAATCCGTTTTAATAAACGTACATTCATTCCTGCCTGCACCTAAACCCCAATACATATTCCAAGTGGTAAACTCTTGCAAGTCGCACTCTGAAATGAAATGGAAGGAGTTATCCAAAGACCAGAAGGTGTTTTATGCGTATCACGGCAACCGCTTGGAAAACTTTTATACCATTTTACATTTCGGACTTCAGCAACATTTGAACAAGGCGGCTATAATGGCAAATGGTGTTTACCTGTCAACGGAACTGAGTATGAGTCTACCTCACAGCCACGGGGGCTTCGGGTGGGGGGCGAGTTGCATCGGAGGTCATCTCTCATGTATAGCTATGTGCGAAGTTATAGACGCTCCCGAAGGCATTAATTATTATAAACCGATTTCCAACGAAGGCGACGGTACCTACGAAGATGATAAGACTAAGGAGTCTGATGACAACAACTTGAACACCAGAACAGCTAGCTACGTTGTTACCAACAGCGAGTTGATGCGTATGCGATACTTACTTGTGTACGCCAAACAGCCAACGTCAATGAGGTTCTCAACAACGAGCACAAATAGAAATGCTGGTGGAATTCGACAATGGCTGACAAGACACAAGCTAGTTTCCATATTGCTTGGTTATGGATTGATGCTCGCAACCATTGGCTTCGCTAATAATCAACCCATCCACTACTATTACAAAATTTTGTTAAAGAAGTTAGATATTGCATTGAGCAATGTTAAATAG

Protein sequence:

>DPOGS203818-PA
MDGEPEVFSDTGTNCNFASASETLSDNNQAKLDSLEKKAVHLRLVLEKDFKAADIKWSLFVAAAFSFRYESCLRPFPPIFMKNGIKDMDELLSVITDVPALDLVLQQLDNLDNLANISDILDLLFYVLVRLKEPSLKTIPPEAHESVLINVHSFLPAPKPQYIFQVVNSCKSHSEMKWKELSKDQKVFYAYHGNRLENFYTILHFGLQQHLNKAAIMANGVYLSTELSMSLPHSHGGFGWGASCIGGHLSCIAMCEVIDAPEGINYYKPISNEGDGTYEDDKTKESDDNNLNTRTASYVVTNSELMRMRYLLVYAKQPTSMRFSTTSTNRNAGGIRQWLTRHKLVSILLGYGLMLATIGFANNQPIHYYYKILLKKLDIALSNVK-