Monarch geneset OGS2.0

DPOGS207583
Transcript        DPOGS207583-TA   2958 bp
Protein           DPOGS207583-PA   985 aa
Genomic position  DPSCF300072 + 802641-812368
RNAseq coverage   234x (Rank: top 43%)
Annotation
Heliconius       HMEL022624         0.0   73.99%
Bombyx           BGIBMGA004691-TA   0.0   61.14%
Drosophila       Parp-PB            0.0   45.96%
EBI UniRef50     UniRef50_E9JEI6    0.0   61.14%   Parp (Fragment) n=1 Tax=Bombyx mori RepID=E9JEI6_BOMMO
NCBI RefSeq      XP_001661932.1     0.0   52.93%   poly [adp-ribose] polymerase [Aedes aegypti]
NCBI nr blastp   gi|304421460       0.0   61.14%   parp [Bombyx mori]
NCBI nr blastx   gi|304421460       0.0   61.14%   parp [Bombyx mori]
Group
Gene Ontology    GO:0051287   0         NAD binding
                 GO:0003950   8.5e-75   NAD+ ADP-ribosyltransferase activity
                 GO:0006471   1.1e-44   protein ADP-ribosylation
                 GO:0003677   4e-30     DNA binding
                 GO:0008270   4e-30     zinc ion binding
                 GO:0005634   2.1e-20   nucleus
                 GO:0005622   5.1e-06   intracellular
KEGG pathway     aag:AaeL_AAEL011815   0.0
                 K10798 (PARP)   maps-> Base excision repair
InterPro domain  [2-985]     IPR008288   0         NAD+ ADP-ribosyltransferase
                 [772-980]   IPR012317   8.5e-75   Poly(ADP-ribose) polymerase, catalytic domain
                 [639-772]   IPR004102   1.1e-44   Poly(ADP-ribose) polymerase, regulatory domain
                 [500-627]   IPR008893   3.9e-39   WGR domain
                 [2-92]      IPR001510   4e-30     Zinc finger, PARP-type
                 [272-324]   IPR012982   2.1e-20   PADR1
                 [371-449]   IPR001357   5.1e-06   BRCT
Orthology group  MCL12217 Single-copy universal gene

Nucleotide sequence:

>DPOGS207583-TA
ATGACTGATTTACCTTATCAAGTTGAATATGCGAAAACAGGAAGAGCTTCATGTAAGGCCTGTAAGGCGAAAATTGATCAGGGGGATTTACGAATTGCGATTATGGTTCAGTCAGCCTTTCATGATGGAAAGCAACCAAACTGGCATCATGAAGAATGTTTCTTTAAAAAAAAGTGCCCTGAAAATATTTCTGATATTGCCAATTTTAATAAGTTAAAAAATGAAGATCAGAAAAGAATCAAAAGCAAACTAGGTACTGGCAATCCGTCTGGTGTAGTGTTGCCATCCGAAAAACCAAAAAAAGGAAAAGGTCAAAAAAGGGATAATAATGAAAAAGCTGGTCTTTCAAATTACTCAATAGAATATGCAAAATCAAGCAGAGCAACTTGTAAACACTGTGATATTAAGATTTGTAAGGATGAAGTAAGGGTTTCCAAAATGGGATATGATCCAAAGTATGGAGATCATCCAATGTGGCATCATGTTAAATGTTTTGCAGAGAGGCAAAGTGAGTTTTTATTTTTTGCTGGAGGAGAAGAAATCCCAGGTTTTAAAACATTGAAAAAAGAAGATCAAAATATGGTTAAAGATATAATAAAACCTTGTAAGGAAAGCGAAATACCAATCAAGAAATTAAAAATGGAACCGAAAGATGAAGCCGATATAAAGAAGGAAAAAGACTTGCAGAAGAAAATTGAAAAACAAAATAAAACATTCCATAAATATCGCAGTGCATTAAGTGATTTTTCAAAAAGTAATTTACATAAAATACTGACTGAAAACTCACAAGAAATACTTAAAGGGCAAAATGAGTGTCTAGATCACGTAGCTGATATGATGGCTTTTGGAGTTTTGGACCCTTGTCCCGAATGCAAAGGTCAACTTGTGCTAGACACCTTCTACTATAAATGCTCAGGCAATATAAGTGAATGGTCCAAATGTAGGTATACCACAAAAACACCTAAACGACATTCGATGAAAATTCACAAGGAACACAAGGATTTGGCGCCTTTTAAAAGCTTTAAATCTAAAGTATCGGAGAGAATATTCGAAGTTGAACCGCCACCAACTACTGTTGTCGTGAAGAAGGAAGAACCTGAAAGTTCACAAAAAGCTCTTCCGTTGCCTCCATTGAAAAATTTGCAATTTTTCCTGTATGGGGGCCTGAAAAATAAGGTGGAAACTAAAAATCGTATTTTGAAGATGGGTGGCTTAGTTGTCAGCAAACTGACCGAGACTCTTGCGGCAGTCGTGTCCACAAAGAAGGACTTAGAGAAGATGTCCGGCAAGATGCAGGACATACAAGACATGGATATTGAGGTGGTAGAAGAGTCATTTCTTGATTCAATTGACCCTGAGAATGGAACAATTGCTAAGTCGCTAGAACTGATAAAAGAAAACAATATAGCGGACTGGGGTTCCGATCCCACAAAACGCGTCCCCCAGGACGTGCTGGATGGAAAATCCATACAGAAATCCGGGAGCATGTACGCGAAATCAAAGTCCGCCATCACAAAACTTAAGATTAAAGGTGGAACAGCCGTGGACCCCGACTCAGGTCTAGAGGAGACGGCTCACGTGTACACGTCGCCCAATGGGGACAAGTACTCAGTGGTGCTGGGGAAGACGGACGTGGTGGCTGGGAAGAACTCGTACTATAAGTTGCAATTGCTTAAGGCTGATACTGGAAATAAATTTTGGCTGTTTCGGTCTTGGGGTAGAATCGGTACACCGATTGGTGGAAACAAGCTGGAGCCGTGCACCACCTTACACGATGCTATGGAAAAATTTGAAGATCTTTATCACGAAAGGACCCAAAACCATTGGAAGAAGCGACACAACTTCGTCAAGGTACCGGAAGCGTATGTGCCTATAGAATTAGACTATAGTGATGAACCAGCGCAAGCTCTCCAACAAGATGACAAGTGCTCGCTACCGACCAGTGTACAGAGCCTGCTGCAGAGGATCTTCGACATCGACACCATGAAGAAAACACTACT
AGAGTTTGAGCTTGATACAGAGAAAATGCCGTTGGGAAAGCTGTCCAAGAAACAGATCAAGTCCGGATATAACGTACTATCGGAACTACTACAATTACTTGAAAAGGGAGCGGCGAGTGAGAATAAAATTATAGACGCTACCAACAGGTTTTACACTCTCGTTCCACATAATTTCGGGACCGAGAATCCGCCGTTACTGAATAATGTTGAGTCAATCAAAGTCAAGACCGAGATGCTGGACAACTTACTTGAAATAGAAATAGCTTACAAGTCGGATGACGACGTTAGTCCGATGGAGGCACATTATCGCAAATTGAAAGCCGACATCGCTCCCATAGACAAGAAATCAGAGGAATTTAAAATGATTGTGGAATATGTGAAGAACACACACGCGGCCACACACTCTGGCTACACGCTGAACGTTCAGGAGGTGTTCAAAGTAGTCCGTGAAGGTGAAGAGAAGCGGTACAAGCCTTTCAGAAAACTTCACAACAGAAGGCTTCTGTGGCACGGATCCAGAACAACTAATTTCGCTGGAATATTATCTCAAGGTCTTCGAATCGCACCTCCCGAGGCGCCGGTGACTGGTTACATGTTCGGGAAGGGTATCTACTTCGCCGATATGGTGTCCAAATCCGCGAATTATTGTTGTACCTCGAAAACAAATAACATCGGTCTTATGCTGCTGAGTGAGGTCGCCCTCGGGGACATGAAGGAGTGTGTGAAATCCGAGTATGTAACGAAGCTGACGGACAAGCATTCAGTGTGGGGCGTGGGTCGCACGCAGCCCGACCCGGACCGCGCGAAGACCTTACCCGGAGGACTCGTAGTACCGCTGGGACCGCCTGTCAACAGAGACATAACCACCTCGCTGTTATACAACGAATTCATCGTGTACGACGTTGCTCAAGTGAACGTCAAATATTTAATACAAATGGAATTCGACTACAAGTATTGA

Protein sequence:

>DPOGS207583-PA
MTDLPYQVEYAKTGRASCKACKAKIDQGDLRIAIMVQSAFHDGKQPNWHHEECFFKKKCPENISDIANFNKLKNEDQKRIKSKLGTGNPSGVVLPSEKPKKGKGQKRDNNEKAGLSNYSIEYAKSSRATCKHCDIKICKDEVRVSKMGYDPKYGDHPMWHHVKCFAERQSEFLFFAGGEEIPGFKTLKKEDQNMVKDIIKPCKESEIPIKKLKMEPKDEADIKKEKDLQKKIEKQNKTFHKYRSALSDFSKSNLHKILTENSQEILKGQNECLDHVADMMAFGVLDPCPECKGQLVLDTFYYKCSGNISEWSKCRYTTKTPKRHSMKIHKEHKDLAPFKSFKSKVSERIFEVEPPPTTVVVKKEEPESSQKALPLPPLKNLQFFLYGGLKNKVETKNRILKMGGLVVSKLTETLAAVVSTKKDLEKMSGKMQDIQDMDIEVVEESFLDSIDPENGTIAKSLELIKENNIADWGSDPTKRVPQDVLDGKSIQKSGSMYAKSKSAITKLKIKGGTAVDPDSGLEETAHVYTSPNGDKYSVVLGKTDVVAGKNSYYKLQLLKADTGNKFWLFRSWGRIGTPIGGNKLEPCTTLHDAMEKFEDLYHERTQNHWKKRHNFVKVPEAYVPIELDYSDEPAQALQQDDKCSLPTSVQSLLQRIFDIDTMKKTLLEFELDTEKMPLGKLSKKQIKSGYNVLSELLQLLEKGAASENKIIDATNRFYTLVPHNFGTENPPLLNNVESIKVKTEMLDNLLEIEIAYKSDDDVSPMEAHYRKLKADIAPIDKKSEEFKMIVEYVKNTHAATHSGYTLNVQEVFKVVREGEEKRYKPFRKLHNRRLLWHGSRTTNFAGILSQGLRIAPPEAPVTGYMFGKGIYFADMVSKSANYCCTSKTNNIGLMLLSEVALGDMKECVKSEYVTKLTDKHSVWGVGRTQPDPDRAKTLPGGLVVPLGPPVNRDITTSLLYNEFIVYDVAQVNVKYLIQMEFDYKY-
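
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `cds_length_bp` and `span_bp` are hypothetical, and it assumes that the transcript consists of the coding sequence plus one stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record itself.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2958
PROTEIN_AA = 985
GENOMIC_START, GENOMIC_END = 802641, 812368


def cds_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Length in bp of a coding sequence for `protein_aa` residues.

    Each residue is encoded by one 3-nt codon; optionally count one stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def span_bp(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


# Does the protein length account for the whole transcript?
print(cds_length_bp(PROTEIN_AA) == TRANSCRIPT_BP)

# How long is the genomic interval compared with the transcript?
print(span_bp(GENOMIC_START, GENOMIC_END), TRANSCRIPT_BP)
```

Under these assumptions, 985 residues plus a stop codon give 986 codons, which matches the 2958 bp transcript exactly, and the genomic interval is longer than the transcript.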