Monarch geneset OGS2.0

DPOGS209718
TranscriptDPOGS209718-TA4035 bp
ProteinDPOGS209718-PA1344 aa
Genomic positionDPSCF300105 - 278196-290469
RNAseq coverage560x (Rank: top 23%)
Annotation
HeliconiusHMEL0113571e-13669.77% 
BombyxBGIBMGA008929-TA0.063.92% 
DrosophilamtRNApol-PA0.041.53% 
EBI UniRef50UniRef50_UPI0001791AF50.043.80%UPI0001791AF5 related cluster n=1 Tax=unknown RepID=UPI0001791AF5
NCBI RefSeqXP_001946877.10.043.80%PREDICTED: similar to CG4644 CG4644-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|1935991020.043.80%PREDICTED: DNA-directed RNA polymerase, mitochondrial-like [Acyrthosiphon pisum]
NCBI nr blastxgi|1935991020.043.80%PREDICTED: DNA-directed RNA polymerase, mitochondrial-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00038990DNA-directed RNA polymerase activity
GO:00036770DNA binding
GO:00063510transcription, DNA-dependent
KEGG pathway 
InterPro domain[397-1344] IPR0020920DNA-directed RNA polymerase, bacteriophage type
[863-943] IPR0240758.6e-06DNA-directed RNA polymerase, helix hairpin domain
Orthology groupMCL12102 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209718-TA
ATGCATCGACTACTATCGGCAAAAAGTTTATGCCAAAATAATTTACACAGTGTGACAGTGTCTGCATCAAATTCTTTAAAAAATATATCTTTACCCAAAATAAAATGTTCTTTTTGTCAGAAAATTCTTCTTACAACACCAACTGCAGAAAACCTGTTGTCGGCAAGGCATCAGTCCACGAGAACGGTTAATGCTCTGAAATCCTTGAAAAAGAAAAGCAAACATAAAAGCTACAAAAAGTACGGTGAACTCCTGCAAGTAAGTGAGACTAGCATGACGGAAATGCAGGTTTCAATAAATAAGTTAAATGCAGCACATCTATCAAAGCTTGCATCAAGTCCTGTGTCACTTGGACAACTTCATCAATTAACAACAAATCCTTCGAAAAAGTTGAAGGACATAGAGGTTGATAAAGAGCTTCTTCGAACAGTCAGAAATAAAATTTTGAAGAAGACAAGTCCCGATATTAAAATAGAAAATGATATGTGTGCTCTTATATTTACAAATCACAAAATATCAACAACTGAAGACAGGTTGAAAACAAAAAAGATCATGGATATGGTAAAAGAGAGTTACTATAATTTTAGAAAGGCAACAATTTATGATCAAAAACTACAAGACCTCAGGTATGCCATCAATGAAGGTTTGACACAGGAATTTAATTTTGAAACTGACTCTGAACTGAAAAACTTAGATAACTCTCTGCCAAAAAGTCCACATGCTGTTTTTAAAGAATTGTTTACCGACAAGCAAATAGACCGCTATGACCAAGAACTTATTAGTCATATAACACAGTTTCAAATGCAAGGTTTAGCACATCCAGCCATAGATGATATAGATGATCCATCTTTTCACACTGACCTCGGTGAGCTAAGAGATGAATCTATTTTCGATGGTGACACTGAACAAGTTAAGAGCCTAAAGATTAAGAAAGCTCAGCAAGCAATAAAACAAAAGAAGAAGAAGATGCGTGAGAAACGTCGGCAGGCTCAAGAAGCGAGTATGAAGCAGGACATGAAAGAATTGGAGCTCCAGGTTAAAGAAGATGCTTTGCAGAGATTATTAACATCACACTTGCAACTACTGTGTTCACTGGATATGATTTCAGAGGGCCGTCAAGTCTTACAGTATTATAGGAAAAGGAATGCTAAATCACCGGAGCTACCAAAACTCAGAAGTGTTAAAATTTACAACACATTGCTTAATGGATATGCTTCTGTGGGTAACATAGAAAATACAAAAGAACTATTATCATTTATGTCAGAAGACCAGATAGAACCAAACGCGCAGACCTACGCAGCTGTGTTTGAATGTGTGGAGAGAAGTCATCTGGCAGACAAGTCAGCAATATTAAATAATTTTCACAAAGAAATAAAAGACAAGGGCATGACGTTAAACGATTTGTTGGATCAAAGTCAATTCTTGTATGACCAGCGTGAGGTGGTTCTCAGAGCGGTGAGGAGGCTGCAGCCGGGTTTTGAACCACACTACACACCTCCGATACTGGACCACGAGTGTCCGCTACTTGAAGATTTGAAACTAGATAAAGTTAACAATAGATCGGGGCTGTTCACCTCGCCGGCTAAAGGATTGATGACCTTGGAACAGCTGAGAGAGAAGGGCAGGGAACAATTAGACATGGAAATTAATGGAGAGGTCGAAGTACACAACATTTCTTTAAAAGACGAAGCTTCCAAGGAGGTTCTGTTATATCGCGAGAAGCTATCTTCGTCGGAGGCCGAGTGGCGTAGCTCCCTCCGTGAGGCTCTAATCCGCCATCTTGCCACTCTGCGGGCTCGCAGCGGCGCCTCCCACGCCCCCGTTACCCTCTATCCGTATCTGAAGGTTTTAGAAGTTGACGAGTTTGTAGAATTAATGATGAATGAAATCATCAAACTAGTCGACGGAAGTGAATCTTACAGTCCCACCTTGAAGTTACTGCAGAGAGATCTGGGGACACAGGTCTATCAAAAATACCAGATAGAACAGTATCGCCGGAACGGTGTTCTGAAGAAGATCGAGCAAGTGTATGACAAATATTGCAAGTGGTACCTGGAGAGGCATTCCTTGGACGGCACAGACACTCCATACAACAGTAGACAGGCTTGGCAGTTGTTGGTACATCAGAACAGAGATGGCGCTAGTTTGGATGTCGAGGCATCCCCGTGGTCGATGGAAATGAGACAAAGTATAGGAAAGTTTCTATATAACATTATCATTAATGATGTCAAGGTTGATGTGAACATGTTCAAGCCTAACGCCCAAGTTAAGAAGTTGCCAGCAGTGTACAAGGTCCACCGTCCGTGGGGTCGCTTGGTCCGTCTCGAGCTGAAGCCTCACCCGACCCTATCCCGCCTCTGGTCCGCGGCGGCCCGGCCCCGCCTCCGCCTCCGCTCGTCGCTCGTGCCCGCCCGCTCGCCGCCCGCTCCTTGGCACAGCGCCACCGCCTCAGGAGCTTGTCTCCTCACTACTACATCACTTATACGGATGCCGTTCTATGTGATGGGTCTAACGAAAAGATTGGAAGAGGCTCCGCCGGCGACCATGTACCCAGTACTTGACGGATTGAACCAGCTGGGAGACGTGCCCTGGGTTATTAACCAGAGAATACTTGATTTACAACTCAAAGTCTTCAGATCGGGCGGCGACAAAAAGCTGGATATACCGCCTCCTGCGTCCTCGTTGGACGCATCTCAGTGGAAAATGGAAGGAAAGACTGGCGGGGAAGCGTTAAGGAGACGAGTGGTCATCAACAGGGCTAAGGCAGACATGCACTCCCTGTGGTGCGACGCGCTTTACAAACTGTCACTGGCCAATCACTACAGGAACGTAACATTCTGGTTGCCTCACAACATGGACTTCCGCGGTCGTGTGTACGCGGTGGGTCCGCACGTGTCGGCGCTGGGTCCGGACGCAGCGCGCGCCCTCCTGCGGCTGGCAGGCGTCCGGCCGCTCGGAGCGCGCGGACTCGACTGGCTCAAGATACACGCTGTCAACCTCACCGGCACCAAGAAGAGGAGCACCGTGGAAGAAAGGTACAGATACTTATTTAATATGGCAACCTTATTGTGTTGGATTAACGTTGAATATCGTACAGAGTACGAGTGCGGGTTCCCCGTGCATCAGGACGGGTCGTGTAACGGTCTGCAGCACTACGCGGCCCTGGGCGGGGACGCGGCGGGCGCGGCGGCCGTCAACCTGGCGCCCGCGGACAGGCCGCAGGACGTCTACAGCGAGGTGGCCGCACTGGTGGAGACGATGCGGGTCCGCGACGCGGCGAAGGGTGTGACGGCTGCTATGGTGTTGGAGGGGTTCGTGAGGAGGAAGGTCATCAAGCAGACGGTCATGACCACCGTGTATGGAGTCACCAGGTTCGGTGCGAGGCTGCAGATAGCGAAGCAACTTAAAGATATTGACGAGTTCCCCAAAGAGTACGTGTGGCCGTGCTCGCAGTATCTAACAGCTCGCACCTTCGACTCGCTGAGAGAGATGTTCGCCTCCACCAAGCTCATACAAGACTGGTTCACAGACTGCGCCAAGATGATCTCAGGCGTGTGCGGTGAGAGTGTGGAGTGGGTGACGCCTCTCGGCCTGCCCGTGCTGCAGCCGTACTACAGGCGACCGCCCGCTCAGGACACACAGCAGCGGCCGTGTACTATGAAGCAGCGTAACGCGTTCCCTCCTAACTTCATCCACTCCCTGGACGGGTCTCACATGATGCTGACGGCGCTCCGCTGCGGCGCGAGGGGCCTGACGTTCGTGTCCGTCCACGATTGCTTCTGGACGCACCCGGACACGGTGGACGACATGAACAAGATATGTCGAGAGCAGTTTGTCGCTCTTCATTCACAACCGATCTTGGAAGATTTATCAGATTTCCTTGTGAAGCGATATAGCTACCCTGAAAGTGAAATTGAAGCTAGCAACGTCGGTGCGGCGAACAAGAAGCGAGTCAACGCTCTGCTGAGAAGGGTACCGGAAAAAGGCGACTTCGACATCAACAGCGTTCTGAAATCAGTTTACTTTTTCAGTTGA

Protein sequence:

>DPOGS209718-PA
MHRLLSAKSLCQNNLHSVTVSASNSLKNISLPKIKCSFCQKILLTTPTAENLLSARHQSTRTVNALKSLKKKSKHKSYKKYGELLQVSETSMTEMQVSINKLNAAHLSKLASSPVSLGQLHQLTTNPSKKLKDIEVDKELLRTVRNKILKKTSPDIKIENDMCALIFTNHKISTTEDRLKTKKIMDMVKESYYNFRKATIYDQKLQDLRYAINEGLTQEFNFETDSELKNLDNSLPKSPHAVFKELFTDKQIDRYDQELISHITQFQMQGLAHPAIDDIDDPSFHTDLGELRDESIFDGDTEQVKSLKIKKAQQAIKQKKKKMREKRRQAQEASMKQDMKELELQVKEDALQRLLTSHLQLLCSLDMISEGRQVLQYYRKRNAKSPELPKLRSVKIYNTLLNGYASVGNIENTKELLSFMSEDQIEPNAQTYAAVFECVERSHLADKSAILNNFHKEIKDKGMTLNDLLDQSQFLYDQREVVLRAVRRLQPGFEPHYTPPILDHECPLLEDLKLDKVNNRSGLFTSPAKGLMTLEQLREKGREQLDMEINGEVEVHNISLKDEASKEVLLYREKLSSSEAEWRSSLREALIRHLATLRARSGASHAPVTLYPYLKVLEVDEFVELMMNEIIKLVDGSESYSPTLKLLQRDLGTQVYQKYQIEQYRRNGVLKKIEQVYDKYCKWYLERHSLDGTDTPYNSRQAWQLLVHQNRDGASLDVEASPWSMEMRQSIGKFLYNIIINDVKVDVNMFKPNAQVKKLPAVYKVHRPWGRLVRLELKPHPTLSRLWSAAARPRLRLRSSLVPARSPPAPWHSATASGACLLTTTSLIRMPFYVMGLTKRLEEAPPATMYPVLDGLNQLGDVPWVINQRILDLQLKVFRSGGDKKLDIPPPASSLDASQWKMEGKTGGEALRRRVVINRAKADMHSLWCDALYKLSLANHYRNVTFWLPHNMDFRGRVYAVGPHVSALGPDAARALLRLAGVRPLGARGLDWLKIHAVNLTGTKKRSTVEERYRYLFNMATLLCWINVEYRTEYECGFPVHQDGSCNGLQHYAALGGDAAGAAAVNLAPADRPQDVYSEVAALVETMRVRDAAKGVTAAMVLEGFVRRKVIKQTVMTTVYGVTRFGARLQIAKQLKDIDEFPKEYVWPCSQYLTARTFDSLREMFASTKLIQDWFTDCAKMISGVCGESVEWVTPLGLPVLQPYYRRPPAQDTQQRPCTMKQRNAFPPNFIHSLDGSHMMLTALRCGARGLTFVSVHDCFWTHPDTVDDMNKICREQFVALHSQPILEDLSDFLVKRYSYPESEIEASNVGAANKKRVNALLRRVPEKGDFDINSVLKSVYFFS-