Monarch geneset OGS2.0

DPOGS213770
TranscriptDPOGS213770-TA1764 bp
ProteinDPOGS213770-PA587 aa
Genomic positionDPSCF300212 + 161337-165900
RNAseq coverage482x (Rank: top 26%)
Annotation
HeliconiusHMEL0114420.093.30% 
BombyxBGIBMGA009257-TA0.090.76% 
Drosophilaatms-PA2e-15668.92% 
EBI UniRef50UniRef50_UPI000179166B1e-16652.20%UPI000179166B related cluster n=1 Tax=unknown RepID=UPI000179166B
NCBI RefSeqXP_624998.24e-17576.76%PREDICTED: similar to antimeros CG2503-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|3407281689e-17777.30%PREDICTED: RNA polymerase II-associated factor 1 homolog [Bombus terrestris]
NCBI nr blastxgi|3320223640.064.81%RNA polymerase II-associated factor 1-like protein [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain[1-585] IPR0071331e-269RNA polymerase II-associated, Paf1
Orthology groupMCL12210 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213770-TA
ATGCCTCCAACCGTACAAACTAACGAGCCAGAACGAGAAAAGCGTCCTCAAAGAACTGGAGAGAGAAGATCGGAGCTTGTTACTCGGGTTAAATATTGCAATACCTTACCGGATATACCATTCGATTTAAAATTCCTTACATATCCATTTTCTTCTACTCGTTTTATTCAATATAATCCGACCTCTTTGGAAAAAAATTATCGCTATGAAGTACTAACAGAACACGATCTTGGTGTCCATATTGATTTGATAAATCGAGATATCTACCAAGGAGATGGTAATGCACAATTAGATCCAGCAGATGAAAAGCTTTTAGAAGATGACGTTCTTACGCCTCAGGATTCGAAGCGTTCCCGACATCATGCCAAAAGCGTATCCTGGCTGCGCCGTTCGGAATACATTTCAACAGAACAAACCAGATTTCAACCCCAGTCTATGGAAAAGGTTGAAGCGAAAGTTGGTTACAATGTCAAGAAAATATTCAGCGAAGAAACATTATACATGGATAGAGATAGTCAAATTAAAGCTATTGAAAAGACATTTGAGGATAACAAGAAGACAATTGAGAAGCATTATAGCAAACCCGGAGTCACTCCGGTTGAGATAATGCCTGTTTTCCCCGACTTTGAAATGTGGAAGTACCCATGCGCTCAGGTTATATTTGACTCGGATCCAGCACCGGCGGACAAAAATATTGCCGGACAGATTGAAGCTATGTCGCAGGCTATGATTAGGGGTGTCATGGATGAGAGTGGTGAACAATTCGTTGCATACTGTCTGCCGACGGAGGACACTATACAGAAGAGACGCCGGGACATCACCGAGGGAATTCCCTACATGGATGGGGACACCTATGAATATAAAATGGCCAGAGAATATAATTGGAATGTTAAGAGTAAAGCTTCCAAAGGTTACGAAGAAAACTATTTCTTGGTGGTCCGTAATCATTGTATATATTACAATGAGCTTGAAACACGCGTCCGTCTGTCCAAGCGTCGTGCGAGGGCTGGTGCTGCTGCCCAAGCTCTAACAAGACTGGTTGTAACACACAGACCGTTGAACGCAGCTGAACACAGAATACTTAGACTTAGAGAGAAACAGCTGGAACCTGTACAGGACGAAGATGAAGAAGAAGAGGATGAGGAGGAGGAGGAGGAAGATAAACAGGAACAGGCTGAAGAAAGAAGATCGCGTTCCAGGTCAAAATCGGGTTCGAAATCTCCTCAAGGGTCAGAGAAGTCCAAGGCTTCCAGGTCTAGATCTCGCTCCAGGTCCAAATCTAGGTCAAGGTCGGCGTCGGGGTCTCCAGCACGTTCCAAGTCCAAGTCCAGGTCGCGGTCCCGCAGCCAGTCCGGCTCTAGGAAGTCGAGGTCCAGGTCCGCATCAGCTCAATCTGGATCGGCCAGGTCTGGGTCGGCTCGTTCCCGTTCTGGTTCCAGGGCTTCATCTCGCGCCAGTTCCAAGAGCAAAGCGTCCAGCAGACGGTCCAGGTCCAGGTCTAGAAGAAGCTCTAAGGGCAGTGATAAGGGGTCCAAGCAGTCGTCGCGGGCGTCGTCTCGTGCGTCGTCGGTCGCTTCGCGGTCGTCCCGCAAATCTTCCCGTGCCTCGTCCGCGTCGAAGTCGTCTCGCAAGTCTTCCCGGGCCAGCTCCCGCGCCAGGTCCAGCGGATCACGTTCCAGAAGCAGGTCTGCCAGCGGTTCAAGACAAGGTTCCAGAAGCGGTTCCGCATCAAGTGCCTCATCACGTCATTCAGGCTCCGATTGA

Protein sequence:

>DPOGS213770-PA
MPPTVQTNEPEREKRPQRTGERRSELVTRVKYCNTLPDIPFDLKFLTYPFSSTRFIQYNPTSLEKNYRYEVLTEHDLGVHIDLINRDIYQGDGNAQLDPADEKLLEDDVLTPQDSKRSRHHAKSVSWLRRSEYISTEQTRFQPQSMEKVEAKVGYNVKKIFSEETLYMDRDSQIKAIEKTFEDNKKTIEKHYSKPGVTPVEIMPVFPDFEMWKYPCAQVIFDSDPAPADKNIAGQIEAMSQAMIRGVMDESGEQFVAYCLPTEDTIQKRRRDITEGIPYMDGDTYEYKMAREYNWNVKSKASKGYEENYFLVVRNHCIYYNELETRVRLSKRRARAGAAAQALTRLVVTHRPLNAAEHRILRLREKQLEPVQDEDEEEEDEEEEEEDKQEQAEERRSRSRSKSGSKSPQGSEKSKASRSRSRSRSKSRSRSASGSPARSKSKSRSRSRSQSGSRKSRSRSASAQSGSARSGSARSRSGSRASSRASSKSKASSRRSRSRSRRSSKGSDKGSKQSSRASSRASSVASRSSRKSSRASSASKSSRKSSRASSRARSSGSRSRSRSASGSRQGSRSGSASSASSRHSGSD-