Monarch geneset OGS2.0

DPOGS207122
TranscriptDPOGS207122-TA1548 bp
ProteinDPOGS207122-PA515 aa
Genomic positionDPSCF300001 + 3427741-3429577
RNAseq coverage23x (Rank: top 78%)
Annotation
HeliconiusHMEL0095997e-17861.09% 
BombyxBGIBMGA013083-TA3e-16358.43% 
DrosophilaCG32104-PB4e-5030.37% 
EBI UniRef50UniRef50_D6WZF81e-6634.83%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WZF8_TRICA
NCBI RefSeqXP_970145.22e-6734.83%PREDICTED: similar to CG32104 CG32104-PB [Tribolium castaneum]
NCBI nr blastpgi|3504236801e-6635.69%PREDICTED: RNA polymerase II-associated protein 1-like [Bombus impatiens]
NCBI nr blastxgi|3504236802e-7134.68%PREDICTED: RNA polymerase II-associated protein 1-like [Bombus impatiens]
Group
KEGG pathway 
InterPro domain[330-394] IPR0139293e-20RNA polymerase II-associated protein 1, C-terminal
[202-240] IPR0139301.1e-12RNA polymerase II-associated protein 1, N-terminal
Orthology groupMCL14163 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207122-TA
ATGATAAGACGTCCCAAGAAAGGTGAAAATGAAGAAGATTTATTGCGAATGCAAGAAGAATTTCTAAGAGAAAAAAATGCGCCTTCAGCACAAGTGGTAAATTTACGTAAAACCGAACACCAAACAACTAAAAGAACTAATTCTAGTACTTCAGACAGAAAGCTATCTAAATATGCTAAATCTAAAGGACTTCAAAATTCGGAAAAAAGGACTAAAGTTGATAACAGTACTGGTTCCCTTTTTGGAGACATAATGGAAAAGAATGTGTCTGAAGAACCACAACCAGAACGTACGGAATTCGAAGATGATAAAGTTTATTATCCTAAAGTGCTTCCATTTGTTCTTGGTGATATAGTGGAAAAAAGCAATGATGACATTTTAAGCTTGGATTTTAAGATGACACCCCAAGGCTTTCCAGCTGCTATCAAAAATGATTTAAAATTGAAACCTATCCCAAAGAAAGGGTCCCTACCCTTTAAAAAATTAGGTGACATTGAAGAAGAAAAGATGGATATTGATTCGTCTTCCGATCATCATGCAAGTAATACATCAAAGTTAAATATTCCTAACAAAAGTTATATTCTCAATTCAAATGAGGCAAATGCTATTCACAGTGAAAATGTGAATACGCTCAGTAAAATGACAGAAGAACAGATATTATCTGAACAACATAAACTGTTGTCTAGCTTGGACCCAAAACTGGTAGATTTTATAAAAAGTGTGAGGAAACCAAGTAACACTGATCACATACAACTTGAAAATCAGTCACAAAATCAATTAATGGATGTTTCTGAGCCTAAACAAGAGGAGACAGAAAAAGTTGTACAAGAAAATGATCCAGTTAATAATGATACGCTATGGGAGAGTGATGTGCTTTCTCATCCACATATCAATCAATGGATTCATTTTAATGATTTAGAAAAAGAAAAATTAGAATGGATGAAAGGCATTGAAGAGAGTAAAAAACTTAAACCTAATGAACCTTTTGAAGCAAGATTTGATTTTAAAGGCTACCTTCTACCTTATACTATGGAGTATACTGAGGAAACAAAAACTTTGTTTCATCATGGTGAGGAACCACACCGACCAGGCTACTCCATTACAGAACTCATTGAGCTCTCTCGCTCTACTATCATACAACAAAGAGTTATGGCTCTAAATACTATAGCTGAGCTTTTAGAATATTACATTTCAGATGTGATAGAAATTCCACTGAGCAAACTATTTTTTGTTATCAGAATTGCTATGGATGAAAATAAGACCATTCTGTTACAAGCAGCACTTAAAGCTATGAGAAATTTACTGTACAACAGAATTGATGAAGCCTGTCTTGATGCTTTATTGGGATTTGAAGAAGGCTCTTATCAGCCTTGTTTAGAAAATGATAAATCAGAAATTTCTGAAATAGAATCAGAGGAATCCGAACTAAAAGATTTTCACTTGGCTGAAATAGATCTTTTGTCCGCTGTGCTTAGAACAGATATATTACAAAGACTTTACTATATCTTAGAATGTAACCTCTATCAGAATACATACGATCAGGTGTAG

Protein sequence:

>DPOGS207122-PA
MIRRPKKGENEEDLLRMQEEFLREKNAPSAQVVNLRKTEHQTTKRTNSSTSDRKLSKYAKSKGLQNSEKRTKVDNSTGSLFGDIMEKNVSEEPQPERTEFEDDKVYYPKVLPFVLGDIVEKSNDDILSLDFKMTPQGFPAAIKNDLKLKPIPKKGSLPFKKLGDIEEEKMDIDSSSDHHASNTSKLNIPNKSYILNSNEANAIHSENVNTLSKMTEEQILSEQHKLLSSLDPKLVDFIKSVRKPSNTDHIQLENQSQNQLMDVSEPKQEETEKVVQENDPVNNDTLWESDVLSHPHINQWIHFNDLEKEKLEWMKGIEESKKLKPNEPFEARFDFKGYLLPYTMEYTEETKTLFHHGEEPHRPGYSITELIELSRSTIIQQRVMALNTIAELLEYYISDVIEIPLSKLFFVIRIAMDENKTILLQAALKAMRNLLYNRIDEACLDALLGFEEGSYQPCLENDKSEISEIESEESELKDFHLAEIDLLSAVLRTDILQRLYYILECNLYQNTYDQV-