Monarch geneset OGS2.0

DPOGS208771
TranscriptDPOGS208771-TA1707 bp
ProteinDPOGS208771-PA568 aa
Genomic positionDPSCF300036 - 891935-893641
RNAseq coverage104x (Rank: top 60%)
Annotation
HeliconiusHMEL0154170.061.89% 
BombyxBGIBMGA007632-TA6e-18055.74% 
DrosophilaCG34183-PA1e-2948.12% 
EBI UniRef50UniRef50_Q5I0E61e-5230.02%RNA polymerase II subunit B1 CTD phosphatase Rpap2 n=50 Tax=Theria RepID=RPAP2_RAT
NCBI RefSeqXP_001650225.16e-3958.70%hypothetical protein AaeL_AAEL005036 [Aedes aegypti]
NCBI nr blastpgi|3838584455e-7636.46%PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like [Megachile rotundata]
NCBI nr blastxgi|3838584459e-8136.86%PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[58-130] IPR0073083.2e-21Protein of unknown function DUF408
Orthology groupMCL15860 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208771-TA
ATGGAATTTAAAAAAAGTACAAAGCGACCACCTAAAATCGAAGAAATGTCCAAGGAACAAATACGAAAAGCTATCATTAAAAAACGCGAGTGTAACGCCAAAGCACAAAACATAGTTGAAAAATTACTTGAAAAGTGTGTGAATGAGGAATATTTTTTAAAATGCTTACTCGATATTAACCAAAGTCATTTCGATGATGTGATTGAGGAACGGTCTATATTACAACTATGTGGTTATCCATTATGTCAGAGAACATTGTTAGAGAAGGACATTCCTAAACAAAAATACAGGATATCGTTGAAAACAAATAAGGTTTACGATATAACAACAAGGAAATGCTTCTGCAGTAACATCTGTTACAAATCGGCAATGCATGTTAAAAAACAAATGTTGACGAGTCCTTTATGGTTTAGGGAATATGAAGAAATACCGACAGGACAAGAGGTAGACTTGGGTGGACCCGCAAAAATAGAAATAAATAAAGATGATTTTATCACCACTTCACAATTCACTAAATCCAGCTTTCAGCATGCATCTGATATTGTAGATTCAAATAAGATTGATGTCCATAAAATTGAAACTAAATTATTAACCAATAGTACGGATGTAATAGGAAGCAACATAGTCAATACTAATGATGAAAATATTACACCCAGTAATCATTTAGAATCTTCTAAACCGTCATACACACAGGAACCTGATGTGAGGAAGAAACCTAATAAGAAAACAACAAACCCATTAAATATTGTTGGTGATATAGTGGAGAAACCAGAGAAACAGATTGACCCTATACTCATTAATAGACCTTCAAGTAAAGACAAAGAACCTGAGAAACCAACCACCAGCTTAACCAGAACTATACATCAGAAGAAACCGCCTTCAATCACAGCCATAACAATAAATGTAGAAAAATGTTTAGCGGAATGGTGTACGATTGACACACTGCTGTTTATATATGGGGAGGAAAACGTAAAGAAAATGTTATCCAACAAAGGACAATTTATAACAGACTACTTAAACAATTACTCCAAAAGCATTTTCTACACTTCAAACACATACGACCAGTACCAAGCATTGTGCCGCAAACTTAACTTGTTAGAATTAGAATCGAGACGACAAGATGCTCAGATATTAAATAAAGAAACTAGACCATTACCAGATTATTCAATTCTGAAGGAAGAGAGTAAGAAAATACAGTTTAAAGTCAGAGCCTTTTTTGCGGGAGAAATTGAAATCCCCGAACCAGAGGAGCCCACGGAGGTTGATGCATCCAATGAACATGATAACTCAACTGTGTTACCACTAGTTGATAAGAATTCACAAAATGCACTGCGAAGGAAAATTGTTTGCCAGCATTTAAACAAGGTGTTGCCAGATTTATTACGATCACTTGGCTTATTAAATCTAACAATCAGTTCTGACATACGACTGCTTGTAAATACATTCAAATTGAAAGCAGACAATATTATGTTCAAACCTATACAGTGGACATTGATCGCTTTAGTGTTTATAAAATTGTTATCTATAAGGGACGAACAATTAAAAGGTTTACTGGAGCATGAAACGGCATTCAAACACATGCAGCTCTTGTTGCTCAGTTATAATCAAGACGGAGGTTATTTAGACAGGCTCATTTCTTGGTTGACCGATGTAAATAGGTTACTCGACGTAAATGATAATCAAATGACTATTGAAAAAAATATGTAG

Protein sequence:

>DPOGS208771-PA
MEFKKSTKRPPKIEEMSKEQIRKAIIKKRECNAKAQNIVEKLLEKCVNEEYFLKCLLDINQSHFDDVIEERSILQLCGYPLCQRTLLEKDIPKQKYRISLKTNKVYDITTRKCFCSNICYKSAMHVKKQMLTSPLWFREYEEIPTGQEVDLGGPAKIEINKDDFITTSQFTKSSFQHASDIVDSNKIDVHKIETKLLTNSTDVIGSNIVNTNDENITPSNHLESSKPSYTQEPDVRKKPNKKTTNPLNIVGDIVEKPEKQIDPILINRPSSKDKEPEKPTTSLTRTIHQKKPPSITAITINVEKCLAEWCTIDTLLFIYGEENVKKMLSNKGQFITDYLNNYSKSIFYTSNTYDQYQALCRKLNLLELESRRQDAQILNKETRPLPDYSILKEESKKIQFKVRAFFAGEIEIPEPEEPTEVDASNEHDNSTVLPLVDKNSQNALRRKIVCQHLNKVLPDLLRSLGLLNLTISSDIRLLVNTFKLKADNIMFKPIQWTLIALVFIKLLSIRDEQLKGLLEHETAFKHMQLLLLSYNQDGGYLDRLISWLTDVNRLLDVNDNQMTIEKNM-