Monarch geneset OGS2.0

DPOGS213448
Transcript        DPOGS213448-TA  1215 bp
Protein           DPOGS213448-PA  404 aa
Genomic position  DPSCF300745 - 6485-8912
RNAseq coverage   12x (Rank: top 83%)
Annotation
Heliconius      HMEL013440        9e-117  50.88%
Bombyx          BGIBMGA008779-TA  2e-178  79.95%
Drosophila      CG5931-PA         2e-162  60.89%
EBI UniRef50    UniRef50_B4ITW8   1e-160  60.89%  GE23129 n=1 Tax=Drosophila yakuba RepID=B4ITW8_DROYA
NCBI RefSeq     XP_970554.1       4e-165  63.53%  PREDICTED: similar to pre-mRNA-splicing helicase BRR2 [Tribolium castaneum]
NCBI nr blastp  gi|270002717      4e-164  62.91%  hypothetical protein TcasGA2_TC016163 [Tribolium castaneum]
NCBI nr blastx  gi|270002717      1e-156  62.91%  hypothetical protein TcasGA2_TC016163 [Tribolium castaneum]
Group
KEGG pathway     tca:659129  1e-164
                 K12854 (SNRNP200, BRR2) maps-> Spliceosome
InterPro domain  [75-393]  IPR004179  5.5e-132  Sec63 domain
Orthology group
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213448-TA
ATGCTGGGCCGGGCCTGCAGACCTCTGGAAGACGAGCACGCAGTCGCAGTACTCATGTGCGCTCAGCATCACAAGACGTTCTTCACGAAGCTTCTCAACGACTGCCTACCCTTAGAGGTATCTACTCCACATACACATCTCTCCGATCACCTGTCAGAGCTGGTCGAGTCAACCCTATCAGATCTGGAACAGTCGAAGTGTATAGCCATCGAGGATGATATGGACCTGCAGCCTCTCAACCTTGGAATGATAGCCTCCTACTACTACATAAACTACACTACTATAGAACTGTTCAGCTTGTCTCTGACTTCAAAGACGAAGATCCGAGGTCTTCTGGAGATCATATCTTCAGCGGCGGAGTACTCCGAGCTGAGCGTCAGGCATAGAGAGGAGAACGTCATCAAGACGCTCGCTGCCAAGGTGCCTCACAAATCGTCATCGCCGACAGTCCGCTACAACTCCCCCCACGTGAAGGCGCACGTGTTATTACAAGCTCATCTCTCAAGGATGCAACTTCCGGCGGAACTGCAGGCTGATACCGCCATTGTGCTCACAAAGGCTATACGTTTGATCCAGGCGTGTGTGGACGTGGTTTCCAGCAGTGGCTGGCTATCCCCAGCTGTTGCGGCTATGGAACTAGCGCAAATGGTGACCCAGGCTATGTGGGCGAAGGACTCCTACCTCAAACAGCTGCCTCACTTCACCCCGGAACTGCTCCAACGGTGTTCCGAACGCGGGGTTGATACGGTCTTCGATGTCATGGAACTCGAGGATTCAGCTCGCACGGAGCTGCTTAGGCTCACGCCGACGGAAATGGCGGATGTGGCTAGATTCTGTAACAGATATCCGAATGTTGAGCTGAGTTATGAGGTGTTGGATAGTAGGAGAGTGAGGAGTGGTGGGCCGGTAGTGTTGAAGGTTACGTTGGAGAGGGAAGACGAGGTGACGGGGCCGGTCGCAGCGCCTAGATTCCCGCAGAAGAGGGAGGAAGGTTGGTGGGTGGTGGTTGGAGAGCCGAGGACCAACAGCCTGTTGTCCATCAAACGGGTTCAGCTTGGGCGATCAGCTACTTTGAAGCTGGACTGGCTAGCCGGCGCGCCGGGACGACACACCTACACTCTGTACTTCATGAGCGACGCTTACCTGGGAGCGGATCAGGAATACAAGTTCAATGTGGATGTGTCCGATGCCAGGTCACCGGATAATGCAGACTGA

Protein sequence:

>DPOGS213448-PA
MLGRACRPLEDEHAVAVLMCAQHHKTFFTKLLNDCLPLEVSTPHTHLSDHLSELVESTLSDLEQSKCIAIEDDMDLQPLNLGMIASYYYINYTTIELFSLSLTSKTKIRGLLEIISSAAEYSELSVRHREENVIKTLAAKVPHKSSSPTVRYNSPHVKAHVLLQAHLSRMQLPAELQADTAIVLTKAIRLIQACVDVVSSSGWLSPAVAAMELAQMVTQAMWAKDSYLKQLPHFTPELLQRCSERGVDTVFDVMELEDSARTELLRLTPTEMADVARFCNRYPNVELSYEVLDSRRVRSGGPVVLKVTLEREDEVTGPVAAPRFPQKREEGWWVVVGEPRTNSLLSIKRVQLGRSATLKLDWLAGAPGRHTYTLYFMSDAYLGADQEYKFNVDVSDARSPDNAD-