Monarch geneset OGS2.0

DPOGS211156
TranscriptDPOGS211156-TA1881 bp
ProteinDPOGS211156-PA626 aa
Genomic positionDPSCF300007 + 44518-47616
RNAseq coverage164x (Rank: top 51%)
Annotation
HeliconiusHMEL0171950.096.44% 
BombyxBGIBMGA003142-TA0.089.48% 
DrosophilaCG6197-PA0.075.24% 
EBI UniRef50UniRef50_A1Z9G20.075.24%CG6197 n=36 Tax=Metazoa RepID=A1Z9G2_DROME
NCBI RefSeqXP_968085.10.082.45%PREDICTED: similar to XPA-binding protein 2 [Tribolium castaneum]
NCBI nr blastpgi|910925440.082.45%PREDICTED: similar to XPA-binding protein 2 [Tribolium castaneum]
NCBI nr blastxgi|910925440.082.45%PREDICTED: similar to XPA-binding protein 2 [Tribolium castaneum]
Group
Gene OntologyGO:00054884.2e-14binding
KEGG pathwaytca:6564620.0 
 K12867 (SYF1, XAB2)maps-> Spliceosome
InterPro domain[346-427] IPR0119904.2e-14Tetratricopeptide-like helical
Orthology groupMCL16069 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211156-TA
ATGCCTGTACTTGATGGGAAAGAAATAGACATATTTTTTAGTGAGGAGGACCTACCATATGAAGAAGAAATATTAAGGAATCCATTTTCGGTTAGGCATTGGCTAAGATACATAGAACATAAAAAAGCAGCTCCCAAATATGAAATTAATATCATTTATGAAAGAGCCCTGAAAGAACTACCAGGATCATTTAAGCTGTGGTACAACTACTTAAAACTTAGGAGAAAACAAATAAGAGGTCGATGTATAACAGATCCTGCGTATGAAGATGTTAATAATTGTTTTGAAAGATCCCTCGTATTTATGCACAAAATGCCTAGAATATGGATGGACTATTGTACATTTTTGACTGATCAGTGGAAAATTACCGCTACAAGAAAAGCATTTGATTCTGCTTTACGGGCTTTACCAATTACACAGCATCACAGAATATGGCCCCTCTATTTAAATTTCTTGAAAAAGCATAATATTCCAGAAACTGCTGTTAGGGTATTCAGACGCTACCTAAAGTTGTGTCCCGAAGATACTGAAGAATATATTGATTATTTAATATCTATAGAAAAATTAGATGAAGCTGCTTTAAAATTAGCTCAACTTGTAAACAATGAGAATTTTCAATCCAAACATGGAAAATCCAACCACCAGCTTTGGAATGAATTGTGTGAACTAATATCCAAAAACCCAGACAAAATTCATTCACTTAATGTTGATGCCATAATAAGAGGTGGACTTCGTCGCTACACTGACCAGCTGGGTCATCTGTGGAATTCACTAGCTGATTACTATGTTAGAAGTGGGTTATTTGAGAGAGCCAGAGATATATATGAGGAAGCCATTCAGACTGTCACAACAGTAAGAGATTTCACCCAAGTCTTTGATGCCTATGCTCAATTTGAAGAGTTGAGTTTGAGTAAAAAGATGGAAGAAGTTGCAAAGAAACCCAACCCCACTGAGGATGAAGATATTGATTTAGAATTACGTCTTGCTAGGTTTGAATATTTGATGGAAAGAAGATTGTTACTGCTAAATTCAGTACTATTAAGACAAAATCCACATAATATTGCTGAATGGCACAAAAGAGTAAAGCTCTATGAAGGTAAACCTCATGAAATCATAGATACATATACAGAAGCTGTGCAGACAGTAGATCCAAAATTAGCGGTAGGAAAACTTTATACACTGTGGGTTGGTTTTGCAAAATTTTATGAGAGCAATGACCAAATTGATGATGCAAGGTTAATTTTTGAGAAAGCGACCCAAGCTGCAGAAATATATGGCGTACCCAAAACGCGACAAATATATGAAAAAGCAATCGAGACTCTACCAGATGAAAAAGCTAGAGAGATGTGCTTGCGATTTTCGGAAATGGAAACGAAACTTGGGGAGATCGACAGAGCTCGTGCCATATACGCTCACTGTAGTCAGATGTGCGATCCAAGGATTACGACAGAATTCTGGAATACGTGGAAAGAATTTGAAGTAAGGCATGGTAATGAAGATACTATGAGGGAAATGCTTAGAATTAAGAGAAGTGTACAAGCTACTTATAACACGCAAGTCAATATGATGTCAGCCCAAATGCTAGGCTCAGCTGCTCAGGCTGCGGGTACAATATCGGATCTTGCACCCGGAATGAAGGACGGCATGAGATTGTTGGAGGCTAAAGCTGCCGAAATGGCTGTCCAAAGCAAGGGCAATATATTGTTCGTCAGAGGTGAAACACAAGGTCTCAAAGAAAACGATAAAGTTGTTAATCCTGATGAAATTGATATTGATGACGAAGAATCTGATAATAGTAATGATGACGAGGAAGTTGCACCTGTACAGAAAAAGGAAATTCCTGCAGCAGTGTTTGGAGGCTTGGTTCCAGAAAATCAATAA

Protein sequence:

>DPOGS211156-PA
MPVLDGKEIDIFFSEEDLPYEEEILRNPFSVRHWLRYIEHKKAAPKYEINIIYERALKELPGSFKLWYNYLKLRRKQIRGRCITDPAYEDVNNCFERSLVFMHKMPRIWMDYCTFLTDQWKITATRKAFDSALRALPITQHHRIWPLYLNFLKKHNIPETAVRVFRRYLKLCPEDTEEYIDYLISIEKLDEAALKLAQLVNNENFQSKHGKSNHQLWNELCELISKNPDKIHSLNVDAIIRGGLRRYTDQLGHLWNSLADYYVRSGLFERARDIYEEAIQTVTTVRDFTQVFDAYAQFEELSLSKKMEEVAKKPNPTEDEDIDLELRLARFEYLMERRLLLLNSVLLRQNPHNIAEWHKRVKLYEGKPHEIIDTYTEAVQTVDPKLAVGKLYTLWVGFAKFYESNDQIDDARLIFEKATQAAEIYGVPKTRQIYEKAIETLPDEKAREMCLRFSEMETKLGEIDRARAIYAHCSQMCDPRITTEFWNTWKEFEVRHGNEDTMREMLRIKRSVQATYNTQVNMMSAQMLGSAAQAAGTISDLAPGMKDGMRLLEAKAAEMAVQSKGNILFVRGETQGLKENDKVVNPDEIDIDDEESDNSNDDEEVAPVQKKEIPAAVFGGLVPENQ-