Monarch geneset OGS2.0

DPOGS203041
TranscriptDPOGS203041-TA963 bp
ProteinDPOGS203041-PA320 aa
Genomic positionDPSCF300206 - 44522-46525
RNAseq coverage555x (Rank: top 23%)
Annotation
HeliconiusHMEL0161481e-2650.68% 
BombyxBGIBMGA006541-TA3e-5668.35% 
DrosophilaCG6961-PA2e-2252.81% 
EBI UniRef50UniRef50_Q0ZAL54e-7252.20%Polymerase delta interacting protein 3 n=2 Tax=Obtectomera RepID=Q0ZAL5_BOMMO
NCBI RefSeqNP_001037640.17e-7352.20%polymerase delta interacting protein 3 [Bombyx mori]
NCBI nr blastpgi|1129826571e-7152.20%polymerase delta interacting protein 3 [Bombyx mori]
NCBI nr blastxgi|1129826576e-8253.08%polymerase delta interacting protein 3 [Bombyx mori]
Group
Gene OntologyGO:00001669.5e-12nucleotide binding
GO:00036765.3e-10nucleic acid binding
KEGG pathwaydgr:Dgri_GH185943e-07 
 K12881 (THOC4, ALY)maps-> Spliceosome
InterPro domain[166-300] IPR0126779.5e-12Nucleotide-binding, alpha-beta plait
[224-290] IPR0005045.3e-10RNA recognition motif domain
Orthology groupMCL16848 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203041-TA
ATGGCTTCATATGTTAATATGAGTCTAGATGATATAATACAAAAACAGAAAAAGGATTCTGTTAGTAGGAACACATTTAAAAATAAAAAACAATTACCAGTGAAAAATAAAACTATCATGGATGCAAGAAACAAAATAATTTCTAAAAAACGCACTCAAATCACAGATGCCAGAGAAAAATTAGGCGAACTTGCCAAGCAAAAAGATGCACGTTTAAGATTGGAGCAATTAAGAGCGAAAAGGGCCGTGACCAAGATGCAGGTGTTTACAGATCCATCAAGACAAGTAGCTTATCAATCCCTCACACCAAAGTTTGTGAAAAAGAATGTCTTTAATAAAAGATTTGTAGGAGAGAGACCAAATGAACCAAGAAATTTTGCTGATGTAAAATGCAGTTACAAACCTATTATAAGGACTGTCGAAAATGATATTGCAATAATTGATGATCCACTTGAGAAAATGTGTGAGCCAATAGGAACATCACTAACAAATAGGAAAGCGAGTCTGCAATTGAAAATAGTCACACATAATAGTGATGCGCACAGGACAGCTACAATTGAGAAAGAGAAAAGCCCACCGAGACCTCCGTCTATATTAAAGAAAAGGCCGATGGCTGCCTTGCGTACTGAAAATAAAGTTGAAAAGCCTGAAAAATCTAATCACGAGTACAGAATCATTGTTAGCAATTTAAGAAACTCTGTAACTGGAGGTGATATTGAGGAATTGTTTGGAGATGTTGGCGGTATGGTAGAATCTCGACTAGTAAGACCTGGCACAGCAGAAGTTATATACAAAACTGTTCAAGACGCCCAAAAAGCTGTGGAACTCTACCACAACAGACAGTTAGATGGTCAGCCCATGACATGTCTCCTTGTCACACCACGACCTAACACTGAACACAGGCAAACCAGCATTGTACAGCACCAACTCAAATGTTGTTCCAGACATATCTACCTTCCATAA

Protein sequence:

>DPOGS203041-PA
MASYVNMSLDDIIQKQKKDSVSRNTFKNKKQLPVKNKTIMDARNKIISKKRTQITDAREKLGELAKQKDARLRLEQLRAKRAVTKMQVFTDPSRQVAYQSLTPKFVKKNVFNKRFVGERPNEPRNFADVKCSYKPIIRTVENDIAIIDDPLEKMCEPIGTSLTNRKASLQLKIVTHNSDAHRTATIEKEKSPPRPPSILKKRPMAALRTENKVEKPEKSNHEYRIIVSNLRNSVTGGDIEELFGDVGGMVESRLVRPGTAEVIYKTVQDAQKAVELYHNRQLDGQPMTCLLVTPRPNTEHRQTSIVQHQLKCCSRHIYLP-